Reinforcement Learning-Based Robust Resource Scheduling for Dynamic LEO Satellite Networks

Authors

DOI:

https://doi.org/10.4108/eetsis.10581

Keywords:

Low Earth Orbit satellite (LEO), NB-IoT, Robust Resource Scheduling, PPO

Abstract

INTRODUCTION: Low Earth Orbit (LEO) satellite communication extends Narrowband Internet of Things (NB-IoT) coverage for global 6G IoT services. However, long propagation delays and high mobility cause outdated channel state information (CSI), degrading system performance.                                                                                                                     

OBJECTIVES: This study aims to design a robust resource scheduling strategy that mitigates CSI outdatedness, improves resource utilization, and supports large-scale IoT connectivity in dynamic LEO satellite environments.                                  

METHODS: We propose a reinforcement learning–based robust scheduling framework. The Chebyshev inequality transforms probabilistic signal-to-noise ratio (SNR) constraints into deterministic bounds using statistical moments. A multi-objective optimization problem is formulated to maximize served terminals and minimize resource fragmentation. The Proximal Policy Optimization (PPO) algorithm enables intelligent allocation across network slices under dynamic conditions.                                                                                                                                                                                    

RESULTS: Simulation results demonstrate that the proposed approach achieves higher scheduling success rates and better resource utilization compared with baseline methods. The reinforcement learning agent adapts effectively to environmental variations, maintaining stable performance even under severe CSI outdatedness.                                                                      

CONCLUSION: The robust reinforcement learning–based scheduling provides an effective solution for NB-IoT over LEO satellites, enhancing reliability and scalability of future 6G global IoT networks.

References

[1] Hassebo A. The Road to 6G, Vision, Drivers, Trends, and Challenges. In: IEEE 12th Annual Computing and Communication Workshop and Conference (CCWC); 2022; Las Vegas, NV, USA. IEEE; 2022. p. 1112-1116.

[2] Iqbal M, Abdullah AYM, Shabnam F. An Application Based Comparative Study of LPWAN Technologies for IoT Environment. In: IEEE Region 10 Symposium (TENSYMP); 2020; Dhaka, Bangladesh. IEEE; 2020. p. 1857-1860.

[3] Azari MM, Solanki S, Chatzinotas S, Kodheli O, Sallouha H, Colpaert A, Mendoza Montoya JF, Pollin S, Haqiqatnejad A, Mostaani A, Lagunas E, Ottersten B. Evolution of Non-Terrestrial Networks From 5G to 6G: A Survey. IEEE Commun Surv Tutor. 2022; 24(4):2633-2672.

[4] Mangalvedhe N, Ratasuk R, Ghosh A. NB-IoT deployment study for low power wide area cellular IoT. In: IEEE 27th Annual International Symposium on Personal, Indoor, and Mobile Radio Communications (PIMRC); 2016; Valencia, Spain. IEEE; 2016. p. 1-6.

[5] Charbit G, Medles A, Jose P, et al. Satellite and Cellular Networks Integration - A System Overview. In: Joint European Conference on Networks and Communications & 6G Summit (EuCNC/6G Summit); 2021; Porto, Portugal. IEEE; 2021. p. 118-123.

[6] Azari MM, Solanki S, Chatzinotas S, et al. Evolution of non-terrestrial networks from 5G to 6G: A survey. IEEE Commun Surv Tutor. 2022; 24(4):2633-2672.

[7] Gou L, Bian D, Nie Y, Zhang G, Zhou H, Shi Y, Zhang L. Hierarchical Resource Management for Mega-LEO Satellite Constellation. Sensors. 2025; 25(3):902.

[8] Kodheli O, Lagunas E, Maturo N, et al. Satellite communications in the new space era: A survey and future challenges. IEEE Commun Surv Tutor. 2020; 23(1):70-109.

[9] Y. Liu, X. Xie, and Z. Wang. CSI feedback with model-driven deep learning of massive MIMO systems. IEEE Communications Letters, 2020; 24(3): 518-522.

[10] T. Zhang, W. Chen, F. Yang, et al. Letaief. Robust transmission design for intelligent reflecting surface-aided secure communication systems with imperfect cascaded CSI. IEEE Transactions on Wireless Communications. 2022; 21(4): 2487-2501.

[11] Kodheli O, Andrenacci S, Maturo N, Chatzinotas S, Zimmer F. Resource Allocation Approach for Differential Doppler Reduction in NB-IoT over LEO Satellite. In: Advanced Satellite Multimedia Systems Conference and the 15th Signal Processing for Space Communications Workshop (ASMS/SPSC); 2018; Berlin, Germany. IEEE; 2018. p. 1-8.

[12] Maity I, Chougrani H, Chatzinotas S. Fairness-Aware Inter-Slice Scheduler for IoT Services Over Satellite. IEEE Open J Commun Soc. 2023; 4:3040-3050.

[13] Huang C, Chen G, Xiao P, Xiao Y, Han Z, Chambers JA. Joint Offloading and Resource Allocation for Hybrid Cloud and Edge Computing in SAGINs: A Decision Assisted Hybrid Action Space Deep Reinforcement Learning Approach. IEEE J Sel Areas Commun. 2024; 42(5):1029-1043.

[14] Maity I, Chougrani H, Chatzinotas S. Fairness-Aware Inter-Slice Scheduler for IoT Services Over Satellite. IEEE Open J Commun Soc. 2023; 4: 3040-3050.

[15] Gou L, Bian D, Nie Y, Zhang G, Zhou H, Shi Y, Zhang L. Hierarchical Resource Management for Mega-LEO Satellite Constellation. Sensors. 2025; 25(3):902.

[16] Lin Z, Ni Z, Kuang L, Jiang C, Huang Z. Satellite-Terrestrial Coordinated Multi-Satellite Beam Hopping Scheduling Based on Multi-Agent Deep Reinforcement Learning. IEEE Trans Wirel Commun. 2024; 23(8):10091-10103.

[17] Chu J, Chen X, Zhong C, et al. Robust Design for NOMA-Based Multibeam LEO Satellite Internet of Things. IEEE Internet Things J. 2021; 8(3):1959-1970.

[18] Xu H, Chen X, Huang X, et al. Uncertainty-Aware Scheduling for Effective Data Collection from Environmental IoT Devices through LEO Satellites. Future Gener Comput Syst. 2025; 166(5): 107656-107672.

[19] Lee J, Park C, Park S, Molisch AF. Handover Protocol Learning for LEO Satellite Networks: Access Delay and Collision Minimization. IEEE Trans Wirel Commun. 2024; 23(7):7624-7637.

[20] J. Zhang, Y. Wei, E. Björnson, et al. Performance analysis and optimization of MIMO-NOMA downlink with imperfect CSI. IEEE Transactions on Communications. 2020; 68(5): 2982-2996.

[21] J. Schulman, F. Wolski, P. Dhariwal, et al. Proximal policy optimization algorithms. arXiv preprint arXiv:1707.06347, 2017.

[22] A. Guidotti, A. Vanelli-Coralli, M. Conti, et al. Architectures and key technical challenges for 5G systems incorporating satellites. IEEE Transactions on Vehicular Technology. 2019; 68(3): 2624-2639.

[23] Merias P. Study on Narrow-Band Internet of Things (NB-IoT)/enhanced Machine Type Communication (eMTC) support for Non-Terrestrial Networks (NTN). In: 3GPP TSG RAN, editor. Proceedings of the 3GPP TSG RAN Meeting RP-86; 2019 Dec 9-12; Sitges, Spain. Sophia Antipolis: 3GPP; 2019. (Doc. RP-193235).

[24] Lin Z, Lin M, Champagne B, et al. Robust Hybrid Beamforming for Satellite-Terrestrial Integrated Networks. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2020: 8792-8796

Downloads

Published

20-04-2026

Issue

Section

Scheduling optimization and load balancing in scalable distributed systems

How to Cite

1.
N.N. Song, Y. Z. Li, Y. K. Zhao, J. Li, W. Li. Reinforcement Learning-Based Robust Resource Scheduling for Dynamic LEO Satellite Networks. EAI Endorsed Scal Inf Syst [Internet]. 2026 Apr. 20 [cited 2026 Apr. 21];12(9). Available from: https://publications.eai.eu/index.php/sis/article/view/10581