TY - JOUR
T1 - Autonomous Non-Terrestrial Base Station Deployment for Non-Terrestrial Networks
T2 - A Reinforcement Learning Approach
AU - Lien, Shao Yu
AU - Deng, Der Jiunn
N1 - Publisher Copyright:
© 1967-2012 IEEE.
PY - 2022/10/1
Y1 - 2022/10/1
N2 - Non-Terrestrial Networks (NTN) consisting of multiple space-borne and airborne non-terrestrial base stations (NT-BSs) have recently been introduced by 3GPP as a new paradigm of infrastructure to extend the capacity and coverage of existing terrestrial networks to further support non-terrestrial UEs (NT-UEs). Mobility of NT-BSs and NT-UEs however leads to a dynamic and non-stationary environment, which creates unique challenges in the coverage optimization particularly for the dynamic deployment of multiple NT-BSs. Under the dynamic and non-stationary environment, each NT-BS should autonomously predict not only moving trajectories of NT-UEs and other NT-BSs but also the probability of presence of NT-UEs at any given location, and consequently a new machine learning (ML) scheme is desired. In this paper, instead of adopting the recent innovation of deep reinforcement learning (DRL) approaches inducing an unaffordable complexity for NT-BSs with a limited computing capability, new reinforcement learning (RL) schemes are therefore proposed, by which each of multiple NT-BSs autonomously determines the deployment trajectories to maximize the number of NT-UEs that can access NT-BSs. Through the comprehensive analysis, we justify the convergence of the performance of the proposed schemes. The simulation results also show the effectiveness of the proposed schemes and the outstanding performances as compared with state-of-the-art schemes.
AB - Non-Terrestrial Networks (NTN) consisting of multiple space-borne and airborne non-terrestrial base stations (NT-BSs) have recently been introduced by 3GPP as a new paradigm of infrastructure to extend the capacity and coverage of existing terrestrial networks to further support non-terrestrial UEs (NT-UEs). Mobility of NT-BSs and NT-UEs however leads to a dynamic and non-stationary environment, which creates unique challenges in the coverage optimization particularly for the dynamic deployment of multiple NT-BSs. Under the dynamic and non-stationary environment, each NT-BS should autonomously predict not only moving trajectories of NT-UEs and other NT-BSs but also the probability of presence of NT-UEs at any given location, and consequently a new machine learning (ML) scheme is desired. In this paper, instead of adopting the recent innovation of deep reinforcement learning (DRL) approaches inducing an unaffordable complexity for NT-BSs with a limited computing capability, new reinforcement learning (RL) schemes are therefore proposed, by which each of multiple NT-BSs autonomously determines the deployment trajectories to maximize the number of NT-UEs that can access NT-BSs. Through the comprehensive analysis, we justify the convergence of the performance of the proposed schemes. The simulation results also show the effectiveness of the proposed schemes and the outstanding performances as compared with state-of-the-art schemes.
KW - autonomous deployment
KW - coverage optimization
KW - multiple NT-BSs
KW - non-stationary environment
KW - NT-UEs
KW - NTN
KW - RL
UR - http://www.scopus.com/inward/record.url?scp=85140767607&partnerID=8YFLogxK
U2 - 10.1109/TVT.2022.3182908
DO - 10.1109/TVT.2022.3182908
M3 - Article
AN - SCOPUS:85140767607
SN - 0018-9545
VL - 71
SP - 10894
EP - 10909
JO - IEEE Transactions on Vehicular Technology
JF - IEEE Transactions on Vehicular Technology
IS - 10
ER -