TY - JOUR
T1 - Multi-Tier Deep Reinforcement Learning for Non-Terrestrial Networks
AU - Cao, Yang
AU - Lien, Shao Yu
AU - Liang, Ying Chang
AU - Niyato, Dusit
N1 - Publisher Copyright:
© 2002-2012 IEEE.
PY - 2024/6/1
Y1 - 2024/6/1
N2 - To provide global coverage and ubiquitous wireless services, non-terrestrial networks (NTNs) composed of space-tier, air-tier, and ground-tier stations, have been regarded as a key enabling technology toward the sixth generation (6G) networks. Such a multi-tier architecture, however, induces fundamental challenges in optimizing the overall system performance of the NTN in terms of signaling overheads, complexity, and even analytical formulation of the optimization problem. Interestingly, deep reinforcement learning (DRL) is an effective approach to tackle 'complex' optimization problems. When applied to the NTN, however, different DRL methods should be developed for various stations due to different constraints in each tier, which raises fundamental issues involving the interaction, orchestration, and protocol design of multi-tier DRL architecture. In this article, we provide the essential principles for multi-tier DRL for the NTN, and propose a generalized multi-tier architecture to embrace different deployment scenarios of NTNs. A case study of joint optimization of spectrum chunk allocation, trajectory design, and user association for space-tier, air-tier, and ground-tier stations is provided. The simulation results indicate that different trade-offs between the overall throughput, optimization dimension, and computational complexity of stations can be achieved through configuring the proposed architectures.
AB - To provide global coverage and ubiquitous wireless services, non-terrestrial networks (NTNs) composed of space-tier, air-tier, and ground-tier stations, have been regarded as a key enabling technology toward the sixth generation (6G) networks. Such a multi-tier architecture, however, induces fundamental challenges in optimizing the overall system performance of the NTN in terms of signaling overheads, complexity, and even analytical formulation of the optimization problem. Interestingly, deep reinforcement learning (DRL) is an effective approach to tackle 'complex' optimization problems. When applied to the NTN, however, different DRL methods should be developed for various stations due to different constraints in each tier, which raises fundamental issues involving the interaction, orchestration, and protocol design of multi-tier DRL architecture. In this article, we provide the essential principles for multi-tier DRL for the NTN, and propose a generalized multi-tier architecture to embrace different deployment scenarios of NTNs. A case study of joint optimization of spectrum chunk allocation, trajectory design, and user association for space-tier, air-tier, and ground-tier stations is provided. The simulation results indicate that different trade-offs between the overall throughput, optimization dimension, and computational complexity of stations can be achieved through configuring the proposed architectures.
UR - http://www.scopus.com/inward/record.url?scp=85184339081&partnerID=8YFLogxK
U2 - 10.1109/MWC.018.2200429
DO - 10.1109/MWC.018.2200429
M3 - Article
AN - SCOPUS:85184339081
SN - 1536-1284
VL - 31
SP - 194
EP - 201
JO - IEEE Wireless Communications
JF - IEEE Wireless Communications
IS - 3
ER -