TY - JOUR
T1 - Deep Reinforcement Learning for Multi-User Access Control in Non-Terrestrial Networks
AU - Cao, Yang
AU - Lien, Shao Yu
AU - Liang, Ying Chang
N1 - Publisher Copyright:
© 1972-2012 IEEE.
PY - 2021/3
Y1 - 2021/3
N2 - Non-Terrestrial Networks (NTNs) composed of space-borne (e.g., satellites) and airborne vehicles (e.g., drones and blimps) have recently been proposed by 3GPP as a new paradigm of infrastructures to enhance the capacity and coverage of existing terrestrial wireless networks. The mobility of non-Terrestrial base stations (NT-BSs) however leads to a dynamic environment, which imposes unique challenges for handover and throughput optimization particularly in multi-user access control for NTNs. To achieve performance optimization, each terrestrial user equipment (UE) should autonomously estimate the dynamics of moving NT-BSs, which is different from the existing user access control schemes in terrestrial wireless networks. Consequently, new learning schemes for optimum multi-user access control are desired. In this article, we therefore propose a UE-driven deep reinforcement learning (DRL) based scheme, in which a centralized agent deployed at the backhaul side of NT-BSs is responsible for training the parameter of a deep Q-network (DQN), and each UE independently makes its own access decisions based on the parameter from the trained DQN. With the proposed scheme, each UE is able to access a proper NT-BS intelligently to enhance the long-Term system throughput and avoid frequent handovers among NT-BSs. Through comprehensive simulation studies, we justify the performance of the proposed scheme, and show its effectiveness in addressing the fundamental issues in the NTNs deployment.
AB - Non-Terrestrial Networks (NTNs) composed of space-borne (e.g., satellites) and airborne vehicles (e.g., drones and blimps) have recently been proposed by 3GPP as a new paradigm of infrastructures to enhance the capacity and coverage of existing terrestrial wireless networks. The mobility of non-Terrestrial base stations (NT-BSs) however leads to a dynamic environment, which imposes unique challenges for handover and throughput optimization particularly in multi-user access control for NTNs. To achieve performance optimization, each terrestrial user equipment (UE) should autonomously estimate the dynamics of moving NT-BSs, which is different from the existing user access control schemes in terrestrial wireless networks. Consequently, new learning schemes for optimum multi-user access control are desired. In this article, we therefore propose a UE-driven deep reinforcement learning (DRL) based scheme, in which a centralized agent deployed at the backhaul side of NT-BSs is responsible for training the parameter of a deep Q-network (DQN), and each UE independently makes its own access decisions based on the parameter from the trained DQN. With the proposed scheme, each UE is able to access a proper NT-BS intelligently to enhance the long-Term system throughput and avoid frequent handovers among NT-BSs. Through comprehensive simulation studies, we justify the performance of the proposed scheme, and show its effectiveness in addressing the fundamental issues in the NTNs deployment.
KW - deep reinforcement learning (DRL)
KW - handovers
KW - multi-user access control
KW - Non-Terrestrial networks (NTNs)
KW - UE-driven scheme
UR - http://www.scopus.com/inward/record.url?scp=85097373420&partnerID=8YFLogxK
U2 - 10.1109/TCOMM.2020.3041347
DO - 10.1109/TCOMM.2020.3041347
M3 - Article
AN - SCOPUS:85097373420
SN - 0090-6778
VL - 69
SP - 1605
EP - 1619
JO - IEEE Transactions on Communications
JF - IEEE Transactions on Communications
IS - 3
M1 - 9273081
ER -