Collaborative Deep Reinforcement Learning for Resource Optimization in Non-Terrestrial Networks

Yang Cao*, Shao Yu Lien, Ying Chang Liang, Dusit Niyato, Xuemin Sherman Shen

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

4 Scopus citations

Abstract

Non-terrestrial networks (NTNs) with low-earth orbit (LEO) satellites have been regarded as promising remedies to support global ubiquitous wireless services. Due to the rapid mobility of LEO satellite, inter-beam/satellite handovers happen frequently for a specific user equipment (UE). To tackle this issue, earth-fixed cell scenarios have been under studied, in which the LEO satellite adjusts its beam direction towards a fixed area within its dwell duration, to maintain stable transmission performance for the UE. Therefore, it is required that the LEO satellite performs real-time resource allocation, which however is unaffordable by the LEO satellite with limited computing capability. To address this issue, in this paper, we propose a two-time-scale collaborative deep reinforcement learning (DRL) scheme for beam management and resource allocation in NTNs, in which LEO satellite and UE with different control cycles update their decision-making policies through a sequential manner. Specifically, UE updates its policy subject to improving the value functions of both the agents. Furthermore, the LEO satellite only makes decisions through finite-step rollouts with a reference decision trajectory received from the UE. Simulation results show that the proposed scheme can effectively balance the throughput performance and computational complexity over traditional greedy-searching schemes.

Original languageEnglish
Title of host publication2023 IEEE 34th Annual International Symposium on Personal, Indoor and Mobile Radio Communications
Subtitle of host publication6G The Next Horizon - From Connected People and Things to Connected Intelligence, PIMRC 2023
PublisherInstitute of Electrical and Electronics Engineers Inc.
ISBN (Electronic)9781665464833
DOIs
StatePublished - 2023
Event34th IEEE Annual International Symposium on Personal, Indoor and Mobile Radio Communications, PIMRC 2023 - Toronto, Canada
Duration: 5 Sep 20238 Sep 2023

Publication series

NameIEEE International Symposium on Personal, Indoor and Mobile Radio Communications, PIMRC

Conference

Conference34th IEEE Annual International Symposium on Personal, Indoor and Mobile Radio Communications, PIMRC 2023
Country/TerritoryCanada
CityToronto
Period5/09/238/09/23

Keywords

  • Non-terrestrial networks (NTNs)
  • deep reinforcement learning (DRL)
  • earth-fixed cell
  • multi-time-scale Markov decision process (MMDPs)
  • resource allocation

Fingerprint

Dive into the research topics of 'Collaborative Deep Reinforcement Learning for Resource Optimization in Non-Terrestrial Networks'. Together they form a unique fingerprint.

Cite this