Deep Reinforcement Learning for MEC Streaming with Joint User Association and Resource Management

Po Yu Chou, Wei Yu Chen, Chih Yu Wang, Ren Hung Hwang, Wen Tsuen Chen

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

18 Scopus citations

Abstract

Mobile Edge Computing (MEC) is a promising technique in the 5G Era to improve the Quality of Experience (QoE) for online video streaming due to its ability to reduce the backhaul transmission by caching certain content. However, it still takes effort to address the user association and video quality selection problem under the limited resource of MEC to fully support the low-latency demand for live video streaming. We found the optimization problem to be a non-linear integer programming, which is impossible to obtain a globally optimal solution under polynomial time. In this paper, we first reformulate this problem as a Markov Decision Process (MDP) and develop a Deep Deterministic Policy Gradient (DDPG) based algorithm exploiting the supply-demand interpretation of the Lagrange dual problem. Simulation results show that our proposed approach achieves significant QoE improvement especially in the low wireless resource and high user number scenario compared to other baselines.

Original languageEnglish
Title of host publication2020 IEEE International Conference on Communications, ICC 2020 - Proceedings
PublisherInstitute of Electrical and Electronics Engineers Inc.
ISBN (Electronic)9781728150895
DOIs
StatePublished - Jun 2020
Event2020 IEEE International Conference on Communications, ICC 2020 - Dublin, Ireland
Duration: 7 Jun 202011 Jun 2020

Publication series

NameIEEE International Conference on Communications
Volume2020-June
ISSN (Print)1550-3607

Conference

Conference2020 IEEE International Conference on Communications, ICC 2020
Country/TerritoryIreland
CityDublin
Period7/06/2011/06/20

Fingerprint

Dive into the research topics of 'Deep Reinforcement Learning for MEC Streaming with Joint User Association and Resource Management'. Together they form a unique fingerprint.

Cite this