Collaborative Partially-Observable Reinforcement Learning Using Wireless Communications

Eisaku Ko, Kwang Cheng Chen, Shao Yu Lien

Research output: Conference contribution (peer-reviewed)

3 Citations (Scopus)

Abstract

Each robot uses reinforcement learning (RL) to control its maneuvers, and robots can collaborate toward a common goal, forming a collaborative multi-agent system (MAS). In practice, because the robots are at distributed locations and have different poses, each agent (robot) in such a collaborative MAS can only partially observe the environment and other agents (such as competitive agents), and consequently operates based on its belief of the state(s). The alignment of the beliefs of collaborative agents can therefore be enhanced by adopting wireless communications, but this is rarely studied in the literature. To explore wireless communications applied to collaborative partially-observable reinforcement learning (PORL), we propose that each collaborative agent predicts the environment dynamics, including the behavior of agents outside the collaborative MAS, and then constructs a learning-based belief of the world (i.e., the global state). To assist such prediction and learning, we divide RL assisted by the wireless communication functionality into two stages: prediction of the state, and local actor-and-critic on global value(s). In other words, while one agent predicts and learns its own policy, another agent can update critics on the sequence of history to update the global value(s), which can in turn help validate the prediction. From numerical experiments, we find that the timing of communication or information exchange among collaborative agents has a critical impact on the duration of learning and prediction, and thus on the performance of the MAS, which suggests desirable communication for distributed PORL among collaborative agents toward an efficient MAS.
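The idea that a communicated observation can tighten an agent's belief over a partially observed state can be illustrated with a small discrete Bayes filter. This is only a sketch under stated assumptions, not the method of the paper: the five-cell world, the sensor model `observation_likelihood`, the `accuracy` value, and the assumption that the two agents' readings are conditionally independent given the true state are all hypothetical choices made for illustration.

```python
def observation_likelihood(observed_cell, n_cells, accuracy):
    """Hypothetical sensor model: the reading matches the true cell with
    probability `accuracy`; otherwise it is uniform over the other cells."""
    off = (1.0 - accuracy) / (n_cells - 1)
    return [accuracy if c == observed_cell else off for c in range(n_cells)]


def bayes_update(belief, likelihood):
    """Discrete Bayes rule: posterior is proportional to prior times likelihood."""
    unnormalized = [b * l for b, l in zip(belief, likelihood)]
    total = sum(unnormalized)
    return [u / total for u in unnormalized]


n_cells = 5
prior = [1.0 / n_cells] * n_cells  # uniform belief over the hidden state

# Agent A updates its belief from its own noisy reading of cell 2.
own_reading = observation_likelihood(2, n_cells, accuracy=0.6)
belief_local = bayes_update(prior, own_reading)

# Agent A then folds in a reading received from agent B over the wireless
# link (assumed conditionally independent of A's own reading).
shared_reading = observation_likelihood(2, n_cells, accuracy=0.6)
belief_shared = bayes_update(belief_local, shared_reading)

print(round(belief_local[2], 3), round(belief_shared[2], 3))
```

In this toy setting the probability assigned to cell 2 rises from 0.6 with the local reading alone to 0.9 once the shared reading is included, which is the kind of belief alignment that information exchange is meant to provide. When the exchange happens relative to learning, which the abstract identifies as critical, is not modeled here.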

Original language: English
Title of host publication: ICC 2021 - IEEE International Conference on Communications, Proceedings
Publisher: Institute of Electrical and Electronics Engineers Inc.
ISBN (Electronic): 9781728171227
Publication status: Published - Jun 2021
Event: 2021 IEEE International Conference on Communications, ICC 2021 - Virtual, Online, Canada
Duration: 14 Jun 2021 to 23 Jun 2021

Publication series

Name: IEEE International Conference on Communications
ISSN (Print): 1550-3607

Conference

Conference: 2021 IEEE International Conference on Communications, ICC 2021
Country/Territory: Canada
City: Virtual, Online
Period: 14/06/21 to 23/06/21
