Collaborative Partially-Observable Reinforcement Learning Using Wireless Communications

Eisaku Ko, Kwang Cheng Chen, Shao Yu Lien

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

3 Scopus citations

Abstract

Each robot utilizes the reinforcement learning (RL) to control its maneuver and these robots can collaborate to accomplish a common goal to form a collaborative multi-agent system (MAS). Due to the constraints of distributive locations and different poses of robots, in practice, each agent (robot) in such a collaborative MAS can only partially observe the environment and other agents (such as competitive agents), and consequently operate based on its belief of the state(s). The alignment of the beliefs of collaborative agents can be therefore enhanced by adopting wireless communications, but is rarely studied in literature. To explore wireless communications applied to collaborative partially-observable reinforcement learning (PORL), we propose that each collaborative agent predicts the environment dynamics, including the behavior of those agents outside the collaborative MAS, and then constructs the learning-based belief of the world (i.e. global state). To assist such prediction and learning, we modify the RL assisted by the wireless communication functionality into two stages: prediction of the state and local actor-and-critic on global value(s). In other words, while one agent predicts and learns its own policy, another agent can updates critics on the sequence of history to update global value(s) that can further assist to validate the prediction. From numerical experiments, we find that the timing of communication or information exchange among collaborative agents has critical impact on the duration of learning and prediction, and thus the performance of MAS, which suggests the desirable communication for distributed PORL among collaborative agents toward an efficient MAS.

Original languageEnglish
Title of host publicationICC 2021 - IEEE International Conference on Communications, Proceedings
PublisherInstitute of Electrical and Electronics Engineers Inc.
ISBN (Electronic)9781728171227
DOIs
StatePublished - Jun 2021
Event2021 IEEE International Conference on Communications, ICC 2021 - Virtual, Online, Canada
Duration: 14 Jun 202123 Jun 2021

Publication series

NameIEEE International Conference on Communications
ISSN (Print)1550-3607

Conference

Conference2021 IEEE International Conference on Communications, ICC 2021
Country/TerritoryCanada
CityVirtual, Online
Period14/06/2123/06/21

Keywords

  • artificial intelligence
  • collaborative robots
  • Hidden Markov Model
  • machine learning
  • MAS
  • RL
  • wireless communications

Fingerprint

Dive into the research topics of 'Collaborative Partially-Observable Reinforcement Learning Using Wireless Communications'. Together they form a unique fingerprint.

Cite this