Effects of Reward Terms in Agent-Based Box-Manipulation Animation Using Deep Reinforcement Learning

Hsiang Yu Yang, Chien Chou Wong, Sai-Keung Wong

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

1 Scopus citations

Abstract

Agent-based box manipulation has wide applications in computer animation and robotics. Deep reinforcement learning can be applied to generate animations of agent-based box manipulation. This paper focuses on push-manipulation in an agent-based animation. A policy is learned in a learning session in which an agent receives a reward that is a combination of different types of reward terms. Based on the received reward, the policy is improved gradually. In this paper, we investigate the effects of each reward term in-depth in a framework that is integrated with deep reinforcement learning. We also propose a simple way to produce different animation types. We performed several examples and analyzed our findings in details.

Original languageEnglish
Title of host publicationProceedings - 2019 International Conference on Technologies and Applications of Artificial Intelligence, TAAI 2019
PublisherInstitute of Electrical and Electronics Engineers Inc.
Number of pages6
ISBN (Electronic)9781728146669
DOIs
StatePublished - 21 Nov 2019
Event24th International Conference on Technologies and Applications of Artificial Intelligence, TAAI 2019 - Kaohsiung, Taiwan
Duration: 21 Nov 201923 Nov 2019

Publication series

NameProceedings - 2019 International Conference on Technologies and Applications of Artificial Intelligence, TAAI 2019

Conference

Conference24th International Conference on Technologies and Applications of Artificial Intelligence, TAAI 2019
Country/TerritoryTaiwan
CityKaohsiung
Period21/11/1923/11/19

Keywords

  • agent-based
  • animation
  • box manipulation
  • reinforcement learning

Fingerprint

Dive into the research topics of 'Effects of Reward Terms in Agent-Based Box-Manipulation Animation Using Deep Reinforcement Learning'. Together they form a unique fingerprint.

Cite this