TY - GEN
T1 - Combining Deep Deterministic Policy Gradient with Cross-Entropy Method
AU - Lai, Tung Yi
AU - Hsueh, Chu Hsuan
AU - Lin, You Hsuan
AU - Chu, Yeong Jia Roger
AU - Hsueh, Bo Yang
AU - Wu, I. Chen
PY - 2019/11
Y1 - 2019/11
N2 - This paper proposes a deep reinforcement learning algorithm for solving robotic tasks, such as grasping objects. We propose in this paper a combination of cross-entropy optimization (CE) with deep deterministic policy gradient (DDPG). More specifically, where in the CE method, we first sample from a Gaussian distribution with zero as its initial mean, we now set the initial mean to DDPG's output instead. The resulting algorithm is referred to as the DDPG-CE method. Next, to negate the effects of bad samples, we improve on DDPG-CE by substituting the CE component with a weighted CE method, resulting in the DDPG-WCE algorithm. Experiments show that DDPG-WCE achieves a higher success rate on grasping previously unseen objects, than other approaches, such as supervised learning, DDPG, CE, and DDPG-CE.
AB - This paper proposes a deep reinforcement learning algorithm for solving robotic tasks, such as grasping objects. We propose in this paper a combination of cross-entropy optimization (CE) with deep deterministic policy gradient (DDPG). More specifically, where in the CE method, we first sample from a Gaussian distribution with zero as its initial mean, we now set the initial mean to DDPG's output instead. The resulting algorithm is referred to as the DDPG-CE method. Next, to negate the effects of bad samples, we improve on DDPG-CE by substituting the CE component with a weighted CE method, resulting in the DDPG-WCE algorithm. Experiments show that DDPG-WCE achieves a higher success rate on grasping previously unseen objects, than other approaches, such as supervised learning, DDPG, CE, and DDPG-CE.
KW - cross-entropy method
KW - deep deterministic policy gradient
KW - object grasping
KW - reinforcement learning
KW - robotics
UR - http://www.scopus.com/inward/record.url?scp=85079042483&partnerID=8YFLogxK
U2 - 10.1109/TAAI48200.2019.8959942
DO - 10.1109/TAAI48200.2019.8959942
M3 - Conference contribution
AN - SCOPUS:85079042483
T3 - Proceedings - 2019 International Conference on Technologies and Applications of Artificial Intelligence, TAAI 2019
BT - Proceedings - 2019 International Conference on Technologies and Applications of Artificial Intelligence, TAAI 2019
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 24th International Conference on Technologies and Applications of Artificial Intelligence, TAAI 2019
Y2 - 21 November 2019 through 23 November 2019
ER -