Stochastic Temporal Difference Learning for Sequence Data

Jen Tzung Chien, Yi Chung Chiu

研究成果: Conference contribution同行評審

3 引文 斯高帕斯(Scopus)

摘要

Planning is crucial to train an agent via model-based reinforcement learning who can predict distant observations to reflect his/her past experience. Such a planning method is theoretically and computationally attractive in comparison with traditional learning which relies on step-by-step prediction. However, it is more challenging to build a learning machine which can predict and plan randomly across multiple time steps rather than act step by step. To reflect this flexibility in learning process, we need to predict future states directly without going through all intermediate states. Accordingly, this paper develops the stochastic temporal difference learning where the sequence data are represented with multiple jumpy states while the stochastic state space model is learned by maximizing the evidence lower bound of log likelihood of training data. A general solution with various number of jumpy states is developed and formulated. Experiments demonstrate the merit of the proposed sequential machine to find predictive states to roll forward with jumps as well as predict words.

原文English
主出版物標題IJCNN 2021 - International Joint Conference on Neural Networks, Proceedings
發行者Institute of Electrical and Electronics Engineers Inc.
ISBN(電子)9780738133669
DOIs
出版狀態Published - 18 7月 2021
事件2021 International Joint Conference on Neural Networks, IJCNN 2021 - Virtual, Shenzhen, 中國
持續時間: 18 7月 202122 7月 2021

出版系列

名字Proceedings of the International Joint Conference on Neural Networks
2021-July

Conference

Conference2021 International Joint Conference on Neural Networks, IJCNN 2021
國家/地區中國
城市Virtual, Shenzhen
期間18/07/2122/07/21

指紋

深入研究「Stochastic Temporal Difference Learning for Sequence Data」主題。共同形成了獨特的指紋。

引用此