TY - JOUR
T1 - Knowledge distillation for portfolio management using multi-agent reinforcement learning
AU - Chen, Min You
AU - Chen, Chiao Ting
AU - Huang, Szu Hao
N1 - Publisher Copyright:
© 2023 Elsevier Ltd
PY - 2023/8
Y1 - 2023/8
N2 - Many studies have employed reinforcement learning (RL) techniques to successfully create portfolio strategies in recent years. However, since financial markets are extremely noisy, past research has found it challenging to train a stable RL agent using historical data. In this work, we first apply a role-aware multi-agent system to model volatile security markets. Three major roles that are used in our system are presented, and while maximizing their own targets in the Taiwan stock exchange (TWSE) historical data, they also observe trading behavior and compete with other agents. To build a trading strategy, we construct a student–teacher framework in which multi-agent targeting distills the market information and a student RL model is taught using the distilled target. The results show that our method is capable of developing profitable strategies in a quickly changing financial market. In addition, our market distilling technique has the potential to develop a flexible asset allocation strategy by using different student networks.
AB - Many studies have employed reinforcement learning (RL) techniques to successfully create portfolio strategies in recent years. However, since financial markets are extremely noisy, past research has found it challenging to train a stable RL agent using historical data. In this work, we first apply a role-aware multi-agent system to model volatile security markets. Three major roles that are used in our system are presented, and while maximizing their own targets in the Taiwan stock exchange (TWSE) historical data, they also observe trading behavior and compete with other agents. To build a trading strategy, we construct a student–teacher framework in which multi-agent targeting distills the market information and a student RL model is taught using the distilled target. The results show that our method is capable of developing profitable strategies in a quickly changing financial market. In addition, our market distilling technique has the potential to develop a flexible asset allocation strategy by using different student networks.
KW - Artificial stock market
KW - Knowledge distillation
KW - Multi-agent reinforcement learning
KW - Portfolio management
UR - http://www.scopus.com/inward/record.url?scp=85165536376&partnerID=8YFLogxK
U2 - 10.1016/j.aei.2023.102096
DO - 10.1016/j.aei.2023.102096
M3 - Article
AN - SCOPUS:85165536376
SN - 1474-0346
VL - 57
JO - Advanced Engineering Informatics
JF - Advanced Engineering Informatics
M1 - 102096
ER -