Temporal difference learning for Connect6

I-Chen Wu*, Hsin Ti Tsai, Hung Hsuan Lin, Yi Shan Lin, Chieh Min Chang, Ping Hung Lin

*Corresponding author for this work

Research output: Conference contribution, peer-reviewed

7 Citations (Scopus)

Abstract

In this paper, we apply temporal difference (TD) learning to Connect6, and successfully use TD(0) to improve the strength of a Connect6 program, NCTU6. The program won several computer Connect6 tournaments and also many man-machine Connect6 tournaments from 2006 to 2011. From our experiments, the best improved version of TD learning achieves about a 58% win rate against the original NCTU6 program. This paper discusses three implementation issues that improve the program. The program has a convincing performance in removing winning/losing moves via threat-space search in TD learning.
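For readers unfamiliar with TD(0), the following is a minimal sketch of the general technique the abstract names, not the method used in NCTU6. It assumes a linear evaluation function over hand-crafted position features, an undiscounted game (gamma = 1), and a single reward at the end of the game. The names `value`, `td0_update`, and `train_on_game` are hypothetical and do not come from the paper.

```python
from typing import List, Sequence


def value(weights: Sequence[float], features: Sequence[float]) -> float:
    """Linear evaluation: dot product of weights and position features (assumed form)."""
    return sum(w * f for w, f in zip(weights, features))


def td0_update(
    weights: Sequence[float],
    features: Sequence[float],
    next_features: Sequence[float],
    reward: float,
    alpha: float = 0.1,
    gamma: float = 1.0,
    terminal: bool = False,
) -> List[float]:
    """One TD(0) step: move V(s) toward reward + gamma * V(s')."""
    v = value(weights, features)
    # A terminal position has no successor, so its bootstrap value is zero.
    v_next = 0.0 if terminal else value(weights, next_features)
    delta = reward + gamma * v_next - v  # TD error
    # For a linear evaluator the gradient of V(s) w.r.t. the weights is the feature vector.
    return [w + alpha * delta * f for w, f in zip(weights, features)]


def train_on_game(
    weights: Sequence[float],
    positions: Sequence[Sequence[float]],
    outcome: float,
    alpha: float = 0.1,
) -> List[float]:
    """Apply TD(0) along one game; only the final position receives the outcome as reward."""
    current = list(weights)
    for i, feats in enumerate(positions):
        last = i == len(positions) - 1
        nxt = feats if last else positions[i + 1]
        reward = outcome if last else 0.0
        current = td0_update(current, feats, nxt, reward, alpha, terminal=last)
    return current
```

Calling `train_on_game` repeatedly on the same game lets the final outcome propagate backward one position per pass, which is the bootstrapping behavior that distinguishes TD(0) from learning on final outcomes alone.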

Original language: English
Title of host publication: Advances in Computer Games - 13th International Conference, ACG 2011, Revised Selected Papers
Pages: 121-133
Number of pages: 13
Publication status: Published - 20 Aug 2012
Event: 13th International Conference on Advances in Computer Games, ACG 2011 - Tilburg, Netherlands
Duration: 20 Nov 2011 to 22 Nov 2011

Publication series

Name: Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume: 7168 LNCS
ISSN (Print): 0302-9743
ISSN (Electronic): 1611-3349

Conference

Conference: 13th International Conference on Advances in Computer Games, ACG 2011
Country/Territory: Netherlands
City: Tilburg
Period: 20/11/11 to 22/11/11
