Temporal difference learning for Connect6

I-Chen Wu*, Hsin Ti Tsai, Hung Hsuan Lin, Yi Shan Lin, Chieh Min Chang, Ping Hung Lin

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

7 Scopus citations

Abstract

In this paper, we apply temporal difference (TD) learning to Connect6, and successfully use TD(0) to improve the strength of a Connect6 program, NCTU6. The program won several computer Connect6 tournaments and also many man-machine Connect6 tournaments from 2006 to 2011. From our experiments, the best improved version of TD learning achieves about a 58% win rate against the original NCTU6 program. This paper discusses three implementation issues that improve the program. The program has a convincing performance in removing winning/losing moves via threat-space search in TD learning.

Original languageEnglish
Title of host publicationAdvances in Computer Games - 13th International Conference, ACG 2011, Revised Selected Papers
Pages121-133
Number of pages13
DOIs
StatePublished - 20 Aug 2012
Event13th International Conference on Advances in Computer Games, ACG 2011 - Tilburg, Netherlands
Duration: 20 Nov 201122 Nov 2011

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume7168 LNCS
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference13th International Conference on Advances in Computer Games, ACG 2011
Country/TerritoryNetherlands
CityTilburg
Period20/11/1122/11/11

Fingerprint

Dive into the research topics of 'Temporal difference learning for Connect6'. Together they form a unique fingerprint.

Cite this