Neurohex: A Deep Q-learning Hex Agent

Young, Kenny; Hayward, Ryan; Vasan, Gautham

(Submitted on 24 Apr 2016 (v1), last revised 26 Apr 2016 (this version, v2))

Abstract: DeepMind's recent spectacular success in using deep convolutional neural nets and machine learning to build superhuman-level agents (e.g. for Atari games via deep Q-learning and for the game of Go via reinforcement learning) raises many questions, including to what extent these methods will succeed in other domains. In this paper we consider deep Q-learning (DQL) for the game of Hex: after supervised initialization, we use self-play to train NeuroHex, an 11-layer CNN that plays Hex on the 13x13 board. Hex is the classic two-player alternate-turn stone placement game played on a rhombus of hexagonal cells, in which the winner is whoever connects their two opposing sides. Despite the large action and state space, our system trains a Q-network capable of strong play with no search. After two weeks of Q-learning, NeuroHex achieves win rates of 20.4% as first player and 2.1% as second player against a 1-second/move version of MoHex, the current ICGA Olympiad Hex champion. Our data suggests further improvement might be possible with more training time.
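
The winning condition described in the abstract (connecting one's two opposing sides of the rhombus) can be illustrated with a short sketch. This is not code from the paper: the board encoding (0 for empty, 1 and 2 for the players), the axial neighbour layout, the assignment of sides to players, and the name `has_won` are all assumptions made for illustration.

```python
from collections import deque

# Six neighbours of a cell (row, col) on a rhombus-shaped hex board.
# The axial layout used here is an assumption of this sketch.
NEIGHBOURS = [(-1, 0), (-1, 1), (0, -1), (0, 1), (1, -1), (1, 0)]


def has_won(board, player):
    """Return True if `player` has a chain joining their two opposing sides.

    board: square list of lists holding 0 (empty), 1 or 2.
    Assumption: player 1 connects top to bottom, player 2 left to right.
    """
    n = len(board)
    if player == 1:
        starts = [(0, c) for c in range(n) if board[0][c] == player]
    else:
        starts = [(r, 0) for r in range(n) if board[r][0] == player]

    seen = set(starts)
    queue = deque(starts)
    while queue:
        r, c = queue.popleft()
        # Reaching the far side means the two sides are connected.
        if (r if player == 1 else c) == n - 1:
            return True
        for dr, dc in NEIGHBOURS:
            nr, nc = r + dr, c + dc
            if (0 <= nr < n and 0 <= nc < n
                    and (nr, nc) not in seen
                    and board[nr][nc] == player):
                seen.add((nr, nc))
                queue.append((nr, nc))
    return False
```

A breadth-first search from one side is enough here because a Hex position only needs a yes/no connectivity answer. On the 13x13 board mentioned in the abstract the search visits at most 169 cells.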

Subjects:	Artificial Intelligence (cs.AI)
Cite as:	arXiv:1604.07097 [cs.AI]
	(or arXiv:1604.07097v2 [cs.AI] for this version)

Submission history

From: Kenneth Young
[v1] Sun, 24 Apr 2016 23:56:37 GMT (1214kb,D)
[v2] Tue, 26 Apr 2016 02:26:14 GMT (1212kb,D)
