Puzzle Solving without Search or Human Knowledge: An Unnatural Language Approach

Noever, David; Burdick, Ryerson

Full-text links:

Download:

PDF only

Current browse context:

cs.LG

< prev | next >

new | recent | 2109

Computer Science > Machine Learning

Title: Puzzle Solving without Search or Human Knowledge: An Unnatural Language Approach

Authors: David Noever, Ryerson Burdick

(Submitted on 7 Sep 2021)

Abstract: The application of Generative Pre-trained Transformer (GPT-2) to learn text-archived game notation provides a model environment for exploring sparse reward gameplay. The transformer architecture proves amenable to training on solved text archives describing mazes, Rubik's Cube, and Sudoku solvers. The method benefits from fine-tuning the transformer architecture to visualize plausible strategies derived outside any guidance from human heuristics or domain expertise. The large search space ($>10^{19}$) for the games provides a puzzle environment in which the solution has few intermediate rewards and a final move that solves the challenge.

Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
Cite as:	arXiv:2109.02797 [cs.LG]
	(or arXiv:2109.02797v1 [cs.LG] for this version)

Submission history

From: Ryerson Burdick [view email]
[v1] Tue, 7 Sep 2021 01:20:28 GMT (824kb)

Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)

Link back to: arXiv, form interface, contact.

> cs > arXiv:2109.02797

Download:

Current browse context:

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

Computer Science > Machine Learning

Title: Puzzle Solving without Search or Human Knowledge: An Unnatural Language Approach

Submission history