Bayesian Opponent Exploitation in Imperfect-Information Games

Ganzfried, Sam; Sun, Qingyun

(Submitted on 10 Mar 2016 (v1), last revised 28 Jun 2018 (this version, v6))

Abstract: Two fundamental problems in computational game theory are computing a Nash equilibrium and learning to exploit opponents given observations of their play (opponent exploitation). The latter is perhaps even more important than the former: Nash equilibrium does not have a compelling theoretical justification in game classes other than two-player zero-sum, and for all games one can potentially do better by exploiting perceived weaknesses of the opponent than by following a static equilibrium strategy throughout the match. The natural setting for opponent exploitation is the Bayesian setting, where we have a prior model that is integrated with observations to create a posterior opponent model that we respond to. The most natural, and a well-studied, prior distribution is the Dirichlet distribution. An exact polynomial-time algorithm is known for best-responding to the posterior distribution for an opponent assuming a Dirichlet prior with multinomial sampling in normal-form games; however, for imperfect-information games the best known algorithm is based on approximating an infinite integral without theoretical guarantees. We present the first exact algorithm for a natural class of imperfect-information games. We demonstrate that our algorithm runs quickly in practice and outperforms the best prior approaches. We also present an algorithm for the uniform prior setting.
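
The normal-form case mentioned in the abstract can be illustrated with a minimal sketch. It relies on two standard facts: a Dirichlet prior with multinomial observations gives a Dirichlet posterior whose parameters are the prior parameters plus the observed counts, and expected payoff in a normal-form game is linear in the opponent's mixed strategy, so best-responding to the posterior reduces to best-responding to its mean. The function names, the rock-paper-scissors payoffs, and the counts below are illustrative assumptions and are not taken from the paper; the sketch does not attempt the imperfect-information setting that the paper addresses.

```python
def dirichlet_posterior_mean(alpha, counts):
    """Mean of the Dirichlet posterior with parameters alpha + counts.

    alpha:  prior Dirichlet parameters, one per opponent action (assumed input)
    counts: number of times each opponent action was observed (assumed input)
    """
    posterior = [a + c for a, c in zip(alpha, counts)]
    total = sum(posterior)
    return [p / total for p in posterior]


def best_response(payoffs, opponent_mix):
    """Return the index of our best pure action and all expected payoffs.

    payoffs[i][j] is our payoff when we play action i and the opponent plays j.
    """
    expected = [sum(u * q for u, q in zip(row, opponent_mix)) for row in payoffs]
    best = max(range(len(expected)), key=expected.__getitem__)
    return best, expected


# Hypothetical example: rock-paper-scissors, actions ordered rock, paper, scissors.
payoffs = [
    [0, -1, 1],   # we play rock
    [1, 0, -1],   # we play paper
    [-1, 1, 0],   # we play scissors
]
alpha = [1, 1, 1]    # uniform Dirichlet prior
counts = [6, 2, 2]   # the opponent was observed playing rock most often

mean = dirichlet_posterior_mean(alpha, counts)
action, expected = best_response(payoffs, mean)
print(action, expected)
```

With these assumed numbers the posterior mean puts weight 7/13 on rock, so the best response is paper (index 1) with expected payoff 4/13.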

Subjects:	Computer Science and Game Theory (cs.GT); Artificial Intelligence (cs.AI); Multiagent Systems (cs.MA); Probability (math.PR); Computation (stat.CO)
Cite as:	arXiv:1603.03491 [cs.GT]
	(or arXiv:1603.03491v6 [cs.GT] for this version)

Submission history

From: Sam Ganzfried
[v1] Thu, 10 Mar 2016 23:50:51 GMT (26kb,D)
[v2] Sat, 17 Sep 2016 19:35:22 GMT (34kb,D)
[v3] Fri, 18 Nov 2016 06:23:30 GMT (53kb,D)
[v4] Mon, 13 Feb 2017 22:04:34 GMT (57kb,D)
[v5] Wed, 27 Jun 2018 02:35:11 GMT (58kb,D)
[v6] Thu, 28 Jun 2018 00:55:09 GMT (58kb,D)
