Title: Bayesian Opponent Exploitation in Imperfect-Information Games

Abstract: Two fundamental problems in computational game theory are computing a Nash equilibrium and learning to exploit opponents given observations of their play (opponent exploitation). The latter is perhaps even more important than the former: Nash equilibrium does not have a compelling theoretical justification in game classes other than two-player zero-sum, and for all games one can potentially do better by exploiting perceived weaknesses of the opponent than by following a static equilibrium strategy throughout the match. The natural setting for opponent exploitation is the Bayesian setting, where we have a prior model that is integrated with observations to create a posterior opponent model that we respond to. The most natural and well-studied prior distribution is the Dirichlet distribution. An exact polynomial-time algorithm is known for best-responding to the posterior distribution for an opponent assuming a Dirichlet prior with multinomial sampling in normal-form games; however, for imperfect-information games the best known algorithm is based on approximating an infinite integral without theoretical guarantees. We present the first exact algorithm for a natural class of imperfect-information games. We demonstrate that our algorithm runs quickly in practice and outperforms the best prior approaches. We also present an algorithm for the uniform prior setting.
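
The normal-form case mentioned in the abstract can be illustrated with a small sketch. It relies on two standard facts rather than on anything specific to the paper: the Dirichlet distribution is conjugate to multinomial sampling, so the posterior is Dirichlet with parameters `alpha + counts`, and expected payoff in a normal-form game is linear in the opponent's mixed strategy, so best-responding to the posterior amounts to best-responding to its mean. The function names, the rock-paper-scissors payoff matrix, the prior, and the observation counts below are all assumptions chosen for illustration. This is not the paper's algorithm for imperfect-information games.

```python
from typing import List, Sequence, Tuple


def posterior_mean(alpha: Sequence[float], counts: Sequence[int]) -> List[float]:
    """Mean of the Dirichlet(alpha + counts) posterior over opponent actions."""
    posterior = [a + c for a, c in zip(alpha, counts)]
    total = sum(posterior)
    return [p / total for p in posterior]


def best_response(payoffs: Sequence[Sequence[float]],
                  opponent: Sequence[float]) -> Tuple[int, float]:
    """Return (row index, expected payoff) of a best pure response.

    payoffs[i][j] is our payoff when we play i and the opponent plays j.
    """
    expected = [sum(u * q for u, q in zip(row, opponent)) for row in payoffs]
    best = max(range(len(expected)), key=lambda i: expected[i])
    return best, expected[best]


# Illustrative data (hypothetical, not from the paper): rock-paper-scissors.
# Rows are our actions, columns the opponent's, both ordered rock, paper, scissors.
RPS = [[0, -1, 1],
       [1, 0, -1],
       [-1, 1, 0]]
prior = [1.0, 1.0, 1.0]   # uniform Dirichlet prior (assumed)
observed = [6, 2, 1]      # opponent action counts (assumed)

mean = posterior_mean(prior, observed)
action, value = best_response(RPS, mean)
print(action, round(value, 4))
```

With these assumed counts the posterior mean puts most weight on rock, so the sketch selects paper (index 1). The linearity argument does not carry over directly to imperfect-information games, where payoffs depend on products of the opponent's action probabilities along a path of play, which is the setting the abstract targets.
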
Subjects: Computer Science and Game Theory (cs.GT); Artificial Intelligence (cs.AI); Multiagent Systems (cs.MA); Probability (math.PR); Computation (stat.CO)
Cite as: arXiv:1603.03491 [cs.GT]
  (or arXiv:1603.03491v6 [cs.GT] for this version)

Submission history

From: Sam Ganzfried
[v1] Thu, 10 Mar 2016 23:50:51 GMT (26kb,D)
[v2] Sat, 17 Sep 2016 19:35:22 GMT (34kb,D)
[v3] Fri, 18 Nov 2016 06:23:30 GMT (53kb,D)
[v4] Mon, 13 Feb 2017 22:04:34 GMT (57kb,D)
[v5] Wed, 27 Jun 2018 02:35:11 GMT (58kb,D)
[v6] Thu, 28 Jun 2018 00:55:09 GMT (58kb,D)
