We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

cs.GT

Change to browse by:

cs

References & Citations

DBLP - CS Bibliography

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo ScienceWISE logo

Computer Science > Computer Science and Game Theory

Title: Best-Response Dynamics and Fictitious Play in Identical-Interest and Zero-Sum Stochastic Games

Abstract: This paper combines ideas from Q-learning and fictitious play to define three reinforcement learning procedures which converge to the set of stationary mixed Nash equilibria in identical interest discounted stochastic games. First, we analyse three continuous-time systems that generalize the best-response dynamics defined by Leslie et al. for zero-sum discounted stochastic games. Under some assumptions depending on the system, the dynamics are shown to converge to the set of stationary equilibria in identical interest discounted stochastic games. Then, we introduce three analog discrete-time procedures in the spirit of Sayin et al. and demonstrate their convergence to the set of stationary equilibria using our results in continuous time together with stochastic approximation techniques. Some numerical experiments complement our theoretical findings.
Comments: Preprint, accepted at ICML 2022
Subjects: Computer Science and Game Theory (cs.GT)
Cite as: arXiv:2111.04317 [cs.GT]
  (or arXiv:2111.04317v2 [cs.GT] for this version)

Submission history

From: Lucas Baudin [view email]
[v1] Mon, 8 Nov 2021 08:06:57 GMT (318kb,D)
[v2] Mon, 16 May 2022 11:37:12 GMT (644kb)

Link back to: arXiv, form interface, contact.