Unified Reinforcement Q-Learning for Mean Field Game and Control Problems

Angiuli, Andrea; Fouque, Jean-Pierre; Laurière, Mathieu

Full-text links:

Download:

Current browse context:

math.OC

< prev | next >

new | recent | 2006

Mathematics > Optimization and Control

Title: Unified Reinforcement Q-Learning for Mean Field Game and Control Problems

Authors: Andrea Angiuli, Jean-Pierre Fouque, Mathieu Laurière

(Submitted on 24 Jun 2020 (v1), last revised 31 May 2021 (this version, v3))

Abstract: We present a Reinforcement Learning (RL) algorithm to solve infinite horizon asymptotic Mean Field Game (MFG) and Mean Field Control (MFC) problems. Our approach can be described as a unified two-timescale Mean Field Q-learning: The \emph{same} algorithm can learn either the MFG or the MFC solution by simply tuning the ratio of two learning parameters. The algorithm is in discrete time and space where the agent not only provides an action to the environment but also a distribution of the state in order to take into account the mean field feature of the problem. Importantly, we assume that the agent can not observe the population's distribution and needs to estimate it in a model-free manner. The asymptotic MFG and MFC problems are also presented in continuous time and space, and compared with classical (non-asymptotic or stationary) MFG and MFC problems. They lead to explicit solutions in the linear-quadratic (LQ) case that are used as benchmarks for the results of our algorithm.

Subjects:	Optimization and Control (math.OC); Machine Learning (cs.LG); Multiagent Systems (cs.MA)
Cite as:	arXiv:2006.13912 [math.OC]
	(or arXiv:2006.13912v3 [math.OC] for this version)

Submission history

From: Mathieu Laurière [view email]
[v1] Wed, 24 Jun 2020 17:45:44 GMT (990kb)
[v2] Mon, 29 Mar 2021 17:26:15 GMT (278kb)
[v3] Mon, 31 May 2021 17:08:26 GMT (1279kb,D)

Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)

Link back to: arXiv, form interface, contact.

> math > arXiv:2006.13912

Download:

Current browse context:

Change to browse by:

References & Citations

Bookmark

Mathematics > Optimization and Control

Title: Unified Reinforcement Q-Learning for Mean Field Game and Control Problems

Submission history