Coordinating Followers to Reach Better Equilibria: End-to-End Gradient Descent for Stackelberg Games

Wang, Kai; Xu, Lily; Perrault, Andrew; Reiter, Michael K.; Tambe, Milind

Full-text links:

Download:

Current browse context:

cs.GT

< prev | next >

new | recent | 2106

Change to browse by:

Computer Science > Computer Science and Game Theory

Title: Coordinating Followers to Reach Better Equilibria: End-to-End Gradient Descent for Stackelberg Games

Authors: Kai Wang, Lily Xu, Andrew Perrault, Michael K. Reiter, Milind Tambe

(Submitted on 6 Jun 2021 (v1), last revised 3 Dec 2021 (this version, v2))

Abstract: A growing body of work in game theory extends the traditional Stackelberg game to settings with one leader and multiple followers who play a Nash equilibrium. Standard approaches for computing equilibria in these games reformulate the followers' best response as constraints in the leader's optimization problem. These reformulation approaches can sometimes be effective, but often get trapped in low-quality solutions when followers' objectives are non-linear or non-quadratic. Moreover, these approaches assume a unique equilibrium or a specific equilibrium concept, e.g., optimistic or pessimistic, which is a limiting assumption in many situations. To overcome these limitations, we propose a stochastic gradient descent--based approach, where the leader's strategy is updated by differentiating through the followers' best responses. We frame the leader's optimization as a learning problem against followers' equilibrium, which allows us to decouple the followers' equilibrium constraints from the leader's problem. This approach also addresses cases with multiple equilibria and arbitrary equilibrium selection procedures by back-propagating through a sampled Nash equilibrium. To this end, this paper introduces a novel concept called equilibrium flow to formally characterize the set of equilibrium selection processes where the gradient with respect to a sampled equilibrium is an unbiased estimate of the true gradient. We evaluate our approach experimentally against existing baselines in three Stackelberg problems with multiple followers and find that in each case, our approach is able to achieve higher utility for the leader.

Subjects:	Computer Science and Game Theory (cs.GT)
Cite as:	arXiv:2106.03278 [cs.GT]
	(or arXiv:2106.03278v2 [cs.GT] for this version)

Submission history

From: Kai Wang [view email]
[v1] Sun, 6 Jun 2021 23:43:29 GMT (300kb,D)
[v2] Fri, 3 Dec 2021 23:28:06 GMT (311kb,D)

Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)

Link back to: arXiv, form interface, contact.

> cs > arXiv:2106.03278v2

Download:

Current browse context:

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

Computer Science > Computer Science and Game Theory

Title: Coordinating Followers to Reach Better Equilibria: End-to-End Gradient Descent for Stackelberg Games

Submission history