Stackelberg Mean-payoff Games with a Rationally Bounded Adversarial Follower

Balachander, Mrudula; Guha, Shibashis; Raskin, Jean-François

Full-text links:

Download:

Current browse context:

math.OC

< prev | next >

new | recent | 2007

Mathematics > Optimization and Control

Title: Stackelberg Mean-payoff Games with a Rationally Bounded Adversarial Follower

Authors: Mrudula Balachander, Shibashis Guha, Jean-François Raskin

(Submitted on 13 Jul 2020 (v1), revised 5 Nov 2020 (this version, v3), latest version 2 Aug 2021 (v5))

Abstract: Two-player Stackelberg games are non-zero sum strategic games between a leader (Player 0) and a follower (Player 1). Such games are played sequentially: first, the leader announces her strategy, second, the follower chooses his strategy, and then both players receive their respective payoff which is a function of the two strategies. The function that maps strategies to pairs of payoffs is known by the two players. As a consequence, if we assume that the follower is perfectly rational then we can deduce that the follower responds by playing a so-called best-response to the strategy of the leader in order to maximise his own payoff. In turn, the leader should choose a strategy that maximizes the value that she receives when the follower chooses a best-response to her strategy. If we cannot impose which best-response is chosen by the follower, we say that the setting is adversarial. However, sometimes, a more realistic assumption is to consider that the follower has only bounded rationality: the follower responds with one of his $\epsilon$-best responses, for some fixed $\epsilon$ > 0.
In this paper, we study the $\epsilon$-optimal Adversarial Stackelberg Value, $ASV^{\epsilon}$ for short, which is the value that the leader can obtain against any $\epsilon$-best response of a rationally bounded adversarial follower. The $ASV^{\epsilon}$ of Player 0 is the supremum of the values that Player 0 can obtain by announcing her strategy to Player 1 who in turn responds with an $\epsilon$-optimal strategy. We consider the setting of infinite duration games played on graphs with mean-payoff objectives.

Comments:	Added a more detailed example for Computation of ASV-epsilon
Subjects:	Optimization and Control (math.OC); Computer Science and Game Theory (cs.GT); Combinatorics (math.CO)
ACM classes:	F.1.1; G.1.6
Cite as:	arXiv:2007.07209 [math.OC]
	(or arXiv:2007.07209v3 [math.OC] for this version)

Submission history

From: Shibashis Guha [view email]
[v1] Mon, 13 Jul 2020 17:44:46 GMT (57kb)
[v2] Thu, 16 Jul 2020 16:48:45 GMT (56kb)
[v3] Thu, 5 Nov 2020 15:39:50 GMT (64kb)
[v4] Sun, 7 Feb 2021 10:32:51 GMT (91kb)
[v5] Mon, 2 Aug 2021 19:39:27 GMT (111kb,D)

Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)

Link back to: arXiv, form interface, contact.

> math > arXiv:2007.07209v3

Download:

Current browse context:

Change to browse by:

References & Citations

Bookmark

Mathematics > Optimization and Control

Title: Stackelberg Mean-payoff Games with a Rationally Bounded Adversarial Follower

Submission history