Learning to Persuade on the Fly: Robustness Against Ignorance

Zu, You; Iyer, Krishnamurthy; Xu, Haifeng

Full-text links:

Download:

Current browse context:

cs.GT

< prev | next >

new | recent | 2102

Computer Science > Computer Science and Game Theory

Title: Learning to Persuade on the Fly: Robustness Against Ignorance

Authors: You Zu, Krishnamurthy Iyer, Haifeng Xu

(Submitted on 19 Feb 2021 (v1), last revised 3 May 2024 (this version, v2))

Abstract: Motivated by information sharing in online platforms, we study repeated persuasion between a sender and a stream of receivers where at each time, the sender observes a payoff-relevant state drawn independently and identically from an unknown distribution, and shares state information with the receivers who each choose an action. The sender seeks to persuade the receivers into taking actions aligned with the sender's preference by selectively sharing state information. However, in contrast to the standard models, neither the sender nor the receivers know the distribution, and the sender has to persuade while learning the distribution on the fly.
We study the sender's learning problem of making persuasive action recommendations to achieve low regret against the optimal persuasion mechanism with the knowledge of the distribution. To do this, we first propose and motivate a persuasiveness criterion for the unknown distribution setting that centers robustness as a requirement in the face of uncertainty. Our main result is an algorithm that, with high probability, is robustly-persuasive and achieves $O(\sqrt{T\log T})$ regret, where $T$ is the horizon length. Intuitively, at each time our algorithm maintains a set of candidate distributions, and chooses a signaling mechanism that is simultaneously persuasive for all of them. Core to our proof is a tight analysis about the cost of robust persuasion, which may be of independent interest. We further prove that this regret order is optimal (up to logarithmic terms) by showing that no algorithm can achieve regret better than $\Omega(\sqrt{T})$.

Comments:	Accepted at Operations Research. Preliminary version appeared as an extended abstract in EC 2021
Subjects:	Computer Science and Game Theory (cs.GT); Machine Learning (cs.LG); Theoretical Economics (econ.TH)
MSC classes:	91A28, 68W27, 68Q25
ACM classes:	F.2; G.3
Cite as:	arXiv:2102.10156 [cs.GT]
	(or arXiv:2102.10156v2 [cs.GT] for this version)

Submission history

From: Krishnamurthy Iyer [view email]
[v1] Fri, 19 Feb 2021 21:02:15 GMT (54kb,D)
[v2] Fri, 3 May 2024 05:08:29 GMT (92kb,D)

Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)

Link back to: arXiv, form interface, contact.

> cs > arXiv:2102.10156

Download:

Current browse context:

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

Computer Science > Computer Science and Game Theory

Title: Learning to Persuade on the Fly: Robustness Against Ignorance

Submission history