On Private and Robust Bandits

Wu, Yulian; Zhou, Xingyu; Tao, Youming; Wang, Di

Computer Science > Machine Learning

Title: On Private and Robust Bandits

Authors: Yulian Wu, Xingyu Zhou, Youming Tao, Di Wang

(Submitted on 6 Feb 2023 (v1), last revised 4 Mar 2023 (this version, v2))

Abstract: We study private and robust multi-armed bandits (MABs), where the agent receives Huber's contaminated heavy-tailed rewards and meanwhile needs to ensure differential privacy. We first present its minimax lower bound, characterizing the information-theoretic limit of regret with respect to privacy budget, contamination level and heavy-tailedness. Then, we propose a meta-algorithm that builds on a private and robust mean estimation sub-routine \texttt{PRM} that essentially relies on reward truncation and the Laplace mechanism only. For two different heavy-tailed settings, we give specific schemes of \texttt{PRM}, which enable us to achieve nearly-optimal regret. As by-products of our main results, we also give the first minimax lower bound for private heavy-tailed MABs (i.e., without contamination). Moreover, our two proposed truncation-based \texttt{PRM} achieve the optimal trade-off between estimation accuracy, privacy and robustness. Finally, we support our theoretical results with experimental studies.
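The abstract says the mean-estimation sub-routine relies only on reward truncation and the Laplace mechanism. As a rough illustration of that combination, and not the paper's actual PRM scheme, the sketch below clips each reward to a range, averages, and adds Laplace noise calibrated to the clipped mean's sensitivity. The function name, the `threshold` parameter, and the choice of clipping (rather than zeroing out large rewards) are assumptions made for this example; the paper's truncation thresholds and its handling of contamination are not reproduced here.

```python
import random


def truncated_laplace_mean(rewards, threshold, epsilon, rng=None):
    """Hypothetical sketch: epsilon-DP mean of rewards clipped to [-threshold, threshold]."""
    if not rewards:
        raise ValueError("rewards must be non-empty")
    if threshold <= 0 or epsilon <= 0:
        raise ValueError("threshold and epsilon must be positive")
    rng = rng or random.Random()
    n = len(rewards)

    # Truncation step: bound the influence of heavy-tailed or contaminated samples.
    clipped = [max(-threshold, min(threshold, r)) for r in rewards]

    # Replacing one reward changes the clipped mean by at most 2 * threshold / n.
    sensitivity = 2.0 * threshold / n
    scale = sensitivity / epsilon

    # The difference of two i.i.d. exponentials with mean `scale` is Laplace(0, scale).
    noise = rng.expovariate(1.0 / scale) - rng.expovariate(1.0 / scale)
    return sum(clipped) / n + noise


# Example call: one extreme reward is clipped before the noisy mean is released.
samples = [0.1, 0.4, 0.3, 250.0]
estimate = truncated_laplace_mean(samples, threshold=1.0, epsilon=1.0, rng=random.Random(0))
```

A smaller `threshold` lowers both the noise scale and the influence of outliers but increases the bias from clipping, which is the accuracy, privacy and robustness trade-off the abstract refers to.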

Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Cryptography and Security (cs.CR); Machine Learning (stat.ML)
Cite as:	arXiv:2302.02526 [cs.LG]
	(or arXiv:2302.02526v2 [cs.LG] for this version)

Submission history

From: Yulian Wu [view email]
[v1] Mon, 6 Feb 2023 01:55:06 GMT (667kb,D)
[v2] Sat, 4 Mar 2023 11:27:16 GMT (668kb,D)
