We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

stat.ML

Change to browse by:

References & Citations

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo

Statistics > Machine Learning

Title: Best-Arm Identification for Quantile Bandits with Privacy

Abstract: We study the best-arm identification problem in multi-armed bandits with stochastic, potentially private rewards, when the goal is to identify the arm with the highest quantile at a fixed, prescribed level. First, we propose a (non-private) successive elimination algorithm for strictly optimal best-arm identification, we show that our algorithm is $\delta$-PAC and we characterize its sample complexity. Further, we provide a lower bound on the expected number of pulls, showing that the proposed algorithm is essentially optimal up to logarithmic factors. Both upper and lower complexity bounds depend on a special definition of the associated suboptimality gap, designed in particular for the quantile bandit problem, as we show when the gap approaches zero, best-arm identification is impossible. Second, motivated by applications where the rewards are private, we provide a differentially private successive elimination algorithm whose sample complexity is finite even for distributions with infinite support-size, and we characterize its sample complexity as well. Our algorithms do not require prior knowledge of either the suboptimality gap or other statistical information related to the bandit problem at hand.
Comments: 24 pages, 4 figures
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG)
Cite as: arXiv:2006.06792 [stat.ML]
  (or arXiv:2006.06792v1 [stat.ML] for this version)

Submission history

From: Konstantinos Nikolakakis [view email]
[v1] Thu, 11 Jun 2020 20:23:43 GMT (689kb,D)
[v2] Mon, 26 Apr 2021 21:05:06 GMT (818kb,D)
[v3] Sun, 23 May 2021 23:08:14 GMT (785kb,D)
[v4] Sun, 4 Dec 2022 12:32:00 GMT (752kb,D)

Link back to: arXiv, form interface, contact.