Title: A KL-LUCB Bandit Algorithm for Large-Scale Crowdsourcing

Abstract: This paper focuses on best-arm identification in multi-armed bandits with bounded rewards. We develop an algorithm that is a fusion of lil-UCB and KL-LUCB, offering the best qualities of the two algorithms in one method. This is achieved by proving a novel anytime confidence bound for the mean of bounded distributions, which is the analogue of the LIL-type bounds recently developed for sub-Gaussian distributions. We corroborate our theoretical results with numerical experiments based on the New Yorker Cartoon Caption Contest.
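As a rough illustration of the kind of KL-based confidence interval that KL-LUCB-style methods rely on, the sketch below computes upper and lower bounds on a mean in [0, 1] by inverting the Bernoulli KL divergence with bisection. It is not the paper's anytime bound: the `threshold` argument is a hypothetical placeholder (here a fixed `log(1/delta)`), and the function names `bernoulli_kl`, `kl_upper_bound`, and `kl_lower_bound` are assumptions made for this example.

```python
import math


def bernoulli_kl(p, q):
    """KL divergence between Bernoulli(p) and Bernoulli(q).

    Both arguments are clipped away from 0 and 1 to avoid log(0).
    """
    eps = 1e-12
    p = min(max(p, eps), 1 - eps)
    q = min(max(q, eps), 1 - eps)
    return p * math.log(p / q) + (1 - p) * math.log((1 - p) / (1 - q))


def kl_upper_bound(mean, n, threshold, iters=50):
    """Approximate the largest q in [mean, 1] with n * kl(mean, q) <= threshold."""
    lo, hi = mean, 1.0
    for _ in range(iters):
        mid = (lo + hi) / 2
        if n * bernoulli_kl(mean, mid) <= threshold:
            lo = mid  # mid is still inside the confidence region
        else:
            hi = mid
    return lo


def kl_lower_bound(mean, n, threshold, iters=50):
    """Approximate the smallest q in [0, mean] with n * kl(mean, q) <= threshold."""
    lo, hi = 0.0, mean
    for _ in range(iters):
        mid = (lo + hi) / 2
        if n * bernoulli_kl(mean, mid) <= threshold:
            hi = mid  # mid is still inside the confidence region
        else:
            lo = mid
    return hi


if __name__ == "__main__":
    delta = 0.05                       # assumed confidence parameter
    threshold = math.log(1 / delta)    # placeholder, not the paper's threshold
    for n in (100, 1000):
        lower = kl_lower_bound(0.5, n, threshold)
        upper = kl_upper_bound(0.5, n, threshold)
        print(n, lower, upper)
```

By Pinsker's inequality, an interval obtained this way is never wider than the Hoeffding interval built from the same threshold, which is one reason KL-based bounds are attractive for bounded rewards.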
Subjects: Statistics Theory (math.ST)
Cite as: arXiv:1709.03570 [math.ST]
  (or arXiv:1709.03570v1 [math.ST] for this version)

Submission history

From: Ervin Tánczos
[v1] Mon, 11 Sep 2017 20:14:59 GMT (265kb,D)