A KL-LUCB Bandit Algorithm for Large-Scale Crowdsourcing

Mankoff, Bob; Nowak, Robert; Tanczos, Ervin

Full-text links:

Download:

Current browse context:

math.ST

< prev | next >

new | recent | 1709

Mathematics > Statistics Theory

Title: A KL-LUCB Bandit Algorithm for Large-Scale Crowdsourcing

Authors: Bob Mankoff, Robert Nowak, Ervin Tanczos

(Submitted on 11 Sep 2017)

Abstract: This paper focuses on best-arm identification in multi-armed bandits with bounded rewards. We develop an algorithm that is a fusion of lil-UCB and KL-LUCB, offering the best qualities of the two algorithms in one method. This is achieved by proving a novel anytime confidence bound for the mean of bounded distributions, which is the analogue of the LIL-type bounds recently developed for sub-Gaussian distributions. We corroborate our theoretical results with numerical experiments based on the New Yorker Cartoon Caption Contest.

Subjects:	Statistics Theory (math.ST)
Cite as:	arXiv:1709.03570 [math.ST]
	(or arXiv:1709.03570v1 [math.ST] for this version)

Submission history

From: Ervin Tánczos [view email]
[v1] Mon, 11 Sep 2017 20:14:59 GMT (265kb,D)

Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)

Link back to: arXiv, form interface, contact.

> math > arXiv:1709.03570

Download:

Current browse context:

Change to browse by:

References & Citations

Bookmark

Mathematics > Statistics Theory

Title: A KL-LUCB Bandit Algorithm for Large-Scale Crowdsourcing

Submission history