Fully adaptive algorithm for pure exploration in linear bandits

Xu, Liyuan; Honda, Junya; Sugiyama, Masashi

Full-text links:

Download:

Current browse context:

stat.ML

< prev | next >

new | recent | 1710

Statistics > Machine Learning

Title: Fully adaptive algorithm for pure exploration in linear bandits

Authors: Liyuan Xu, Junya Honda, Masashi Sugiyama

(Submitted on 16 Oct 2017)

Abstract: We propose the first fully-adaptive algorithm for pure exploration in linear bandits---the task to find the arm with the largest expected reward, which depends on an unknown parameter linearly. While existing methods partially or entirely fix sequences of arm selections before observing rewards, our method adaptively changes the arm selection strategy based on past observations at each round. We show our sample complexity matches the achievable lower bound up to a constant factor in an extreme case. Furthermore, we evaluate the performance of the methods by simulations based on both synthetic setting and real-world data, in which our method shows vast improvement over existing methods.

Subjects:	Machine Learning (stat.ML)
Cite as:	arXiv:1710.05552 [stat.ML]
	(or arXiv:1710.05552v1 [stat.ML] for this version)

Submission history

From: Liyuan Xu [view email]
[v1] Mon, 16 Oct 2017 08:16:50 GMT (56kb)

Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)

Link back to: arXiv, form interface, contact.

> stat > arXiv:1710.05552

Download:

Current browse context:

Change to browse by:

References & Citations

Bookmark

Statistics > Machine Learning

Title: Fully adaptive algorithm for pure exploration in linear bandits

Submission history