Correlated Multiarmed Bandit Problem: Bayesian Algorithms and Regret Analysis

Srivastava, Vaibhav; Reverdy, Paul; Leonard, Naomi Ehrich

Full-text links:

Download:

Current browse context:

math.OC

< prev | next >

new | recent | 1507

Mathematics > Optimization and Control

Title: Correlated Multiarmed Bandit Problem: Bayesian Algorithms and Regret Analysis

Authors: Vaibhav Srivastava, Paul Reverdy, Naomi Ehrich Leonard

(Submitted on 5 Jul 2015 (v1), last revised 7 Jul 2015 (this version, v2))

Abstract: We consider the correlated multiarmed bandit (MAB) problem in which the rewards associated with each arm are modeled by a multivariate Gaussian random variable, and we investigate the influence of the assumptions in the Bayesian prior on the performance of the upper credible limit (UCL) algorithm and a new correlated UCL algorithm. We rigorously characterize the influence of accuracy, confidence, and correlation scale in the prior on the decision-making performance of the algorithms. Our results show how priors and correlation structure can be leveraged to improve performance.

Subjects:	Optimization and Control (math.OC); Machine Learning (cs.LG); Machine Learning (stat.ML)
Cite as:	arXiv:1507.01160 [math.OC]
	(or arXiv:1507.01160v2 [math.OC] for this version)

Submission history

From: Vaibhav Srivastava [view email]
[v1] Sun, 5 Jul 2015 02:16:25 GMT (512kb,D)
[v2] Tue, 7 Jul 2015 22:27:35 GMT (262kb,D)

Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)

Link back to: arXiv, form interface, contact.

> math > arXiv:1507.01160

Download:

Current browse context:

Change to browse by:

References & Citations

Bookmark

Mathematics > Optimization and Control

Title: Correlated Multiarmed Bandit Problem: Bayesian Algorithms and Regret Analysis

Submission history