We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

math.OC

Change to browse by:

References & Citations

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo

Mathematics > Optimization and Control

Title: Correlated Multiarmed Bandit Problem: Bayesian Algorithms and Regret Analysis

Abstract: We consider the correlated multiarmed bandit (MAB) problem in which the rewards associated with each arm are modeled by a multivariate Gaussian random variable, and we investigate the influence of the assumptions in the Bayesian prior on the performance of the upper credible limit (UCL) algorithm and a new correlated UCL algorithm. We rigorously characterize the influence of accuracy, confidence, and correlation scale in the prior on the decision-making performance of the algorithms. Our results show how priors and correlation structure can be leveraged to improve performance.
Subjects: Optimization and Control (math.OC); Machine Learning (cs.LG); Machine Learning (stat.ML)
Cite as: arXiv:1507.01160 [math.OC]
  (or arXiv:1507.01160v2 [math.OC] for this version)

Submission history

From: Vaibhav Srivastava [view email]
[v1] Sun, 5 Jul 2015 02:16:25 GMT (512kb,D)
[v2] Tue, 7 Jul 2015 22:27:35 GMT (262kb,D)

Link back to: arXiv, form interface, contact.