Current browse context:
math.OC
Change to browse by:
References & Citations
Mathematics > Optimization and Control
Title: Correlated Multiarmed Bandit Problem: Bayesian Algorithms and Regret Analysis
(Submitted on 5 Jul 2015 (v1), last revised 7 Jul 2015 (this version, v2))
Abstract: We consider the correlated multiarmed bandit (MAB) problem in which the rewards associated with each arm are modeled by a multivariate Gaussian random variable, and we investigate the influence of the assumptions in the Bayesian prior on the performance of the upper credible limit (UCL) algorithm and a new correlated UCL algorithm. We rigorously characterize the influence of accuracy, confidence, and correlation scale in the prior on the decision-making performance of the algorithms. Our results show how priors and correlation structure can be leveraged to improve performance.
Submission history
From: Vaibhav Srivastava [view email][v1] Sun, 5 Jul 2015 02:16:25 GMT (512kb,D)
[v2] Tue, 7 Jul 2015 22:27:35 GMT (262kb,D)
Link back to: arXiv, form interface, contact.