Current browse context:
cs.LG
Change to browse by:
References & Citations
Computer Science > Machine Learning
Title: (Bandit) Convex Optimization with Biased Noisy Gradient Oracles
(Submitted on 22 Sep 2016 (v1), last revised 4 Jul 2020 (this version, v2))
Abstract: Algorithms for bandit convex optimization and online learning often rely on constructing noisy gradient estimates, which are then used in appropriately adjusted first-order algorithms, replacing actual gradients. Depending on the properties of the function to be optimized and the nature of ``noise'' in the bandit feedback, the bias and variance of gradient estimates exhibit various tradeoffs. In this paper we propose a novel framework that replaces the specific gradient estimation methods with an abstract oracle. With the help of the new framework we unify previous works, reproducing their results in a clean and concise fashion, while, perhaps more importantly, the framework also allows us to formally show that to achieve the optimal root-$n$ rate either the algorithms that use existing gradient estimators, or the proof techniques used to analyze them have to go beyond what exists today.
Submission history
From: L.A. Prashanth [view email][v1] Thu, 22 Sep 2016 17:56:38 GMT (424kb,D)
[v2] Sat, 4 Jul 2020 22:16:51 GMT (382kb,D)
Link back to: arXiv, form interface, contact.