(Bandit) Convex Optimization with Biased Noisy Gradient Oracles

Hu, Xiaowei; A., Prashanth L.; György, András; Szepesvári, Csaba

Full-text links:

Download:

Current browse context:

cs.LG

< prev | next >

new | recent | 1609

Computer Science > Machine Learning

Title: (Bandit) Convex Optimization with Biased Noisy Gradient Oracles

Authors: Xiaowei Hu, Prashanth L.A., András György, Csaba Szepesvári

(Submitted on 22 Sep 2016 (this version), latest version 4 Jul 2020 (v2))

Abstract: Algorithms for bandit convex optimization and online learning often rely on constructing noisy gradient estimates, which are then used in appropriately adjusted first-order algorithms, replacing actual gradients. Depending on the properties of the function to be optimized and the nature of "noise" in the bandit feedback, the bias and variance of gradient estimates exhibit various tradeoffs. In this paper we propose a novel framework that replaces the specific gradient estimation methods with an abstract oracle. With the help of the new framework we unify previous works, reproducing their results in a clean and concise fashion, while, perhaps more importantly, the framework also allows us to formally show that to achieve the optimal root-$n$ rate either the algorithms that use existing gradient estimators, or the proof techniques used to analyze them have to go beyond what exists today.

Subjects:	Machine Learning (cs.LG); Machine Learning (stat.ML)
Cite as:	arXiv:1609.07087 [cs.LG]
	(or arXiv:1609.07087v1 [cs.LG] for this version)

Submission history

From: L.A. Prashanth [view email]
[v1] Thu, 22 Sep 2016 17:56:38 GMT (424kb,D)
[v2] Sat, 4 Jul 2020 22:16:51 GMT (382kb,D)

Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)

Link back to: arXiv, form interface, contact.

> cs > arXiv:1609.07087v1

Download:

Current browse context:

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

Computer Science > Machine Learning

Title: (Bandit) Convex Optimization with Biased Noisy Gradient Oracles

Submission history