(Bandit) Convex Optimization with Biased Noisy Gradient Oracles

Xiaowei Hu, Prashanth L.A., András György, Csaba Szepesvári

Computer Science > Machine Learning

(Submitted on 22 Sep 2016 (v1), last revised 4 Jul 2020 (this version, v2))

Abstract: Algorithms for bandit convex optimization and online learning often rely on constructing noisy gradient estimates, which are then used in appropriately adjusted first-order algorithms, replacing actual gradients. Depending on the properties of the function to be optimized and the nature of "noise" in the bandit feedback, the bias and variance of gradient estimates exhibit various tradeoffs. In this paper we propose a novel framework that replaces the specific gradient estimation methods with an abstract oracle. With the help of the new framework we unify previous works, reproducing their results in a clean and concise fashion, while, perhaps more importantly, the framework also allows us to formally show that to achieve the optimal root-$n$ rate, either the algorithms that use existing gradient estimators or the proof techniques used to analyze them have to go beyond what exists today.
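
The bias-variance tradeoff mentioned in the abstract can be illustrated with a minimal sketch. This is not the paper's oracle framework or any estimator it analyzes; it is a plain forward-difference estimator chosen for illustration, and every name in it (`noisy_value`, `forward_difference_gradient`, `gradient_descent`, the parameters `delta` and `sigma`) is hypothetical. The estimate is built only from noisy function values, so a smaller `delta` reduces the bias while amplifying the noise by a factor of `1/delta`.

```python
import random


def noisy_value(f, x, sigma, rng):
    """Bandit-style feedback: the function value plus zero-mean Gaussian noise."""
    return f(x) + rng.gauss(0.0, sigma)


def forward_difference_gradient(f, x, delta, sigma=0.0, rng=None):
    """Estimate the gradient of f at x from noisy function values only.

    Each coordinate uses (f(x + delta * e_i) - f(x)) / delta.
    For smooth f the bias shrinks with delta, while the noise in the
    estimate is scaled by 1 / delta, so the two pull in opposite directions.
    """
    rng = rng or random.Random(0)
    base = noisy_value(f, x, sigma, rng)  # one noisy evaluation shared by all coordinates
    grad = []
    for i in range(len(x)):
        shifted = list(x)
        shifted[i] += delta
        grad.append((noisy_value(f, shifted, sigma, rng) - base) / delta)
    return grad


def gradient_descent(f, x0, steps, step_size, delta, sigma=0.0, seed=0):
    """Plain gradient descent with the estimated gradient replacing the true one."""
    rng = random.Random(seed)
    x = list(x0)
    for _ in range(steps):
        g = forward_difference_gradient(f, x, delta, sigma, rng)
        x = [xi - step_size * gi for xi, gi in zip(x, g)]
    return x


def sum_of_squares(x):
    """A simple convex test function with gradient 2 * x."""
    return sum(xi * xi for xi in x)


if __name__ == "__main__":
    x = [1.0, -2.0]
    # Noiseless case: only the bias of the estimator is visible.
    print(forward_difference_gradient(sum_of_squares, x, delta=0.1))
    # Noisy case: the same delta now also amplifies the evaluation noise.
    print(forward_difference_gradient(sum_of_squares, x, delta=0.1, sigma=0.01))
    print(gradient_descent(sum_of_squares, x, steps=100, step_size=0.1, delta=0.1))
```

For the quadratic test function, the noiseless estimate of each coordinate is exactly `2 * x[i] + delta` up to floating-point error, so the bias equals `delta`. With noise of standard deviation `sigma`, the per-coordinate estimate carries noise on the order of `sigma / delta`.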

Subjects:	Machine Learning (cs.LG); Machine Learning (stat.ML)
Cite as:	arXiv:1609.07087 [cs.LG]
	(or arXiv:1609.07087v2 [cs.LG] for this version)

Submission history

From: L.A. Prashanth [view email]
[v1] Thu, 22 Sep 2016 17:56:38 GMT (424kb,D)
[v2] Sat, 4 Jul 2020 22:16:51 GMT (382kb,D)
