We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

math.OC

Change to browse by:

References & Citations

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo

Mathematics > Optimization and Control

Title: A New Algorithm With Lower Complexity for Bilevel Optimization

Abstract: Many stochastic algorithms have been proposed to solve the bilevel optimization problem, where the lower level function is strongly convex and the upper level value function is nonconvex. In particular, exising Hessian inverse-free algorithms that utilize momentum recursion or variance reduction technqiues can reach an $\epsilon$-stationary point with a complexity of $\tilde{O}(\epsilon^{-1.5})$ under usual smoothness conditions. However, $\tilde{O}(\epsilon^{-1.5})$ is a complexity higher than $O(\epsilon^{-1.5})$. How to make a Hessian inverse-free algorithm achieve the complexity of $O(\epsilon^{-1.5})$ under usual smoothness conditions remains an unresolved problem. In this paper, we propose a new Hessian inverse-free algorithm based on the projected stochastic gradient descent method and variance reduction technique of SPIDER. This algorithm can achieve a complexity of $O(\epsilon^{-1.5})$ under usual smoothness conditions whether it runs in a fully single loop or double loop structure. Finally, we validate our theoretical results through synthetic experiments and demonstrate the efficiency of our algorithm in some machine learning applications.
Subjects: Optimization and Control (math.OC)
Cite as: arXiv:2404.11377 [math.OC]
  (or arXiv:2404.11377v1 [math.OC] for this version)

Submission history

From: Haimei Huo [view email]
[v1] Wed, 17 Apr 2024 13:33:53 GMT (2568kb,D)

Link back to: arXiv, form interface, contact.