We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

cs.LG

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo

Computer Science > Machine Learning

Title: OptNet: Differentiable Optimization as a Layer in Neural Networks

Abstract: This paper presents OptNet, a network architecture that integrates optimization problems (here, specifically in the form of quadratic programs) as individual layers in larger end-to-end trainable deep networks. These layers encode constraints and complex dependencies between the hidden states that traditional convolutional and fully-connected layers often cannot capture. We explore the foundations for such an architecture: we show how techniques from sensitivity analysis, bilevel optimization, and implicit differentiation can be used to exactly differentiate through these layers and with respect to layer parameters; we develop a highly efficient solver for these layers that exploits fast GPU-based batch solves within a primal-dual interior point method, and which provides backpropagation gradients with virtually no additional cost on top of the solve; and we highlight the application of these approaches in several problems. In one notable example, the method is learns to play mini-Sudoku (4x4) given just input and output games, with no a-priori information about the rules of the game; this highlights the ability of OptNet to learn hard constraints better than other neural architectures.
Comments: ICML 2017
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Optimization and Control (math.OC); Machine Learning (stat.ML)
Cite as: arXiv:1703.00443 [cs.LG]
  (or arXiv:1703.00443v5 [cs.LG] for this version)

Submission history

From: Brandon Amos [view email]
[v1] Wed, 1 Mar 2017 18:58:48 GMT (445kb,D)
[v2] Wed, 14 Jun 2017 17:59:07 GMT (966kb,D)
[v3] Fri, 12 Jan 2018 19:44:25 GMT (971kb,D)
[v4] Mon, 14 Oct 2019 18:03:26 GMT (662kb,D)
[v5] Thu, 2 Dec 2021 17:34:50 GMT (662kb,D)

Link back to: arXiv, form interface, contact.