Continuous Deep Equilibrium Models: Training Neural ODEs faster by integrating them to Infinity

Pal, Avik; Edelman, Alan; Rackauckas, Christopher

Full-text links:

Download:

Current browse context:

cs.LG

< prev | next >

new | recent | 2201

Computer Science > Machine Learning

Title: Continuous Deep Equilibrium Models: Training Neural ODEs faster by integrating them to Infinity

Authors: Avik Pal, Alan Edelman, Christopher Rackauckas

(Submitted on 28 Jan 2022 (v1), last revised 3 Mar 2023 (this version, v4))

Abstract: Implicit models separate the definition of a layer from the description of its solution process. While implicit layers allow features such as depth to adapt to new scenarios and inputs automatically, this adaptivity makes its computational expense challenging to predict. In this manuscript, we increase the "implicitness" of the DEQ by redefining the method in terms of an infinite time neural ODE, which paradoxically decreases the training cost over a standard neural ODE by 2-4x. Additionally, we address the question: is there a way to simultaneously achieve the robustness of implicit layers while allowing the reduced computational expense of an explicit layer? To solve this, we develop Skip and Skip Reg. DEQ, an implicit-explicit (IMEX) layer that simultaneously trains an explicit prediction followed by an implicit correction. We show that training this explicit predictor is free and even decreases the training time by 1.11-3.19x. Together, this manuscript shows how bridging the dichotomy of implicit and explicit deep learning can combine the advantages of both techniques.

Subjects:	Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Dynamical Systems (math.DS)
Cite as:	arXiv:2201.12240 [cs.LG]
	(or arXiv:2201.12240v4 [cs.LG] for this version)

Submission history

From: Avik Pal [view email]
[v1] Fri, 28 Jan 2022 16:51:54 GMT (242kb,D)
[v2] Fri, 4 Feb 2022 19:36:55 GMT (487kb,D)
[v3] Wed, 1 Mar 2023 15:38:54 GMT (749kb,D)
[v4] Fri, 3 Mar 2023 16:34:22 GMT (749kb,D)

Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)

Link back to: arXiv, form interface, contact.

> cs > arXiv:2201.12240

Download:

Current browse context:

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

Computer Science > Machine Learning

Title: Continuous Deep Equilibrium Models: Training Neural ODEs faster by integrating them to Infinity

Submission history