We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

cs.CV

Change to browse by:

cs

References & Citations

DBLP - CS Bibliography

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo

Computer Science > Computer Vision and Pattern Recognition

Title: Training Deep Neural Networks via Branch-and-Bound

Abstract: In this paper, we propose BPGrad, a novel approximate algorithm for deep nueral network training, based on adaptive estimates of feasible region via branch-and-bound. The method is based on the assumption of Lipschitz continuity in objective function, and as a result, it can adaptively determine the step size for the current gradient given the history of previous updates. We prove that, by repeating such a branch-and-pruning procedure, it can achieve the optimal solution within finite iterations. A computationally efficient solver based on BPGrad has been proposed to train the deep neural networks. Empirical results demonstrate that BPGrad solver works well in practice and compares favorably to other stochastic optimization methods in the tasks of object recognition, detection, and segmentation. The code is available at \url{this https URL}.
Comments: 29 pages, 17 figures. arXiv admin note: substantial text overlap with arXiv:1711.06959
Subjects: Computer Vision and Pattern Recognition (cs.CV)
Cite as: arXiv:2104.01730 [cs.CV]
  (or arXiv:2104.01730v2 [cs.CV] for this version)

Submission history

From: Yuanwei Wu [view email]
[v1] Mon, 5 Apr 2021 00:43:03 GMT (10654kb,D)
[v2] Sun, 24 Oct 2021 02:15:37 GMT (10204kb,D)

Link back to: arXiv, form interface, contact.