Title: To Ensemble or Not Ensemble: When does End-To-End Training Fail?

Abstract: End-to-End training (E2E) is becoming increasingly popular for training complex Deep Network architectures. An interesting question is whether this trend will continue: are there any clear failure cases for E2E training? We study this question in depth, for the specific case of E2E training an ensemble of networks. Our strategy is to blend the gradient smoothly between two extremes: from independent training of the networks, up to full E2E training. We find clear failure cases, where over-parameterized models cannot be trained E2E. A surprising result is that the optimum can sometimes lie in between the two, neither an ensemble nor an E2E system. The work also uncovers links to Dropout, and raises questions around the nature of ensemble diversity and multi-branch networks.
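
The abstract does not give the exact blending formula. The minimal sketch below shows one way such an interpolation could be written, under stated assumptions: a squared-error loss, an ensemble output formed by averaging scalar member predictions, and a hypothetical mixing parameter `lam` in [0, 1]. The names `squared_error`, `blended_loss`, and `lam` are illustrative and not taken from the paper. Because the blended loss is a linear combination of the two terms, its gradient is the same linear combination of their gradients.

```python
from typing import Sequence


def squared_error(prediction: float, target: float) -> float:
    """Squared-error loss for a single scalar prediction (assumed loss)."""
    return (prediction - target) ** 2


def blended_loss(member_predictions: Sequence[float], target: float, lam: float) -> float:
    """Interpolate between independent and end-to-end ensemble training.

    lam = 0.0: average of the members' individual losses (independent training).
    lam = 1.0: loss of the averaged ensemble prediction (full E2E training).
    This formula is an assumption made for illustration only.
    """
    if not 0.0 <= lam <= 1.0:
        raise ValueError("lam must lie in [0, 1]")
    if not member_predictions:
        raise ValueError("at least one member prediction is required")

    count = len(member_predictions)
    # E2E term: the loss is applied to the combined (averaged) output.
    ensemble_prediction = sum(member_predictions) / count
    e2e_term = squared_error(ensemble_prediction, target)
    # Independent term: each member is scored on its own, then averaged.
    independent_term = sum(squared_error(p, target) for p in member_predictions) / count
    return lam * e2e_term + (1.0 - lam) * independent_term
```

With member predictions `[1.0, 3.0]` and target `2.0`, the averaged prediction is exactly on target while each member is off by one, so the two extremes disagree and intermediate values of `lam` trade one objective against the other.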
Comments: Code: this https URL. Preprint updated to reflect version accepted for publication at ECML.
Subjects: Machine Learning (stat.ML); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
DOI: 10.13140/RG.2.2.28091.46880
Cite as: arXiv:1902.04422 [stat.ML]
  (or arXiv:1902.04422v4 [stat.ML] for this version)

Submission history

From: Andrew Webb
[v1] Tue, 12 Feb 2019 14:56:06 GMT (620kb,D)
[v2] Tue, 26 Feb 2019 11:15:03 GMT (620kb,D)
[v3] Mon, 29 Jun 2020 10:25:34 GMT (883kb,D)
[v4] Thu, 6 Aug 2020 09:48:03 GMT (883kb,D)