Limits of End-to-End Learning

Glasmachers, Tobias

Full-text links:

Download:

Current browse context:

cs.LG

< prev | next >

new | recent | 1704

Computer Science > Machine Learning

Title: Limits of End-to-End Learning

Authors: Tobias Glasmachers

(Submitted on 26 Apr 2017)

Abstract: End-to-end learning refers to training a possibly complex learning system by applying gradient-based learning to the system as a whole. End-to-end learning system is specifically designed so that all modules are differentiable. In effect, not only a central learning machine, but also all "peripheral" modules like representation learning and memory formation are covered by a holistic learning process. The power of end-to-end learning has been demonstrated on many tasks, like playing a whole array of Atari video games with a single architecture. While pushing for solutions to more challenging tasks, network architectures keep growing more and more complex.
In this paper we ask the question whether and to what extent end-to-end learning is a future-proof technique in the sense of scaling to complex and diverse data processing architectures. We point out potential inefficiencies, and we argue in particular that end-to-end learning does not make optimal use of the modular design of present neural networks. Our surprisingly simple experiments demonstrate these inefficiencies, up to the complete breakdown of learning.

Subjects:	Machine Learning (cs.LG); Machine Learning (stat.ML)
Cite as:	arXiv:1704.08305 [cs.LG]
	(or arXiv:1704.08305v1 [cs.LG] for this version)

Submission history

From: Tobias Glasmachers [view email]
[v1] Wed, 26 Apr 2017 19:12:37 GMT (76kb,D)

Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)

Link back to: arXiv, form interface, contact.

> cs > arXiv:1704.08305

Download:

Current browse context:

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

Computer Science > Machine Learning

Title: Limits of End-to-End Learning

Submission history