We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

cs.LG

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo

Computer Science > Machine Learning

Title: Regularization Shortcomings for Continual Learning

Abstract: In classical machine learning, the data streamed to the algorithms is assumed to be independent and identically distributed. Otherwise, if the data distribution changes through time, the algorithm risks to remember only the data from the current state of the distribution and forget everything else. Continual learning is a sub-field of machine learning that aims to find automatic learning processes to solve non-iid problems. The main challenges of continual learning are two-fold. Firstly, to detect concept-drift in the distribution and secondly to remember what happened before a concept-drift. In this article, we study a specific case of continual learning approaches: \textit{the regularization method}. It consists of finding a smart regularization term that will protect important parameters from being modified to not forget. We show in this article, that in the context of multi-task learning for classification, this process does not learn to discriminate classes from different tasks. We propose theoretical reasoning to prove this shortcoming and illustrate it with examples and experiments with the "MNIST Fellowship" dataset.
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
Cite as: arXiv:1912.03049 [cs.LG]
  (or arXiv:1912.03049v1 [cs.LG] for this version)

Submission history

From: Timothée Lesort [view email]
[v1] Fri, 6 Dec 2019 10:11:18 GMT (395kb,D)
[v2] Fri, 7 Feb 2020 12:10:55 GMT (443kb,D)
[v3] Tue, 8 Dec 2020 17:25:56 GMT (498kb,D)
[v4] Sun, 4 Apr 2021 00:21:23 GMT (754kb,D)

Link back to: arXiv, form interface, contact.