Computer Science > Machine Learning
Title: Effective Regularization Through Loss-Function Metalearning
(Submitted on 2 Oct 2020 (v1), last revised 28 Oct 2021 (this version, v2))
Abstract: Evolutionary optimization, such as the TaylorGLO method, can be used to discover novel, customized loss functions for deep neural networks, resulting in improved performance, faster training, and improved data utilization. A likely explanation is that such functions discourage overfitting, leading to effective regularization. This paper demonstrates theoretically that this is indeed the case for TaylorGLO: decomposing the learning rules makes it possible to characterize the training dynamics and to show that the loss functions evolved by TaylorGLO balance a pull toward zero error against a push away from it, which avoids overfitting. They may also automatically take advantage of label smoothing. This analysis leads to an invariant that can be used to make the metalearning process more efficient in practice; the mechanism also results in networks that are robust against adversarial attacks. Loss-function evolution can thus be seen as a well-founded new aspect of metalearning in neural networks.
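The "pull toward zero error" versus "push away from it" can be illustrated with ordinary label smoothing, which the abstract mentions. The following is a minimal sketch only, not the TaylorGLO method or any loss function from the paper: it uses standard cross-entropy against a smoothed target, and the function names and the smoothing strength `alpha` are hypothetical choices made for illustration.

```python
import math

def smoothed_cross_entropy(probs, true_index, alpha=0.1):
    """Cross-entropy between a label-smoothed target and predicted probabilities."""
    n = len(probs)
    loss = 0.0
    for i, p in enumerate(probs):
        # Smoothed target: (1 - alpha) on the true class, alpha spread uniformly.
        one_hot = 1.0 if i == true_index else 0.0
        target = (1.0 - alpha) * one_hot + alpha / n
        loss -= target * math.log(p)
    return loss

def binary_prediction(p_true):
    """Two-class prediction that puts probability p_true on class 0."""
    return [p_true, 1.0 - p_true]

def best_confidence(alpha):
    """Grid-search the true-class confidence in [0.5, 0.999] that minimizes the loss."""
    candidates = [i / 1000 for i in range(500, 1000)]
    return min(
        candidates,
        key=lambda p: smoothed_cross_entropy(binary_prediction(p), 0, alpha),
    )

# Illustrative values: alpha = 0 is plain cross-entropy, alpha = 0.1 is smoothed.
best_plain = best_confidence(alpha=0.0)
best_smoothed = best_confidence(alpha=0.1)
print(best_plain, best_smoothed)
```

Without smoothing, the loss keeps decreasing as the prediction approaches full confidence, so the search ends at the edge of the grid. With smoothing, the minimum sits at a confidence below 1, so the loss penalizes predictions that move too close to zero error.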
Submission history
From: Santiago Gonzalez
[v1] Fri, 2 Oct 2020 05:22:21 GMT (2556kb,D)
[v2] Thu, 28 Oct 2021 04:47:05 GMT (7560kb,D)