Current browse context:
stat.ML
Change to browse by:
References & Citations
Computer Science > Machine Learning
Title: Stochastic Function Norm Regularization of Deep Networks
(Submitted on 30 May 2016 (v1), last revised 30 Aug 2019 (this version, v3))
Abstract: Deep neural networks have had an enormous impact on image analysis. State-of-the-art training methods, based on weight decay and DropOut, result in impressive performance when a very large training set is available. However, they tend to have large problems overfitting to small data sets. Indeed, the available regularization methods deal with the complexity of the network function only indirectly. In this paper, we study the feasibility of directly using the $L_2$ function norm for regularization. Two methods to integrate this new regularization in the stochastic backpropagation are proposed. Moreover, the convergence of these new algorithms is studied. We finally show that they outperform the state-of-the-art methods in the low sample regime on benchmark datasets (MNIST and CIFAR10). The obtained results demonstrate very clear improvement, especially in the context of small sample regimes with data laying in a low dimensional manifold. Source code of the method can be found at \url{this https URL}.
Submission history
From: Matthew Blaschko [view email][v1] Mon, 30 May 2016 01:49:18 GMT (178kb,D)
[v2] Wed, 7 Dec 2016 14:14:30 GMT (189kb,D)
[v3] Fri, 30 Aug 2019 14:38:32 GMT (292kb,D)
Link back to: arXiv, form interface, contact.