We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

stat.ML

Change to browse by:

References & Citations

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo ScienceWISE logo

Statistics > Machine Learning

Title: Variance Based Samples Weighting for Supervised Deep Learning

Authors: Paul Novello (CEA, X, Inria), Gaël Poëtte (CEA), David Lugato (CEA), Pietro Congedo (X, Inria)
Abstract: In the context of supervised learning of a function by a Neural Network (NN), we claim and empirically justify that a NN yields better results when the distribution of the data set focuses on regions where the function to learn is steeper. We first traduce this assumption in a mathematically workable way using Taylor expansion. Then, theoretical derivations allow to construct a methodology that we call Variance Based Samples Weighting (VBSW). VBSW uses local variance of the labels to weight the training points. This methodology is general, scalable, cost effective, and significantly increases the performances of a large class of NNs for various classification and regression tasks on image, text and multivariate data. We highlight its benefits with experiments involving NNs from shallow linear NN to Resnet or Bert.
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG); Statistics Theory (math.ST)
Cite as: arXiv:2101.07561 [stat.ML]
  (or arXiv:2101.07561v2 [stat.ML] for this version)

Submission history

From: Paul Novello [view email]
[v1] Tue, 19 Jan 2021 11:08:40 GMT (1253kb,D)
[v2] Thu, 28 Jan 2021 12:50:28 GMT (1274kb,D)
[v3] Tue, 27 Sep 2022 15:37:42 GMT (4717kb,D)

Link back to: arXiv, form interface, contact.