We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

cs.LG

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo

Computer Science > Machine Learning

Title: An Instance-Dependent Simulation Framework for Learning with Label Noise

Abstract: We propose a simulation framework for generating instance-dependent noisy labels via a pseudo-labeling paradigm. We show that the distribution of the synthetic noisy labels generated with our framework is closer to human labels compared to independent and class-conditional random flipping. Equipped with controllable label noise, we study the negative impact of noisy labels across a few practical settings to understand when label noise is more problematic. We also benchmark several existing algorithms for learning with noisy labels and compare their behavior on our synthetic datasets and on the datasets with independent random label noise. Additionally, with the availability of annotator information from our simulation framework, we propose a new technique, Label Quality Model (LQM), that leverages annotator features to predict and correct against noisy labels. We show that by adding LQM as a label correction step before applying existing noisy label techniques, we can further improve the models' performance.
Comments: Datasets released at this https URL
Subjects: Machine Learning (cs.LG); Human-Computer Interaction (cs.HC)
Cite as: arXiv:2107.11413 [cs.LG]
  (or arXiv:2107.11413v4 [cs.LG] for this version)

Submission history

From: Dong Yin [view email]
[v1] Fri, 23 Jul 2021 18:53:53 GMT (1551kb,D)
[v2] Sun, 29 Aug 2021 04:24:14 GMT (1551kb,D)
[v3] Tue, 28 Sep 2021 18:26:13 GMT (1552kb,D)
[v4] Sun, 17 Oct 2021 21:20:16 GMT (1552kb,D)

Link back to: arXiv, form interface, contact.