Title: Beyond Cats and Dogs: Semi-supervised Classification of fuzzy labels with overclustering

Abstract: A long-standing issue with deep learning is the need for large and consistently labeled datasets. Although current research in semi-supervised learning can decrease the required amount of annotated data by a factor of 10 or even more, this line of research still uses distinct classes like cats and dogs. However, in the real world we often encounter problems where different experts have different opinions, thus producing fuzzy labels. We propose a novel framework for handling semi-supervised classification of such fuzzy labels. Our framework is based on the idea of overclustering to detect substructures in these fuzzy labels. We propose a novel loss to improve the overclustering capability of our framework and show on the common image classification dataset STL-10 that it is faster and has better overclustering performance than previous work. On a real-world plankton dataset, we illustrate the benefit of overclustering for fuzzy labels and show that we beat previous state-of-the-art semi-supervised methods. Moreover, we acquire 5 to 10% more consistent predictions of substructures.
Comments: Reworked version available at arXiv:2110.06630, Published in Sensors 2021 (see DOI link)
Subjects: Computer Vision and Pattern Recognition (cs.CV)
DOI: 10.3390/s21196661
Cite as: arXiv:2012.01768 [cs.CV]
  (or arXiv:2012.01768v2 [cs.CV] for this version)

Submission history

From: Lars Schmarje
[v1] Thu, 3 Dec 2020 08:54:25 GMT (911kb,D)
[v2] Tue, 19 Oct 2021 12:16:16 GMT (911kb,D)
