Elucidating Meta-Structures of Noisy Labels in Semantic Segmentation by Deep Neural Networks

Luo, Yaoru; Liu, Guole; Guo, Yuanhao; Yang, Ge

Full-text links:

Download:

PDF only

Current browse context:

cs.CV

< prev | next >

new | recent | 2205

Change to browse by:

Computer Science > Computer Vision and Pattern Recognition

Title: Elucidating Meta-Structures of Noisy Labels in Semantic Segmentation by Deep Neural Networks

Authors: Yaoru Luo, Guole Liu, Yuanhao Guo, Ge Yang

(Submitted on 30 Apr 2022 (this version), latest version 8 Oct 2022 (v3))

Abstract: The supervised training of deep neural networks (DNNs) by noisy labels has been studied extensively in image classification but much less in image segmentation. So far, our understanding of the learning behavior of DNNs trained by noisy segmentation labels remains limited. In this study, we address this deficiency in both binary segmentation of biological microscopy images and multi-class segmentation of natural images. We classify segmentation labels according to their noise transition matrices (NTM) and compare performance of DNNs trained by different types of labels. When we randomly sample a small fraction (e.g., 10%) or flipping a large fraction (e.g., 90%) of the ground-truth labels to train DNNs, their segmentation performance remains largely the same. This indicates that DNNs learn structures hidden in labels rather than pixel-level labels per se in their supervised training for semantic segmentation. We call these hidden structures "meta-structures". When we use labels with different perturbations to the meta-structures to train DNNs, their performance in feature extraction and segmentation degrades consistently. In contrast, addition of meta-structure information substantially improves performance of an unsupervised model in binary semantic segmentation. We formulate meta-structures mathematically as spatial density distributions and quantify semantic information of different types of labels, which we find to correlate strongly with ranks of their NTM. We show theoretically and experimentally how this formulation explains key observed learning behavior of DNNs.

Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2205.00160 [cs.CV]
	(or arXiv:2205.00160v1 [cs.CV] for this version)

Submission history

From: Yaoru Luo [view email]
[v1] Sat, 30 Apr 2022 04:54:31 GMT (10260kb)
[v2] Mon, 15 Aug 2022 02:15:47 GMT (15486kb)
[v3] Sat, 8 Oct 2022 00:54:27 GMT (17983kb)

Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)

Link back to: arXiv, form interface, contact.

> cs > arXiv:2205.00160v1

Download:

Current browse context:

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

Computer Science > Computer Vision and Pattern Recognition

Title: Elucidating Meta-Structures of Noisy Labels in Semantic Segmentation by Deep Neural Networks

Submission history