Title: FocusNet: Classifying Better by Focusing on Confusing Classes

Abstract: Most classification networks represent categorical labels with one-hot encoding because of its simplicity. However, one-hot encoding may hurt generalization because it neglects inter-class correlations. We observe that, even when a neural network trained with one-hot labels predicts incorrectly, it still attends to the target image region and reveals which classes confuse it. Inspired by this observation, we propose a confusion-focusing mechanism to address the class-confusion issue, implemented as a two-branch network architecture. The baseline branch generates confusing classes, and the FocusNet branch, whose architecture is flexible, discriminates the correct label from these confusing classes. We also introduce a novel focus-picking loss function that improves classification accuracy by encouraging FocusNet to focus on the most confusing classes. Experimental results validate that FocusNet is effective for image classification on common datasets, and that the focus-picking loss can also improve the classification accuracy of existing neural networks.
Comments: Accepted by Pattern Recognition 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV)
Journal reference: Pattern Recognition 2022
DOI: 10.1016/j.patcog.2022.108709
Cite as: arXiv:2110.07307 [cs.CV]
  (or arXiv:2110.07307v3 [cs.CV] for this version)
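
The two-branch idea in the abstract can be illustrated with a minimal sketch. The abstract does not give the definition of the focus-picking loss, so the code below is not the paper's loss; it is a hypothetical stand-in that assumes the "confusing classes" are the baseline branch's top-k scored classes and that the second branch is trained with a cross-entropy restricted to those classes plus the true label. The names `confusing_classes`, `focused_cross_entropy`, and the parameter `k` are assumptions introduced for illustration.

```python
import math


def softmax(logits):
    """Numerically stable softmax over a list of floats."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def confusing_classes(baseline_logits, k):
    """Assumption: the k classes the baseline branch scores highest."""
    order = sorted(range(len(baseline_logits)),
                   key=lambda i: baseline_logits[i], reverse=True)
    return order[:k]


def focused_cross_entropy(focus_logits, baseline_logits, target, k=3):
    """Hypothetical loss: cross-entropy of the second branch, computed
    only over the baseline's top-k classes plus the true class."""
    candidates = confusing_classes(baseline_logits, k)
    if target not in candidates:
        candidates.append(target)  # the true class always competes
    probs = softmax([focus_logits[i] for i in candidates])
    return -math.log(probs[candidates.index(target)])


def full_cross_entropy(logits, target):
    """Standard one-hot cross-entropy over all classes, for comparison."""
    return -math.log(softmax(logits)[target])


# Toy example: the baseline confuses classes 0 and 1; the true class is 1.
baseline_logits = [2.0, 1.9, -3.0, -4.0]
focus_logits = [1.0, 3.0, 0.0, 0.0]
print(confusing_classes(baseline_logits, k=2))
print(focused_cross_entropy(focus_logits, baseline_logits, target=1, k=2))
print(full_cross_entropy(focus_logits, target=1))
```

In this sketch the restricted loss ignores classes the baseline already rules out, so the second branch's gradient signal comes only from the classes that actually compete with the true label. Whether the published loss weights or selects classes this way would need to be checked against the paper itself.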

Submission history

From: Xue Zhang
[v1] Thu, 14 Oct 2021 12:22:59 GMT (9986kb,D)
[v2] Wed, 20 Apr 2022 02:08:08 GMT (10103kb,D)
[v3] Wed, 5 Oct 2022 07:45:38 GMT (10103kb,D)