
Title: SaliencyMix: A Saliency Guided Data Augmentation Strategy for Better Regularization

Abstract: Advanced data augmentation strategies have been widely studied to improve the generalization ability of deep learning models. Regional dropout is one popular solution: it guides the model to focus on less discriminative parts by randomly removing image regions, resulting in improved regularization. However, such information removal is undesirable. Recent strategies instead randomly cut and mix patches, and their labels, among training images, to gain the advantages of regional dropout without leaving any uninformative pixels in the augmented images. We argue that such randomly selected patches may not carry sufficient information about the corresponding object, so mixing the labels according to an uninformative patch can lead the model to learn unexpected feature representations. Therefore, we propose SaliencyMix, which carefully selects a representative image patch with the help of a saliency map and mixes this indicative patch with the target image, thus leading the model to learn more appropriate feature representations. SaliencyMix achieves the best known top-1 error of 21.26% and 20.09% for ResNet-50 and ResNet-101 architectures on ImageNet classification, respectively, and also improves model robustness against adversarial perturbations. Furthermore, models trained with SaliencyMix help to improve object detection performance. Source code is available at this https URL
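
The patch selection and label mixing described in the abstract can be illustrated with a minimal NumPy sketch. This is not the authors' implementation. Every name here (`toy_saliency`, `saliency_mix`, `lam`) is hypothetical, and the saliency function is a crude stand-in (deviation from mean intensity), since the abstract does not say which saliency detector is used. Centering the patch on the peak saliency pixel, sizing it CutMix-style from a mixing ratio, and mixing labels in proportion to patch area are all assumptions made for illustration.

```python
import numpy as np


def toy_saliency(img):
    """Stand-in saliency map (an assumption, not the paper's detector):
    absolute deviation of each pixel's gray level from the image mean."""
    gray = img.mean(axis=2)
    return np.abs(gray - gray.mean())


def saliency_mix(source, target, y_source, y_target, lam, saliency_fn=toy_saliency):
    """Paste a saliency-centered patch of `source` into `target`.

    source, target: float arrays of shape (H, W, C) with identical shapes.
    y_source, y_target: one-hot (or soft) label vectors.
    lam: fraction of the image that should remain from `target` (assumed
         CutMix-style convention), so the patch covers about 1 - lam.
    """
    h, w = source.shape[:2]

    # Locate the most salient pixel of the source image.
    sal = saliency_fn(source)
    cy, cx = np.unravel_index(np.argmax(sal), sal.shape)

    # Patch side lengths chosen so the patch area is about (1 - lam) * H * W.
    ph = int(h * np.sqrt(1.0 - lam))
    pw = int(w * np.sqrt(1.0 - lam))

    # Center the patch on the saliency peak, clipped to the image borders.
    y0, y1 = int(np.clip(cy - ph // 2, 0, h)), int(np.clip(cy + ph // 2, 0, h))
    x0, x1 = int(np.clip(cx - pw // 2, 0, w)), int(np.clip(cx + pw // 2, 0, w))

    # Copy the salient source patch into the same location of the target.
    mixed = target.copy()
    mixed[y0:y1, x0:x1] = source[y0:y1, x0:x1]

    # Mix labels by the fraction of pixels actually replaced (after clipping).
    patch_frac = (y1 - y0) * (x1 - x0) / (h * w)
    y_mixed = (1.0 - patch_frac) * y_target + patch_frac * y_source
    return mixed, y_mixed
```

Recomputing the label weight from the clipped patch area, rather than using `lam` directly, keeps the mixed label consistent with the pixels that were actually replaced when the salient region lies near an image border.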
Comments: 12 pages, 5 figures, 5 tables
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
MSC classes: 68T07
ACM classes: I.2; I.4
Journal reference: International Conference On Learning Representations (ICLR) 2021
Cite as: arXiv:2006.01791 [cs.LG]
  (or arXiv:2006.01791v2 [cs.LG] for this version)

Submission history

From: A F M Shahab Uddin
[v1] Tue, 2 Jun 2020 17:18:34 GMT (1598kb,D)
[v2] Tue, 27 Jul 2021 13:02:06 GMT (1328kb,D)
