DisturbLabel: Regularizing CNN on the Loss Layer

Xie, Lingxi; Wang, Jingdong; Wei, Zhen; Wang, Meng; Tian, Qi

(Submitted on 30 Apr 2016)

Abstract: For a long time, we have been combating over-fitting in the CNN training process with model regularization, including weight decay, model averaging, data augmentation, etc. In this paper, we present DisturbLabel, an extremely simple algorithm which randomly replaces a portion of the labels with incorrect values in each iteration. Although it seems weird to intentionally generate incorrect training labels, we show that DisturbLabel prevents the network training from over-fitting by implicitly averaging over exponentially many networks which are trained with different label sets. To the best of our knowledge, DisturbLabel is the first work which adds noise on the loss layer. Meanwhile, DisturbLabel cooperates well with Dropout to provide complementary regularization functions. Experiments demonstrate competitive recognition results on several popular image recognition datasets.
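
The label-replacement step described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the function name `disturb_labels` and the parameters `alpha` (probability of disturbing a label) and `num_classes` are hypothetical, and the sketch assumes that a disturbed label is redrawn uniformly from all classes, so it can occasionally coincide with the original label.

```python
import random


def disturb_labels(labels, num_classes, alpha, rng=None):
    """Return a copy of `labels` in which each entry is, with probability
    `alpha`, replaced by a class index drawn uniformly from all classes.

    Assumption: uniform resampling over all `num_classes` classes; the
    redrawn label may therefore equal the original one by chance.
    """
    rng = rng or random.Random()
    disturbed = []
    for y in labels:
        if rng.random() < alpha:
            # Disturb this sample: draw a fresh label at random.
            disturbed.append(rng.randrange(num_classes))
        else:
            # Keep the ground-truth label unchanged.
            disturbed.append(y)
    return disturbed


# Hypothetical usage: disturb the labels of one mini-batch before the loss
# is computed, and repeat with fresh randomness in every iteration.
batch_labels = [3, 1, 4, 1, 5]
noisy_labels = disturb_labels(batch_labels, num_classes=10, alpha=0.1)
```

Because the noise is redrawn in every iteration, each mini-batch is trained against a slightly different label set, which is the mechanism the abstract relates to averaging over many networks.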

Comments:	To appear in CVPR 2016 (10 pages, 10 figures)
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:1605.00055 [cs.CV]
	(or arXiv:1605.00055v1 [cs.CV] for this version)

Submission history

From: Lingxi Xie
[v1] Sat, 30 Apr 2016 02:44:48 GMT (365kb,D)
