Understanding Dimensional Collapse in Contrastive Self-supervised Learning

Jing, Li; Vincent, Pascal; LeCun, Yann; Tian, Yuandong

(Submitted on 18 Oct 2021 (v1), last revised 23 Apr 2022 (this version, v3))

Abstract: Self-supervised visual representation learning aims to learn useful representations without relying on human annotations. Joint embedding approaches are based on maximizing the agreement between embedding vectors from different views of the same image. Various methods have been proposed to solve the collapsing problem, where all embedding vectors collapse to a trivial constant solution. Among these methods, contrastive learning prevents collapse via negative sample pairs. It has been shown that non-contrastive methods suffer from a lesser collapse problem of a different nature: dimensional collapse, whereby the embedding vectors end up spanning a lower-dimensional subspace instead of the entire available embedding space. Here, we show that dimensional collapse also happens in contrastive learning. In this paper, we shed light on the dynamics at play in contrastive learning that lead to dimensional collapse. Inspired by our theory, we propose a novel contrastive learning method, called DirectCLR, which directly optimizes the representation space without relying on an explicit trainable projector. Experiments show that DirectCLR outperforms SimCLR with a trainable linear projector on ImageNet.
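
To make the notion of dimensional collapse concrete, the following is a minimal sketch (not code from the paper) of one common way to check for it: compute the singular values of the covariance matrix of a batch of embedding vectors and count how many are non-negligible. The function names, the relative threshold `rel_tol`, and the toy data are all assumptions made for illustration.

```python
import numpy as np

def covariance_singular_values(embeddings):
    """Singular values (descending) of the covariance matrix of an (n_samples, dim) array."""
    z = np.asarray(embeddings, dtype=np.float64)
    centered = z - z.mean(axis=0, keepdims=True)
    cov = centered.T @ centered / z.shape[0]
    return np.linalg.svd(cov, compute_uv=False)

def effective_dim(singular_values, rel_tol=1e-6):
    """Count singular values above rel_tol times the largest one (hypothetical threshold)."""
    s = np.asarray(singular_values)
    return int(np.sum(s > rel_tol * s[0]))

rng = np.random.default_rng(0)
n, dim, rank = 1000, 16, 4

# Toy "collapsed" embeddings: 16-dimensional vectors confined to a 4-dimensional subspace.
collapsed = rng.standard_normal((n, rank)) @ rng.standard_normal((rank, dim))
# Toy "healthy" embeddings: vectors that use all 16 dimensions.
full = rng.standard_normal((n, dim))

print(effective_dim(covariance_singular_values(collapsed)))  # 4
print(effective_dim(covariance_singular_values(full)))       # 16
```

In this toy setting the collapsed embeddings have only 4 non-negligible singular values out of 16, while the full-rank embeddings have all 16. With embeddings from a trained network the drop is usually less clean, so plotting the sorted spectrum on a log scale is often more informative than a single threshold.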

Comments:	In Proceedings of the 10th International Conference on Learning Representations (ICLR) 2022
Subjects:	Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
Journal reference:	ICLR 2022
Cite as:	arXiv:2110.09348 [cs.CV]
	(or arXiv:2110.09348v3 [cs.CV] for this version)

Submission history

From: Li Jing
[v1] Mon, 18 Oct 2021 14:22:19 GMT (3087kb,D)
[v2] Wed, 2 Feb 2022 19:03:47 GMT (535kb,D)
[v3] Sat, 23 Apr 2022 16:44:20 GMT (542kb,D)
