Survey on Self-Supervised Multimodal Representation Learning and Foundation Models

Thapa, Sushil

Computer Science > Machine Learning

Authors: Sushil Thapa

(Submitted on 29 Nov 2022)

Abstract: Deep learning has attracted growing interest in recent years. In particular, multimodal learning has shown great promise for solving a wide range of problems in domains such as language, vision, and audio. One promising research direction for improving it further is learning rich and robust low-dimensional representations of the high-dimensional world from the large-scale datasets available on the internet. Because it can avoid the cost of annotating large-scale datasets, self-supervised learning has become the de facto standard for this task in recent years. This paper summarizes some of the landmark research papers that are directly or indirectly responsible for building the foundation of today's multimodal self-supervised representation learning. It traces the development of representation learning over the last few years for each modality and how these modalities were later combined into a multimodal agent.

Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Computer Science and Game Theory (cs.GT)
Cite as:	arXiv:2211.15837 [cs.LG]
	(or arXiv:2211.15837v1 [cs.LG] for this version)

Submission history

From: Sushil Thapa
[v1] Tue, 29 Nov 2022 00:17:43 GMT (3617kb,D)
