Multi-VAE: Learning Disentangled View-common and View-peculiar Visual Representations for Multi-view Clustering

Xu, Jie; Ren, Yazhou; Tang, Huayi; Pu, Xiaorong; Zhu, Xiaofeng; Zeng, Ming; He, Lifang

Full-text links:

Download:

Source

Current browse context:

cs.CV

< prev | next >

new | recent | 2106

Computer Science > Computer Vision and Pattern Recognition

Title: Multi-VAE: Learning Disentangled View-common and View-peculiar Visual Representations for Multi-view Clustering

Authors: Jie Xu, Yazhou Ren, Huayi Tang, Xiaorong Pu, Xiaofeng Zhu, Ming Zeng, Lifang He

(Submitted on 21 Jun 2021 (v1), last revised 7 Jul 2021 (this version, v2))

Abstract: Multi-view clustering, a long-standing and important research problem, focuses on mining complementary information from diverse views. However, existing works often fuse multiple views' representations or handle clustering in a common feature space, which may result in their entanglement especially for visual representations. To address this issue, we present a novel VAE-based multi-view clustering framework (Multi-VAE) by learning disentangled visual representations. Concretely, we define a view-common variable and multiple view-peculiar variables in the generative model. The prior of view-common variable obeys approximately discrete Gumbel Softmax distribution, which is introduced to extract the common cluster factor of multiple views. Meanwhile, the prior of view-peculiar variable follows continuous Gaussian distribution, which is used to represent each view's peculiar visual factors. By controlling the mutual information capacity to disentangle the view-common and view-peculiar representations, continuous visual information of multiple views can be separated so that their common discrete cluster information can be effectively mined. Experimental results demonstrate that Multi-VAE enjoys the disentangled and explainable visual representations, while obtaining superior clustering performance compared with state-of-the-art methods.

Comments:	Because some important information about the authors hasn't been confirmed, and our manuscript need to be improved and revised. The new version may need a long time to modified, so we decide to withdrew it
Subjects:	Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
Cite as:	arXiv:2106.11232 [cs.CV]
	(or arXiv:2106.11232v2 [cs.CV] for this version)

Submission history

From: Jie Xu [view email]
[v1] Mon, 21 Jun 2021 16:23:28 GMT (5889kb,D)
[v2] Wed, 7 Jul 2021 14:29:15 GMT (0kb,I)

Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)

Link back to: arXiv, form interface, contact.

> cs > arXiv:2106.11232

Download:

Current browse context:

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

Computer Science > Computer Vision and Pattern Recognition

Title: Multi-VAE: Learning Disentangled View-common and View-peculiar Visual Representations for Multi-view Clustering

Submission history