We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

cs.CV

Change to browse by:

cs

References & Citations

DBLP - CS Bibliography

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo

Computer Science > Computer Vision and Pattern Recognition

Title: Do Not Escape From the Manifold: Discovering the Local Coordinates on the Latent Space of GANs

Abstract: The discovery of the disentanglement properties of the latent space in GANs motivated a lot of research to find the semantically meaningful directions on it. In this paper, we suggest that the disentanglement property is closely related to the geometry of the latent space. In this regard, we propose an unsupervised method for finding the semantic-factorizing directions on the intermediate latent space of GANs based on the local geometry. Intuitively, our proposed method, called Local Basis, finds the principal variation of the latent space in the neighborhood of the base latent variable. Experimental results show that the local principal variation corresponds to the semantic factorization and traversing along it provides strong robustness to image traversal. Moreover, we suggest an explanation for the limited success in finding the global traversal directions in the latent space, especially W-space of StyleGAN2. We show that W-space is warped globally by comparing the local geometry, discovered from Local Basis, through the metric on Grassmannian Manifold. The global warpage implies that the latent space is not well-aligned globally and therefore the global traversal directions are bound to show limited success on it.
Comments: 24 pages, 19 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
Journal reference: International Conference on Learning Representations, 2022
Cite as: arXiv:2106.06959 [cs.CV]
  (or arXiv:2106.06959v5 [cs.CV] for this version)

Submission history

From: Jaewoong Choi [view email]
[v1] Sun, 13 Jun 2021 10:29:42 GMT (30940kb,D)
[v2] Tue, 19 Oct 2021 02:08:06 GMT (42416kb,D)
[v3] Fri, 4 Feb 2022 09:58:39 GMT (43511kb,D)
[v4] Sat, 7 May 2022 15:31:36 GMT (43510kb,D)
[v5] Sat, 25 Jun 2022 13:44:07 GMT (43510kb,D)

Link back to: arXiv, form interface, contact.