Current browse context:
stat.ML
Change to browse by:
References & Citations
Statistics > Machine Learning
Title: Disentangling Visual Embeddings with Minimal Distributional Assumptions
(Submitted on 28 Jun 2022 (v1), revised 28 Oct 2022 (this version, v2), latest version 6 Jun 2023 (v5))
Abstract: Interest in understanding and factorizing embedding spaces learned by deep encoders is growing. Concept discovery methods search the embedding spaces for interpretable latent components like object shape or color and disentangle them into individual axes in the embedding space. Yet, the applicability of modern disentanglement learning techniques or independent component analysis (ICA) is limited when it comes to vision tasks: They either require training a model of the complex image-generating process or their rigid stochastic independence assumptions on the component distribution are violated in practice. In this work, we identify components in encoder embedding spaces without distributional assumptions and without training a generator. Instead, we utilize functional compositionality properties of image-generating processes. We derive two novel post-hoc component discovery methods and prove theoretical identifiability guarantees. We study them in realistic visual disentanglement tasks with correlated components and violated functional assumptions. Our approaches stably maintain superior performance against 300+ state-of-the-art disentanglement and component analysis models.
Submission history
From: Tobias Leemann [view email][v1] Tue, 28 Jun 2022 10:21:17 GMT (6061kb,D)
[v2] Fri, 28 Oct 2022 11:25:20 GMT (7339kb,D)
[v3] Tue, 21 Feb 2023 13:55:22 GMT (7998kb,D)
[v4] Thu, 25 May 2023 16:10:42 GMT (7998kb,D)
[v5] Tue, 6 Jun 2023 07:01:53 GMT (8061kb,D)
Link back to: arXiv, form interface, contact.