Current browse context:
cs.LG
Change to browse by:
References & Citations
Computer Science > Machine Learning
Title: Intrinsic Universal Measurements of Non-linear Embeddings
(Submitted on 5 Nov 2018 (v1), last revised 1 Aug 2022 (this version, v2))
Abstract: A basic problem in machine learning is to find a mapping $f$ from a low dimensional latent space $\mathcal{Y}$ to a high dimensional observation space $\mathcal{X}$. Modern tools such as deep neural networks are capable to represent general non-linear mappings. A learner can easily find a mapping which perfectly fits all the observations. However, such a mapping is often not considered as good, because it is not simple enough and can overfit. How to define simplicity? We try to make a formal definition on the amount of information imposed by a non-linear mapping $f$. Intuitively, we measure the local discrepancy between the pullback geometry and the intrinsic geometry of the latent space. Our definition is based on information geometry and is independent of the empirical observations, nor specific parameterizations. We prove its basic properties and discuss relationships with related machine learning methods.
Submission history
From: Ke Sun [view email][v1] Mon, 5 Nov 2018 00:32:28 GMT (36kb,D)
[v2] Mon, 1 Aug 2022 05:11:15 GMT (27kb,D)
Link back to: arXiv, form interface, contact.