We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:


Current browse context:


Change to browse by:

References & Citations


(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo ScienceWISE logo

Statistics > Methodology

Title: Principal Sub-manifolds

Abstract: We invent a novel method of finding principal components in multivariate data sets that lie on an embedded nonlinear Riemannian manifold within a higher-dimensional space. Our aim is to extend the geometric interpretation of PCA, while being able to capture non-geodesic modes of variation in the data. We introduce the concept of a principal sub-manifold, a manifold passing through the center of the data, and at any point on the manifold extending in the direction of highest variation in the space spanned by the eigenvectors of the local tangent space PCA. Compared to recent work for the case where the sub-manifold is of dimension one \citep{Panaretos2014}--essentially a curve lying on the manifold attempting to capture one-dimensional variation--the current setting is much more general. The principal sub-manifold is therefore an extension of the principal flow, accommodating to capture higher dimensional variation in the data. We show the principal sub-manifold yields the ball spanned by the usual principal components in Euclidean space. By means of examples, we illustrate how to find, use and interpret a principal sub-manifold and we present an application in shape analysis.
Comments: 38 pages, 18 figures
Subjects: Methodology (stat.ME)
Cite as: arXiv:1604.04318 [stat.ME]
  (or arXiv:1604.04318v3 [stat.ME] for this version)

Submission history

From: Zhigang Yao [view email]
[v1] Fri, 15 Apr 2016 00:12:33 GMT (2261kb,D)
[v2] Wed, 21 Sep 2016 06:01:48 GMT (4353kb,D)
[v3] Wed, 26 May 2021 07:12:24 GMT (14363kb,D)

Link back to: arXiv, form interface, contact.