Title: Optimal whitening and decorrelation

Abstract: Whitening, or sphering, is a common preprocessing step in statistical analysis to transform random variables to orthogonality. However, due to rotational freedom there are infinitely many possible whitening procedures. Consequently, there is a diverse range of sphering methods in use, for example based on principal component analysis, Cholesky matrix decomposition and Mahalanobis transformation, among others.
Here we provide an overview of the underlying theory and discuss five natural whitening procedures. Subsequently, we demonstrate that investigating the cross-covariance and the cross-correlation matrix between sphered and original variables allows us to break the rotational invariance of whitening and to identify optimal transformations. As a result, we recommend two particular whitening approaches: CAT-CAR whitening to produce sphered variables that are maximally similar to the original variables, and PCA-whitening based on the correlation matrix to obtain maximally compressed whitened variables.
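
The rotational freedom described in the abstract can be illustrated numerically. The following is a minimal sketch, not the authors' code: it assumes the standard definition that a matrix W whitens a covariance matrix Sigma when W Sigma W^T = I, and builds the three whitening matrices named above (PCA, Cholesky, Mahalanobis) with NumPy. The function name `whitening_matrices` and the example covariance matrix are hypothetical and chosen only for illustration.

```python
import numpy as np


def whitening_matrices(sigma):
    """Return three matrices W that each satisfy W @ sigma @ W.T = I.

    Assumes sigma is a symmetric positive definite covariance matrix.
    """
    # Eigendecomposition sigma = U diag(lam) U^T
    eigvals, eigvecs = np.linalg.eigh(sigma)
    inv_sqrt = np.diag(1.0 / np.sqrt(eigvals))

    # PCA whitening: rotate onto the eigenvectors, then rescale
    w_pca = inv_sqrt @ eigvecs.T

    # Mahalanobis whitening: the symmetric inverse square root of sigma
    w_mahalanobis = eigvecs @ inv_sqrt @ eigvecs.T

    # Cholesky whitening: factor the precision matrix as L L^T, use W = L^T
    chol = np.linalg.cholesky(np.linalg.inv(sigma))
    w_cholesky = chol.T

    return {"pca": w_pca, "mahalanobis": w_mahalanobis, "cholesky": w_cholesky}


# Hypothetical covariance matrix (symmetric, diagonally dominant, hence positive definite)
sigma = np.array([
    [4.0, 1.2, 0.5],
    [1.2, 2.0, 0.3],
    [0.5, 0.3, 1.0],
])

matrices = whitening_matrices(sigma)
identity = np.eye(sigma.shape[0])

# Every construction whitens sigma, although the matrices themselves differ
all_whiten = all(np.allclose(w @ sigma @ w.T, identity) for w in matrices.values())

# Rotational freedom: for any orthogonal Q, Q @ W is again a whitening matrix
rng = np.random.default_rng(0)
q, _ = np.linalg.qr(rng.standard_normal((3, 3)))
w_rotated = q @ matrices["mahalanobis"]
rotated_whitens = np.allclose(w_rotated @ sigma @ w_rotated.T, identity)

print(all_whiten, rotated_whitens)
```

Because any orthogonal rotation of a whitening matrix is again a whitening matrix, the constraint W Sigma W^T = I alone does not single out one procedure; the abstract proposes criteria based on the cross-covariance and cross-correlation between sphered and original variables to choose among them.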
Comments: 12 pages, 2 tables
Subjects: Methodology (stat.ME); Machine Learning (stat.ML)
Cite as: arXiv:1512.00809 [stat.ME]
  (or arXiv:1512.00809v1 [stat.ME] for this version)

Submission history

From: Korbinian Strimmer
[v1] Wed, 2 Dec 2015 18:54:53 GMT (12kb)
[v2] Thu, 26 May 2016 16:36:44 GMT (14kb)
[v3] Thu, 15 Dec 2016 11:27:22 GMT (13kb)
[v4] Sun, 18 Dec 2016 00:17:54 GMT (13kb)