
Title: Optimal whitening and decorrelation

Abstract: Whitening, or sphering, is a common preprocessing step in statistical analysis that transforms random variables into orthogonal ones, i.e. uncorrelated variables with unit variance. However, due to rotational freedom there are infinitely many possible whitening procedures. Consequently, there is a diverse range of sphering methods in use, for example based on principal component analysis (PCA), Cholesky matrix decomposition and zero-phase component analysis (ZCA), among others.
Here we provide an overview of the underlying theory and discuss five natural whitening procedures. Subsequently, we demonstrate that investigating the cross-covariance and the cross-correlation matrix between sphered and original variables allows us to break the rotational invariance and to identify optimal whitening transformations. As a result, we recommend two particular approaches: ZCA-cor whitening to produce sphered variables that are maximally similar to the original variables, and PCA-cor whitening to obtain sphered variables that maximally compress the original variables.
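
As an illustration of the two recommended approaches, the following is a minimal NumPy sketch, not the authors' implementation. It assumes the standard constructions: the covariance matrix is split into variances and a correlation matrix P, ZCA-cor uses the symmetric inverse square root of P, and PCA-cor uses the eigendecomposition of P. The function name `whitening_matrix`, its `method` argument, and the example covariance matrix are hypothetical and chosen only for this sketch.

```python
import numpy as np


def whitening_matrix(sigma, method="ZCA-cor"):
    """Return a matrix W with W @ sigma @ W.T equal to the identity.

    Hypothetical helper for illustration; sigma must be a positive
    definite covariance matrix.
    """
    sigma = np.asarray(sigma, dtype=float)
    inv_sd = 1.0 / np.sqrt(np.diag(sigma))      # 1 / standard deviations
    corr = sigma * np.outer(inv_sd, inv_sd)     # correlation matrix P
    theta, g = np.linalg.eigh(corr)             # P = G diag(theta) G^T

    if method == "ZCA-cor":
        # Symmetric inverse square root of P, applied to standardized variables.
        p_inv_sqrt = g @ np.diag(theta ** -0.5) @ g.T
        return p_inv_sqrt @ np.diag(inv_sd)
    if method == "PCA-cor":
        # Sort eigenvalues in decreasing order; eigenvector signs stay arbitrary.
        order = np.argsort(theta)[::-1]
        theta, g = theta[order], g[:, order]
        return np.diag(theta ** -0.5) @ g.T @ np.diag(inv_sd)
    raise ValueError(f"unknown method: {method!r}")


# Assumed example covariance matrix (variances 4 and 1, correlation 0.6).
sigma = np.array([[4.0, 1.2],
                  [1.2, 1.0]])

for method in ("ZCA-cor", "PCA-cor"):
    w = whitening_matrix(sigma, method)
    # Both transformations should whiten: W sigma W^T is the identity matrix.
    print(method, np.allclose(w @ sigma @ w.T, np.eye(2)))
```

Both matrices whiten the same covariance and differ only by a rotation, which is the rotational freedom the abstract refers to. To whiten centered data stored with one observation per row in an array `x`, one would compute `x @ w.T`.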
Comments: 14 pages, 2 tables
Subjects: Methodology (stat.ME); Machine Learning (stat.ML)
Journal reference: The American Statistician 2018, Vol. 72, No. 4, pp. 309-314
DOI: 10.1080/00031305.2016.1277159
Cite as: arXiv:1512.00809 [stat.ME]
  (or arXiv:1512.00809v4 [stat.ME] for this version)

Submission history

From: Korbinian Strimmer
[v1] Wed, 2 Dec 2015 18:54:53 GMT (12kb)
[v2] Thu, 26 May 2016 16:36:44 GMT (14kb)
[v3] Thu, 15 Dec 2016 11:27:22 GMT (13kb)
[v4] Sun, 18 Dec 2016 00:17:54 GMT (13kb)
