We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

math.ST

Change to browse by:

References & Citations

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo ScienceWISE logo

Mathematics > Statistics Theory

Title: Successive normalization of rectangular arrays

Abstract: Standard statistical techniques often require transforming data to have mean $0$ and standard deviation $1$. Typically, this process of "standardization" or "normalization" is applied across subjects when each subject produces a single number. High throughput genomic and financial data often come as rectangular arrays where each coordinate in one direction concerns subjects who might have different status (case or control, say), and each coordinate in the other designates "outcome" for a specific feature, for example, "gene," "polymorphic site" or some aspect of financial profile. It may happen, when analyzing data that arrive as a rectangular array, that one requires BOTH the subjects and the features to be "on the same footing." Thus there may be a need to standardize across rows and columns of the rectangular matrix. There arises the question as to how to achieve this double normalization. We propose and investigate the convergence of what seems to us a natural approach to successive normalization which we learned from our colleague Bradley Efron. We also study the implementation of the method on simulated data and also on data that arose from scientific experimentation.
Comments: Published in at this http URL the Annals of Statistics (this http URL) by the Institute of Mathematical Statistics (this http URL). With Corrections
Subjects: Statistics Theory (math.ST)
Journal reference: Annals of Statistics 2010, Vol. 38, No. 3, 1638-1664
DOI: 10.1214/09-AOS743
Report number: IMS-AOS-AOS743
Cite as: arXiv:1010.0520 [math.ST]
  (or arXiv:1010.0520v2 [math.ST] for this version)

Submission history

From: Richard A. Olshen [view email]
[v1] Mon, 4 Oct 2010 09:48:16 GMT (205kb)
[v2] Wed, 11 Dec 2013 12:51:40 GMT (210kb)

Link back to: arXiv, form interface, contact.