Title: Joint mean and covariance estimation with unreplicated matrix-variate data

Abstract: It has been proposed that complex populations, such as those that arise in genomics studies, may exhibit dependencies among observations as well as among variables. This gives rise to the challenging problem of analyzing unreplicated high-dimensional data with unknown mean and dependence structures. Matrix-variate approaches that impose various forms of (inverse) covariance sparsity allow flexible dependence structures to be estimated, but cannot be applied directly when the mean and covariance matrices are estimated jointly. We present a practical method that uses generalized least squares and penalized (inverse) covariance estimation to address this challenge. We establish consistency and obtain rates of convergence for estimating the mean parameters and covariance matrices. The advantages of our approach are: (i) dependence graphs and covariance structures can be estimated in the presence of unknown mean structure; (ii) the mean structure is estimated more efficiently when the dependence structure among observations is accounted for; and (iii) inferences about the mean parameters become correctly calibrated. We use simulation studies and an analysis of genomic data from a twin study of ulcerative colitis to illustrate the statistical convergence and the performance of our methods in practical settings. Several lines of evidence show that the test statistics for differential gene expression produced by our methods are correctly calibrated and improve power over conventional methods.
Comments: 79 pages, 15 figures, and 4 tables; to appear in the Journal of the American Statistical Association; Technical Report 540, Department of Statistics, University of Michigan; removed condition (A1') and corrected conditions (A1) and (A2)
Subjects: Machine Learning (stat.ML)
Cite as: arXiv:1611.04208 [stat.ML]
  (or arXiv:1611.04208v4 [stat.ML] for this version)

Submission history

From: Michael Hornstein
[v1] Sun, 13 Nov 2016 23:54:03 GMT (498kb,D)
[v2] Tue, 15 Nov 2016 04:29:42 GMT (505kb,D)
[v3] Sat, 6 Jan 2018 21:00:30 GMT (607kb,D)
[v4] Thu, 7 Jun 2018 15:41:21 GMT (612kb,D)
