We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

math.ST

Change to browse by:

References & Citations

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo

Mathematics > Statistics Theory

Title: Low-rank matrix denoising for count data using unbiased Kullback-Leibler risk estimation

Abstract: Many statistical studies are concerned with the analysis of observations organized in a matrix form whose elements are count data. When these observations are assumed to follow a Poisson or a multinomial distribution, it is of interest to focus on the estimation of either the intensity matrix (Poisson case) or the compositional matrix (multinomial case) when it is assumed to have a low rank structure. In this setting, it is proposed to construct an estimator minimizing the regularized negative log-likelihood by a nuclear norm penalty. Such an approach easily yields a low-rank matrix-valued estimator with positive entries which belongs to the set of row-stochastic matrices in the multinomial case. Then, as a main contribution, a data-driven procedure is constructed to select the regularization parameter in the construction of such estimators by minimizing (approximately) unbiased estimates of the Kullback-Leibler (KL) risk in such models, which generalize Stein's unbiased risk estimation originally proposed for Gaussian data. The evaluation of these quantities is a delicate problem, and novel methods are introduced to obtain accurate numerical approximation of such unbiased estimates. Simulated data are used to validate this way of selecting regularizing parameters for low-rank matrix estimation from count data. For data following a multinomial distribution, the performances of this approach are also compared to $K$-fold cross-validation. Examples from a survey study and metagenomics also illustrate the benefits of this methodology for real data analysis.
Subjects: Statistics Theory (math.ST)
Cite as: arXiv:2001.10391 [math.ST]
  (or arXiv:2001.10391v3 [math.ST] for this version)

Submission history

From: Jeremie Bigot [view email]
[v1] Tue, 28 Jan 2020 15:01:50 GMT (2484kb,D)
[v2] Sun, 22 Nov 2020 07:57:46 GMT (2785kb,D)
[v3] Mon, 3 Jan 2022 11:28:43 GMT (2905kb,D)

Link back to: arXiv, form interface, contact.