We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:


Current browse context:


Change to browse by:

References & Citations


(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo ScienceWISE logo

Mathematics > Statistics Theory

Title: Matrix completion with data-dependent missingness probabilities

Abstract: The problem of completing a large matrix with lots of missing entries has received widespread attention in the last couple of decades. Two popular approaches to the matrix completion problem are based on singular value thresholding and nuclear norm minimization. Most of the past works on this subject assume that there is a single number $p$ such that each entry of the matrix is available independently with probability $p$ and missing otherwise. This assumption may not be realistic for many applications. In this work, we replace it with the assumption that the probability that an entry is available is an unknown function $f$ of the entry itself. For example, if the entry is the rating given to a movie by a viewer, then it seems plausible that high value entries have greater probability of being available than low value entries. We propose two new estimators, based on singular value thresholding and nuclear norm minimization, to recover the matrix under this assumption. The estimators involve no tuning parameters, and are shown to be consistent under a low rank assumption. We also provide a consistent estimator of the unknown function $f$.
Comments: 28 pages, 9 figures. To appear in IEEE Trans. Inf. Theory
Subjects: Statistics Theory (math.ST); Information Theory (cs.IT); Probability (math.PR); Methodology (stat.ME)
Cite as: arXiv:2106.02290 [math.ST]
  (or arXiv:2106.02290v3 [math.ST] for this version)

Submission history

From: Sourav Chatterjee [view email]
[v1] Fri, 4 Jun 2021 07:07:14 GMT (369kb,D)
[v2] Sun, 1 Aug 2021 16:28:51 GMT (372kb,D)
[v3] Fri, 22 Apr 2022 07:48:17 GMT (433kb,D)

Link back to: arXiv, form interface, contact.