### Current browse context:

math.ST

### Change to browse by:

### References & Citations

# Mathematics > Statistics Theory

# Title: Matrix completion with data-dependent missingness probabilities

(Submitted on 4 Jun 2021 (v1), last revised 22 Apr 2022 (this version, v3))

Abstract: The problem of completing a large matrix with lots of missing entries has received widespread attention in the last couple of decades. Two popular approaches to the matrix completion problem are based on singular value thresholding and nuclear norm minimization. Most of the past works on this subject assume that there is a single number $p$ such that each entry of the matrix is available independently with probability $p$ and missing otherwise. This assumption may not be realistic for many applications. In this work, we replace it with the assumption that the probability that an entry is available is an unknown function $f$ of the entry itself. For example, if the entry is the rating given to a movie by a viewer, then it seems plausible that high value entries have greater probability of being available than low value entries. We propose two new estimators, based on singular value thresholding and nuclear norm minimization, to recover the matrix under this assumption. The estimators involve no tuning parameters, and are shown to be consistent under a low rank assumption. We also provide a consistent estimator of the unknown function $f$.

## Submission history

From: Sourav Chatterjee [view email]**[v1]**Fri, 4 Jun 2021 07:07:14 GMT (369kb,D)

**[v2]**Sun, 1 Aug 2021 16:28:51 GMT (372kb,D)

**[v3]**Fri, 22 Apr 2022 07:48:17 GMT (433kb,D)

Link back to: arXiv, form interface, contact.