Statistical Mechanics of High-Dimensional Inference

Advani, Madhu; Ganguli, Surya

doi:10.1103/PhysRevX.6.031034

Full-text links:

Download:

Current browse context:

stat.ML

< prev | next >

new | recent | 1601

Statistics > Machine Learning

Title: Statistical Mechanics of High-Dimensional Inference

Authors: Madhu Advani, Surya Ganguli

(Submitted on 18 Jan 2016 (v1), last revised 22 Feb 2016 (this version, v2))

Abstract: To model modern large-scale datasets, we need efficient algorithms to infer a set of $P$ unknown model parameters from $N$ noisy measurements. What are fundamental limits on the accuracy of parameter inference, given finite signal-to-noise ratios, limited measurements, prior information, and computational tractability requirements? How can we combine prior information with measurements to achieve these limits? Classical statistics gives incisive answers to these questions as the measurement density $\alpha = \frac{N}{P}\rightarrow \infty$. However, these classical results are not relevant to modern high-dimensional inference problems, which instead occur at finite $\alpha$. We formulate and analyze high-dimensional inference as a problem in the statistical physics of quenched disorder. Our analysis uncovers fundamental limits on the accuracy of inference in high dimensions, and reveals that widely cherished inference algorithms like maximum likelihood (ML) and maximum-a posteriori (MAP) inference cannot achieve these limits. We further find optimal, computationally tractable algorithms that can achieve these limits. Intriguingly, in high dimensions, these optimal algorithms become computationally simpler than MAP and ML, while still outperforming them. For example, such optimal algorithms can lead to as much as a 20% reduction in the amount of data to achieve the same performance relative to MAP. Moreover, our analysis reveals simple relations between optimal high dimensional inference and low dimensional scalar Bayesian inference, insights into the nature of generalization and predictive power in high dimensions, information theoretic limits on compressed sensing, phase transitions in quadratic inference, and connections to central mathematical objects in convex optimization theory and random matrix theory.

Comments:	See this http URL for supplementary material
Subjects:	Machine Learning (stat.ML); Disordered Systems and Neural Networks (cond-mat.dis-nn); Statistical Mechanics (cond-mat.stat-mech); Statistics Theory (math.ST); Quantitative Methods (q-bio.QM)
Journal reference:	Phys. Rev. X 6, 031034 (2016)
DOI:	10.1103/PhysRevX.6.031034
Cite as:	arXiv:1601.04650 [stat.ML]
	(or arXiv:1601.04650v2 [stat.ML] for this version)

Submission history

From: Madhu Advani [view email]
[v1] Mon, 18 Jan 2016 18:38:35 GMT (236kb,D)
[v2] Mon, 22 Feb 2016 03:10:56 GMT (254kb,D)

Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)

Link back to: arXiv, form interface, contact.

> stat > arXiv:1601.04650

Download:

Current browse context:

Change to browse by:

References & Citations

Bookmark

Statistics > Machine Learning

Title: Statistical Mechanics of High-Dimensional Inference

Submission history