We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

cs.IT

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo ScienceWISE logo

Computer Science > Information Theory

Title: Does $\ell_p$-minimization outperform $\ell_1$-minimization?

Abstract: In many application areas we are faced with the following question: Can we recover a sparse vector $x_o \in \mathbb{R}^N$ from its undersampled set of noisy observations $y \in \mathbb{R}^n$, $y=A x_o+w$. The last decade has witnessed a surge of algorithms and theoretical results addressing this question. One of the most popular algorithms is the $\ell_p$-regularized least squares (LPLS) given by the following formulation: \[ \hat{x}(\gamma,p )\in \arg\min_x \frac{1}{2}\|y - Ax\|_2^2+\gamma\|x\|_p^p, \] where $p \in [0,1]$. Despite the non-convexity of these problems for $p<1$, they are still appealing because of the following folklores in compressed sensing: (i) $\hat{x}(\gamma,p )$ is closer to $x_o$ than $\hat{x}(\gamma,1)$. (ii) If we employ iterative methods that aim to converge to a local minima of LPLS, then under good initialization these algorithms converge to a solution that is closer to $x_o$ than $\hat{x}(\gamma,1)$. In spite of the existence of plenty of empirical results that support these folklore theorems, the theoretical progress to establish them has been very limited.
This paper aims to study the above folklore theorems and establish their scope of validity. Starting with approximate message passing algorithm as a heuristic method for solving LPLS, we study the impact of initialization on the performance of AMP. Then, we employ the replica analysis to show the connection between the solution of AMP and $\hat{x}(\gamma, p)$ in the asymptotic settings. This enables us to compare the accuracy of $\hat{x}(\gamma,p)$ for $p \in [0,1]$. In particular, we will characterize the phase transition and noise sensitivity of LPLS for every $0\leq p\leq 1$ accurately. Our results in the noiseless setting confirm that LPLS exhibits the same phase transition for every $0\leq p <1$ and this phase transition is much higher than that of LASSO.
Subjects: Information Theory (cs.IT); Statistics Theory (math.ST)
Cite as: arXiv:1501.03704 [cs.IT]
  (or arXiv:1501.03704v2 [cs.IT] for this version)

Submission history

From: Le Zheng [view email]
[v1] Thu, 15 Jan 2015 14:54:10 GMT (822kb,D)
[v2] Fri, 10 Jun 2016 23:25:44 GMT (835kb,D)

Link back to: arXiv, form interface, contact.