Title: A Tight Bound of Hard Thresholding

Authors: Jie Shen, Ping Li
Abstract: This paper is concerned with the hard thresholding technique, which sets all but the $k$ largest-magnitude elements of a vector to zero. We establish a tight bound that quantitatively characterizes the deviation of the thresholded solution from a given signal. Our theoretical result is universal in the sense that it holds for all choices of parameters, and the underlying analysis depends only on fundamental arguments in mathematical optimization. We discuss the implications for two areas of the literature:
Compressed Sensing. Building on this estimate, we connect the restricted isometry property (RIP) to the sparsity parameter $k$ for a large family of hard thresholding-based algorithms, which improves the RIP condition, especially when the true sparsity is unknown. This suggests that many more kinds of sensing matrices, or fewer measurements, are admissible for the data acquisition procedure.
Machine Learning. In large-scale machine learning, a significant yet challenging problem is producing sparse solutions in the online setting. In contrast to prior works that used $\ell_1$ relaxation to promote sparsity, we present a novel algorithm that performs hard thresholding in each iteration to ensure such parsimonious solutions. Equipped with the bound developed for hard thresholding, we prove global linear convergence for a number of prevalent statistical models under mild assumptions, even though the problem is non-convex.
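
The hard thresholding operation described in the abstract can be illustrated with a minimal NumPy sketch. The function name `hard_threshold`, the synthetic signal, and the noise level are assumptions made for illustration and do not come from the paper. The sketch compares the deviation of the thresholded vector from a $k$-sparse signal `x` against the deviation of the unthresholded vector `b`. The factor of 2 checked here is only the classical triangle-inequality bound, not the tight bound the paper establishes.

```python
import numpy as np

def hard_threshold(b, k):
    """Keep the k largest-magnitude entries of b and set the rest to zero.

    Ties between equal magnitudes are broken arbitrarily.
    """
    b = np.asarray(b, dtype=float)
    out = np.zeros_like(b)
    if k <= 0:
        return out
    if k >= b.size:
        return b.copy()
    # Indices of the k entries with the largest absolute value.
    keep = np.argpartition(np.abs(b), -k)[-k:]
    out[keep] = b[keep]
    return out

# Hypothetical example: a k-sparse signal x observed through small dense noise.
rng = np.random.default_rng(0)
d, k = 20, 5
x = np.zeros(d)
x[rng.choice(d, size=k, replace=False)] = rng.normal(size=k)
b = x + 0.1 * rng.normal(size=d)

# Ratio of the deviation after thresholding to the deviation before it.
# Because hard_threshold(b, k) is a best k-term approximation of b and x is
# k-sparse, the triangle inequality guarantees this ratio is at most 2.
ratio = np.linalg.norm(hard_threshold(b, k) - x) / np.linalg.norm(b - x)
print(f"deviation ratio: {ratio:.3f}")
```

Using `np.argpartition` rather than a full sort keeps the selection step linear in the vector length on average, which matters when thresholding is applied at every iteration.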
Comments: V1 was submitted to COLT 2016. V2 fixes minor flaws, adds extra experiments and discusses time complexity
Subjects: Machine Learning (stat.ML); Information Theory (cs.IT); Machine Learning (cs.LG); Numerical Analysis (math.NA); Optimization and Control (math.OC)
Cite as: arXiv:1605.01656 [stat.ML]
  (or arXiv:1605.01656v2 [stat.ML] for this version)

Submission history

From: Jie Shen
[v1] Thu, 5 May 2016 17:10:34 GMT (47kb)
[v2] Sun, 15 Oct 2017 03:04:09 GMT (100kb)
[v3] Thu, 28 Jun 2018 17:58:11 GMT (92kb)
