We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:


Current browse context:


Change to browse by:

References & Citations


(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo ScienceWISE logo

Statistics > Machine Learning

Title: Continuation of Nesterov's Smoothing for Regression with Structured Sparsity in High-Dimensional Neuroimaging

Abstract: Predictive models can be used on high-dimensional brain images for diagnosis of a clinical condition. Spatial regularization through structured sparsity offers new perspectives in this context and reduces the risk of overfitting the model while providing interpretable neuroimaging signatures by forcing the solution to adhere to domain-specific constraints. Total Variation (TV) enforces spatial smoothness of the solution while segmenting predictive regions from the background. We consider the problem of minimizing the sum of a smooth convex loss, a non-smooth convex penalty (whose proximal operator is known) and a wide range of possible complex, non-smooth convex structured penalties such as TV or overlapping group Lasso. Existing solvers are either limited in the functions they can minimize or in their practical capacity to scale to high-dimensional imaging data. Nesterov's smoothing technique can be used to minimize a large number of non-smooth convex structured penalties but reasonable precision requires a small smoothing parameter, which slows down the convergence speed. To benefit from the versatility of Nesterov's smoothing technique, we propose a first order continuation algorithm, CONESTA, which automatically generates a sequence of decreasing smoothing parameters. The generated sequence maintains the optimal convergence speed towards any globally desired precision. Our main contributions are: To propose an expression of the duality gap to probe the current distance to the global optimum in order to adapt the smoothing parameter and the convergence speed. We provide a convergence rate, which is an improvement over classical proximal gradient smoothing methods. We demonstrate on both simulated and high-dimensional structural neuroimaging data that CONESTA significantly outperforms many state-of-the-art solvers in regard to convergence speed and precision.
Comments: 11 pages, 6 figures, accepted in IEEE TMI, IEEE Transactions on Medical Imaging 2018
Subjects: Machine Learning (stat.ML)
Cite as: arXiv:1605.09658 [stat.ML]
  (or arXiv:1605.09658v6 [stat.ML] for this version)

Submission history

From: Edouard Duchesnay [view email]
[v1] Tue, 31 May 2016 15:09:13 GMT (1513kb,D)
[v2] Thu, 6 Oct 2016 09:15:58 GMT (1513kb,D)
[v3] Thu, 13 Oct 2016 13:15:38 GMT (1513kb,D)
[v4] Wed, 23 Nov 2016 17:07:19 GMT (1517kb,D)
[v5] Mon, 12 Mar 2018 16:54:38 GMT (2811kb,D)
[v6] Sun, 22 Apr 2018 11:29:40 GMT (2831kb,D)

Link back to: arXiv, form interface, contact.