We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

cs.LG

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo

Computer Science > Machine Learning

Title: A Theory of the Risk for Optimization with Relaxation and its Application to Support Vector Machines

Abstract: In this paper we consider optimization with relaxation, an ample paradigm to make data-driven designs. This approach was previously considered by the same authors of this work in Garatti and Campi (2019), a study that revealed a deep-seated connection between two concepts: risk (probability of not satisfying a new, out-of-sample, constraint) and complexity (according to a definition introduced in paper Garatti and Campi (2019)). This connection was shown to have profound implications in applications because it implied that the risk can be estimated from the complexity, a quantity that can be measured from the data without any knowledge of the data-generation mechanism. In the present work we establish new results. First, we expand the scope of Garatti and Campi (2019) so as to embrace a more general setup that covers various algorithms in machine learning. Then, we study classical support vector methods - including SVM (Support Vector Machine), SVR (Support Vector Regression) and SVDD (Support Vector Data Description) - and derive new results for the ability of these methods to generalize. All results are valid for any finite size of the data set. When the sample size tends to infinity, we establish the unprecedented result that the risk approaches the ratio between the complexity and the cardinality of the data sample, regardless of the value of the complexity.
Comments: this https URL
Subjects: Machine Learning (cs.LG); Systems and Control (eess.SY); Optimization and Control (math.OC); Machine Learning (stat.ML)
Journal reference: Journal of Machine Learning Research 22(288):1-38, 2021
Cite as: arXiv:2004.05839 [cs.LG]
  (or arXiv:2004.05839v4 [cs.LG] for this version)

Submission history

From: Simone Garatti [view email]
[v1] Mon, 13 Apr 2020 09:38:25 GMT (630kb,D)
[v2] Wed, 30 Sep 2020 10:34:10 GMT (1046kb,D)
[v3] Tue, 20 Oct 2020 19:26:04 GMT (1046kb,D)
[v4] Mon, 8 Jan 2024 11:00:50 GMT (751kb,D)

Link back to: arXiv, form interface, contact.