On Structured Prediction Theory with Calibrated Convex Surrogate Losses

Osokin, Anton; Bach, Francis; Lacoste-Julien, Simon

Full-text links:

Download:

Current browse context:

cs.LG

< prev | next >

new | recent | 1703

Computer Science > Machine Learning

Title: On Structured Prediction Theory with Calibrated Convex Surrogate Losses

Authors: Anton Osokin, Francis Bach, Simon Lacoste-Julien

(Submitted on 7 Mar 2017 (v1), last revised 29 Jan 2018 (this version, v4))

Abstract: We provide novel theoretical insights on structured prediction in the context of efficient convex surrogate loss minimization with consistency guarantees. For any task loss, we construct a convex surrogate that can be optimized via stochastic gradient descent and we prove tight bounds on the so-called "calibration function" relating the excess surrogate risk to the actual risk. In contrast to prior related work, we carefully monitor the effect of the exponential number of classes in the learning guarantees as well as on the optimization complexity. As an interesting consequence, we formalize the intuition that some task losses make learning harder than others, and that the classical 0-1 loss is ill-suited for general structured prediction.

Comments:	Appears in: Advances in Neural Information Processing Systems 30 (NIPS 2017). 30 pages
Subjects:	Machine Learning (cs.LG); Machine Learning (stat.ML)
Cite as:	arXiv:1703.02403 [cs.LG]
	(or arXiv:1703.02403v4 [cs.LG] for this version)

Submission history

From: Anton Osokin [view email]
[v1] Tue, 7 Mar 2017 14:39:15 GMT (101kb,D)
[v2] Thu, 14 Sep 2017 11:16:05 GMT (72kb,D)
[v3] Thu, 16 Nov 2017 13:38:49 GMT (77kb,D)
[v4] Mon, 29 Jan 2018 08:25:28 GMT (77kb,D)

Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)

Link back to: arXiv, form interface, contact.

> cs > arXiv:1703.02403

Download:

Current browse context:

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

Computer Science > Machine Learning

Title: On Structured Prediction Theory with Calibrated Convex Surrogate Losses

Submission history