Current browse context:
cs.LG
Change to browse by:
References & Citations
Computer Science > Machine Learning
Title: On Structured Prediction Theory with Calibrated Convex Surrogate Losses
(Submitted on 7 Mar 2017 (v1), last revised 29 Jan 2018 (this version, v4))
Abstract: We provide novel theoretical insights on structured prediction in the context of efficient convex surrogate loss minimization with consistency guarantees. For any task loss, we construct a convex surrogate that can be optimized via stochastic gradient descent and we prove tight bounds on the so-called "calibration function" relating the excess surrogate risk to the actual risk. In contrast to prior related work, we carefully monitor the effect of the exponential number of classes in the learning guarantees as well as on the optimization complexity. As an interesting consequence, we formalize the intuition that some task losses make learning harder than others, and that the classical 0-1 loss is ill-suited for general structured prediction.
Submission history
From: Anton Osokin [view email][v1] Tue, 7 Mar 2017 14:39:15 GMT (101kb,D)
[v2] Thu, 14 Sep 2017 11:16:05 GMT (72kb,D)
[v3] Thu, 16 Nov 2017 13:38:49 GMT (77kb,D)
[v4] Mon, 29 Jan 2018 08:25:28 GMT (77kb,D)
Link back to: arXiv, form interface, contact.