LP-SparseMAP: Differentiable Relaxed Optimization for Sparse Structured Prediction

Niculae, Vlad; Martins, André F. T.

(Submitted on 13 Jan 2020 (v1), last revised 5 Aug 2020 (this version, v3))

Abstract: Structured prediction requires manipulating a large number of combinatorial structures, e.g., dependency trees or alignments, as either latent or output variables. Recently, the SparseMAP method has been proposed as a differentiable, sparse alternative to maximum a posteriori (MAP) and marginal inference. SparseMAP returns a combination of a small number of structures, a desirable property in some downstream applications. However, SparseMAP requires a tractable MAP inference oracle. This excludes, e.g., loopy graphical models or factor graphs with logic constraints, which generally require approximate inference. In this paper, we introduce LP-SparseMAP, an extension of SparseMAP that addresses this limitation via a local polytope relaxation. LP-SparseMAP uses the flexible and powerful domain-specific language of factor graphs for defining and backpropagating through arbitrary hidden structure, supporting coarse decompositions, hard logic constraints, and higher-order correlations. We derive the forward and backward algorithms needed for using LP-SparseMAP as a hidden or output layer. Experiments in three structured prediction tasks show benefits compared to SparseMAP and Structured SVM.

Comments:	34 pages, 5 tables, 4 figures. ICML 2020
Subjects:	Machine Learning (cs.LG); Computation and Language (cs.CL); Machine Learning (stat.ML)
Cite as:	arXiv:2001.04437 [cs.LG]
	(or arXiv:2001.04437v3 [cs.LG] for this version)

Submission history

From: Vlad Niculae
[v1] Mon, 13 Jan 2020 18:16:13 GMT (67kb,D)
[v2] Mon, 29 Jun 2020 18:05:12 GMT (80kb,D)
[v3] Wed, 5 Aug 2020 15:36:49 GMT (90kb,D)
