Syntax-guided Neural Module Distillation to Probe Compositionality in Sentence Embeddings

Pandey, Rohan

Full-text links:

Download:

Current browse context:

cs.CL

< prev | next >

new | recent | 2301

Change to browse by:

Computer Science > Computation and Language

Title: Syntax-guided Neural Module Distillation to Probe Compositionality in Sentence Embeddings

Authors: Rohan Pandey

(Submitted on 21 Jan 2023 (v1), last revised 8 Feb 2023 (this version, v2))

Abstract: Past work probing compositionality in sentence embedding models faces issues determining the causal impact of implicit syntax representations. Given a sentence, we construct a neural module net based on its syntax parse and train it end-to-end to approximate the sentence's embedding generated by a transformer model. The distillability of a transformer to a Syntactic NeurAl Module Net (SynNaMoN) then captures whether syntax is a strong causal model of its compositional ability. Furthermore, we address questions about the geometry of semantic composition by specifying individual SynNaMoN modules' internal architecture & linearity. We find differences in the distillability of various sentence embedding models that broadly correlate with their performance, but observe that distillability doesn't considerably vary by model size. We also present preliminary evidence that much syntax-guided composition in sentence embedding models is linear, and that non-linearities may serve primarily to handle non-compositional phrases.

Comments:	EACL 2023 (camera-ready)
Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:2301.08998 [cs.CL]
	(or arXiv:2301.08998v2 [cs.CL] for this version)

Submission history

From: Rohan Pandey [view email]
[v1] Sat, 21 Jan 2023 19:42:02 GMT (302kb,D)
[v2] Wed, 8 Feb 2023 09:10:27 GMT (304kb,D)

Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)

Link back to: arXiv, form interface, contact.

> cs > arXiv:2301.08998

Download:

Current browse context:

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

Computer Science > Computation and Language

Title: Syntax-guided Neural Module Distillation to Probe Compositionality in Sentence Embeddings

Submission history