Title: Syntactic Substitutability as Unsupervised Dependency Syntax

Abstract: Syntax is a latent hierarchical structure which underpins the robust and compositional nature of human language. In this work, we explore the hypothesis that syntactic dependencies can be represented in language model attention distributions and propose a new method to induce these structures theory-agnostically. Instead of modeling syntactic relations as defined by annotation schemata, we model a more general property implicit in the definition of dependency relations, syntactic substitutability. This property captures the fact that words at either end of a dependency can be substituted with words from the same category. Substitutions can be used to generate a set of syntactically invariant sentences whose representations are then used for parsing. We show that increasing the number of substitutions used improves parsing accuracy on natural data. On long-distance subject-verb agreement constructions, our method achieves 79.5% recall compared to 8.9% using a previous method. Our method also provides improvements when transferred to a different parsing setup, demonstrating that it generalizes.
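The core idea in the abstract, pooling representations over a set of substituted sentences before parsing, can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration rather than the paper's implementation. The attention matrices are hand-written toy numbers, not model output. The function names `average_matrices` and `max_spanning_tree` are hypothetical. Decoding with an undirected maximum spanning tree is one plausible choice that the abstract does not confirm.

```python
from typing import List, Tuple

Matrix = List[List[float]]


def average_matrices(mats: List[Matrix]) -> Matrix:
    """Element-wise mean of equally sized square matrices."""
    n = len(mats[0])
    return [[sum(m[i][j] for m in mats) / len(mats) for j in range(n)]
            for i in range(n)]


def max_spanning_tree(scores: Matrix) -> List[Tuple[int, int]]:
    """Undirected maximum spanning tree (Prim's algorithm).

    Scores are symmetrized by adding attention in both directions.
    This decoding step is an illustrative assumption.
    """
    n = len(scores)
    sym = [[scores[i][j] + scores[j][i] for j in range(n)] for i in range(n)]
    in_tree = {0}
    edges = []
    while len(in_tree) < n:
        # Pick the highest-scoring edge that leaves the current tree.
        best = max(
            ((i, j) for i in in_tree for j in range(n) if j not in in_tree),
            key=lambda e: sym[e[0]][e[1]],
        )
        in_tree.add(best[1])
        edges.append((min(best), max(best)))
    return sorted(edges)


# Toy attention matrices for a three-word sentence such as "the kids run".
# Row i holds the attention that word i pays to each word (made-up values).
original = [[0.2, 0.3, 0.5], [0.3, 0.2, 0.5], [0.4, 0.4, 0.2]]
# Hypothetical matrices for two sentences obtained by swapping in words of
# the same category at the same positions, e.g. "the dogs sleep".
sub1 = [[0.1, 0.8, 0.1], [0.6, 0.1, 0.3], [0.1, 0.7, 0.2]]
sub2 = [[0.1, 0.7, 0.2], [0.5, 0.1, 0.4], [0.2, 0.6, 0.2]]

tree_single = max_spanning_tree(original)
tree_pooled = max_spanning_tree(average_matrices([original, sub1, sub2]))
print(tree_single)  # [(0, 2), (1, 2)]
print(tree_pooled)  # [(0, 1), (1, 2)]
```

In this toy setup the single-sentence matrix links "the" to "run", while the pooled matrix recovers the determiner-noun edge. This only shows how averaging over substitutions can change the induced tree; it says nothing about the paper's reported numbers.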
Subjects: Computation and Language (cs.CL)
Cite as: arXiv:2211.16031 [cs.CL]
  (or arXiv:2211.16031v3 [cs.CL] for this version)

Submission history

From: Jasper Jian
[v1] Tue, 29 Nov 2022 09:01:37 GMT (48kb,D)
[v2] Mon, 22 May 2023 03:08:28 GMT (55kb,D)
[v3] Fri, 20 Oct 2023 18:10:57 GMT (61kb,D)
