References & Citations
Computer Science > Computation and Language
Title: Infusing Finetuning with Semantic Dependencies
(Submitted on 10 Dec 2020 (v1), last revised 16 Dec 2021 (this version, v5))
Abstract: For natural language processing systems, two kinds of evidence support the use of text representations from neural language models "pretrained" on large unannotated corpora: performance on application-inspired benchmarks (Peters et al., 2018, inter alia), and the emergence of syntactic abstractions in those representations (Tenney et al., 2019, inter alia). On the other hand, the lack of grounded supervision calls into question how well these representations can ever capture meaning (Bender and Koller, 2020). We apply novel probes to recent language models -- specifically focusing on predicate-argument structure as operationalized by semantic dependencies (Ivanova et al., 2012) -- and find that, unlike syntax, semantics is not brought to the surface by today's pretrained models. We then use convolutional graph encoders to explicitly incorporate semantic parses into task-specific finetuning, yielding benefits to natural language understanding (NLU) tasks in the GLUE benchmark. This approach demonstrates the potential for general-purpose (rather than task-specific) linguistic supervision, above and beyond conventional pretraining and finetuning. Several diagnostics help to localize the benefits of our approach.
Submission history
From: Zhaofeng Wu [view email][v1] Thu, 10 Dec 2020 01:27:24 GMT (3603kb,D)
[v2] Sun, 20 Dec 2020 19:56:30 GMT (3603kb,D)
[v3] Mon, 8 Feb 2021 07:40:54 GMT (3605kb,D)
[v4] Tue, 1 Jun 2021 19:09:55 GMT (3603kb,D)
[v5] Thu, 16 Dec 2021 03:43:48 GMT (3605kb,D)
Link back to: arXiv, form interface, contact.