Entailment Semantics Can Be Extracted from an Ideal Language Model

Merrill, William; Warstadt, Alex; Linzen, Tal

(Submitted on 26 Sep 2022 (v1), last revised 8 Jan 2024 (this version, v3))

Abstract: Language models are often trained on text alone, without additional grounding. There is debate as to how much of natural language semantics can be inferred from such a procedure. We prove that entailment judgments between sentences can be extracted from an ideal language model that has perfectly learned its target distribution, assuming the training sentences are generated by Gricean agents, i.e., agents who follow fundamental principles of communication from the linguistic theory of pragmatics. We also show entailment judgments can be decoded from the predictions of a language model trained on such Gricean data. Our results reveal a pathway for understanding the semantic information encoded in unlabeled linguistic data and a potential framework for extracting semantics from language models.

Comments:	Accepted at CoNLL 2022. Updated Dec 4, 2023 and Jan 8, 2024 with erratum
Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:2209.12407 [cs.CL]
	(or arXiv:2209.12407v3 [cs.CL] for this version)

Submission history

From: William Merrill
[v1] Mon, 26 Sep 2022 04:16:02 GMT (110kb,D)
[v2] Wed, 6 Dec 2023 15:36:41 GMT (125kb,D)
[v3] Mon, 8 Jan 2024 22:01:26 GMT (125kb,D)
