What GPT Knows About Who is Who

Yang, Xiaohan; Peynetti, Eduardo; Meerman, Vasco; Tanner, Chris

Full-text links:

Download:

Current browse context:

cs.CL

< prev | next >

new | recent | 2205

Computer Science > Computation and Language

Title: What GPT Knows About Who is Who

Authors: Xiaohan Yang, Eduardo Peynetti, Vasco Meerman, Chris Tanner

(Submitted on 16 May 2022)

Abstract: Coreference resolution -- which is a crucial task for understanding discourse and language at large -- has yet to witness widespread benefits from large language models (LLMs). Moreover, coreference resolution systems largely rely on supervised labels, which are highly expensive and difficult to annotate, thus making it ripe for prompt engineering. In this paper, we introduce a QA-based prompt-engineering method and discern \textit{generative}, pre-trained LLMs' abilities and limitations toward the task of coreference resolution. Our experiments show that GPT-2 and GPT-Neo can return valid answers, but that their capabilities to identify coreferent mentions are limited and prompt-sensitive, leading to inconsistent results.

Comments:	Accepted by ACL 2022 Workshop on Insights from Negative Results in NLP
Subjects:	Computation and Language (cs.CL); Machine Learning (cs.LG)
Cite as:	arXiv:2205.07407 [cs.CL]
	(or arXiv:2205.07407v1 [cs.CL] for this version)

Submission history

From: Xiaohan Yang [view email]
[v1] Mon, 16 May 2022 00:59:37 GMT (952kb,D)

Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)

Link back to: arXiv, form interface, contact.

> cs > arXiv:2205.07407

Download:

Current browse context:

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

Computer Science > Computation and Language

Title: What GPT Knows About Who is Who

Submission history