We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

cs.CL

Change to browse by:

cs

References & Citations

DBLP - CS Bibliography

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo ScienceWISE logo

Computer Science > Computation and Language

Title: WikiGUM: Exhaustive Entity Linking for Wikification in 12 Genres

Abstract: Previous work on Entity Linking has focused on resources targeting non-nested proper named entity mentions, often in data from Wikipedia, i.e. Wikification. In this paper, we present and evaluate WikiGUM, a fully wikified dataset, covering all mentions of named entities, including their non-named and pronominal mentions, as well as mentions nested within other mentions. The dataset covers a broad range of 12 written and spoken genres, most of which have not been included in Entity Linking efforts to date, leading to poor performance by a pretrained SOTA system in our evaluation. The availability of a variety of other annotations for the same data also enables further research on entities in context.
Subjects: Computation and Language (cs.CL)
Cite as: arXiv:2109.07449 [cs.CL]
  (or arXiv:2109.07449v1 [cs.CL] for this version)

Submission history

From: Jessica Lin [view email]
[v1] Wed, 15 Sep 2021 17:35:24 GMT (82kb,D)

Link back to: arXiv, form interface, contact.