We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

cs.CL

Change to browse by:

cs

References & Citations

DBLP - CS Bibliography

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo

Computer Science > Computation and Language

Title: DocRED-FE: A Document-Level Fine-Grained Entity And Relation Extraction Dataset

Abstract: Joint entity and relation extraction (JERE) is one of the most important tasks in information extraction. However, most existing works focus on sentence-level coarse-grained JERE, which have limitations in real-world scenarios. In this paper, we construct a large-scale document-level fine-grained JERE dataset DocRED-FE, which improves DocRED with Fine-Grained Entity Type. Specifically, we redesign a hierarchical entity type schema including 11 coarse-grained types and 119 fine-grained types, and then re-annotate DocRED manually according to this schema. Through comprehensive experiments we find that: (1) DocRED-FE is challenging to existing JERE models; (2) Our fine-grained entity types promote relation classification. We make DocRED-FE with instruction and the code for our baselines publicly available at this https URL
Comments: Accepted by IEEE ICASSP 2023. The first two authors contribute equally
Subjects: Computation and Language (cs.CL)
Cite as: arXiv:2303.11141 [cs.CL]
  (or arXiv:2303.11141v2 [cs.CL] for this version)

Submission history

From: Hongbo Wang [view email]
[v1] Mon, 20 Mar 2023 14:19:58 GMT (705kb,D)
[v2] Tue, 21 Mar 2023 09:03:14 GMT (705kb,D)

Link back to: arXiv, form interface, contact.