We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:


Current browse context:


Change to browse by:


References & Citations

DBLP - CS Bibliography


(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo ScienceWISE logo

Computer Science > Computation and Language

Title: GLEN: General-Purpose Event Detection for Thousands of Types

Abstract: The development of event extraction systems has been hindered by the absence of wide-coverage, large-scale datasets. To make event extraction systems more accessible, we build a general-purpose event detection dataset GLEN, which covers 3,465 different event types, making it over 20x larger in ontology than any current dataset. GLEN is created by utilizing the DWD Overlay, which provides a mapping between Wikidata Qnodes and PropBank rolesets. This enables us to use the abundant existing annotation for PropBank as distant supervision. In addition, we also propose a new multi-stage event detection model specifically designed to handle the large ontology size and partial labels in GLEN. We show that our model exhibits superior performance (~10% F1 gain) compared to both conventional classification baselines and newer definition-based models. Finally, we perform error analysis and show that label noise is still the largest challenge for improving performance.
Comments: The first two authors contributed equally. (15 pages, 11 figures)
Subjects: Computation and Language (cs.CL)
Cite as: arXiv:2303.09093 [cs.CL]
  (or arXiv:2303.09093v2 [cs.CL] for this version)

Submission history

From: Sha Li [view email]
[v1] Thu, 16 Mar 2023 05:36:38 GMT (604kb,D)
[v2] Mon, 20 Mar 2023 20:40:15 GMT (604kb,D)

Link back to: arXiv, form interface, contact.