We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

cs.CL

Change to browse by:

cs

References & Citations

DBLP - CS Bibliography

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo

Computer Science > Computation and Language

Title: NUBES: A Corpus of Negation and Uncertainty in Spanish Clinical Texts

Abstract: This paper introduces the first version of the NUBes corpus (Negation and Uncertainty annotations in Biomedical texts in Spanish). The corpus is part of an on-going research and currently consists of 29,682 sentences obtained from anonymised health records annotated with negation and uncertainty. The article includes an exhaustive comparison with similar corpora in Spanish, and presents the main annotation and design decisions. Additionally, we perform preliminary experiments using deep learning algorithms to validate the annotated dataset. As far as we know, NUBes is the largest publicly available corpus for negation in Spanish and the first that also incorporates the annotation of speculation cues, scopes, and events.
Comments: Accepted at the Twelfth International Conference on Language Resources and Evaluation (LREC 2020)
Subjects: Computation and Language (cs.CL)
Cite as: arXiv:2004.01092 [cs.CL]
  (or arXiv:2004.01092v1 [cs.CL] for this version)

Submission history

From: Naiara Pérez Miguel [view email]
[v1] Thu, 2 Apr 2020 15:51:31 GMT (35kb)

Link back to: arXiv, form interface, contact.