We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

cs.CL

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo

Computer Science > Computation and Language

Title: NLNDE: The Neither-Language-Nor-Domain-Experts' Way of Spanish Medical Document De-Identification

Abstract: Natural language processing has huge potential in the medical domain which recently led to a lot of research in this field. However, a prerequisite of secure processing of medical documents, e.g., patient notes and clinical trials, is the proper de-identification of privacy-sensitive information. In this paper, we describe our NLNDE system, with which we participated in the MEDDOCAN competition, the medical document anonymization task of IberLEF 2019. We address the task of detecting and classifying protected health information from Spanish data as a sequence-labeling problem and investigate different embedding methods for our neural network. Despite dealing in a non-standard language and domain setting, the NLNDE system achieves promising results in the competition.
Comments: Published at IberLEF 2019. Winning System of the MEDDOCAN shared task
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
Cite as: arXiv:2007.01030 [cs.CL]
  (or arXiv:2007.01030v1 [cs.CL] for this version)

Submission history

From: Lukas Lange [view email]
[v1] Thu, 2 Jul 2020 11:30:32 GMT (59kb,D)

Link back to: arXiv, form interface, contact.