Title: Atypical lexical abbreviations identification in Russian medical texts

Authors: Anna Berdichevskaia (NUST "MISiS")
Abstract: Abbreviation is a method of word formation that constructs a shortened term from the first letters of the original phrase. Implicit abbreviations frequently cause comprehension difficulties for unprepared readers. In this paper, we propose an efficient ML-based algorithm that identifies abbreviations in Russian texts. The method achieves a ROC AUC score of 0.926 and an F1 score of 0.706, which are competitive with the baselines. Along with the pipeline, we also establish what is, to our knowledge, the first Russian dataset relevant to this task.
Subjects: Computation and Language (cs.CL)
Cite as: arXiv:2206.01987 [cs.CL]
  (or arXiv:2206.01987v1 [cs.CL] for this version)

Submission history

From: Anna Berdichevskaia
[v1] Sat, 4 Jun 2022 13:16:08 GMT (1074kb,D)