We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

cs.CL

Change to browse by:

cs

References & Citations

DBLP - CS Bibliography

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo ScienceWISE logo

Computer Science > Computation and Language

Title: Automatic Lexical Simplification for Turkish

Abstract: In this paper, we present the first automatic lexical simplification system for the Turkish language. Recent text simplification efforts rely on manually crafted simplified corpora and comprehensive NLP tools that can analyse the target text both in word and sentence levels. Turkish is a morphologically rich agglutinative language that requires unique considerations such as the proper handling of inflectional cases. Being a low-resource language in terms of available resources and industrial-strength tools, it makes the text simplification task harder to approach. We present a new text simplification pipeline based on pretrained representation model BERT together with morphological features to generate grammatically correct and semantically appropriate word-level simplifications.
Subjects: Computation and Language (cs.CL)
Cite as: arXiv:2201.05878 [cs.CL]
  (or arXiv:2201.05878v2 [cs.CL] for this version)

Submission history

From: Ahmet Yavuz Uluslu [view email]
[v1] Sat, 15 Jan 2022 15:58:44 GMT (646kb)
[v2] Mon, 24 Jan 2022 11:42:40 GMT (809kb,D)

Link back to: arXiv, form interface, contact.