Title: A machine transliteration tool between Uzbek alphabets

Abstract: Machine transliteration, as defined in this paper, is the process of automatically converting words written in a source alphabet into a target alphabet of the same language, while preserving their meaning and pronunciation. The main goal of this paper is to present a machine transliteration tool between three common scripts used for the low-resource Uzbek language: the old Cyrillic, the currently official Latin, and the newly announced New Latin alphabets. The tool was created using a combination of rule-based and fine-tuning approaches. It is available as an open-source Python package and as a web-based application that includes a public API. To our knowledge, this is the first machine transliteration tool that supports the newly announced Latin alphabet of the Uzbek language.
Comments: Preprint of a conference paper: The International Conference on Agglutinative Language Technologies as a challenge of Natural Language Processing (ALTNLP)
Subjects: Computation and Language (cs.CL)
Cite as: arXiv:2205.09578 [cs.CL]
  (or arXiv:2205.09578v1 [cs.CL] for this version)
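
The rule-based side of the approach named in the abstract can be illustrated with a minimal sketch. It is not the paper's package or its API: the function names `transliterate_word` and `transliterate` are hypothetical, the mapping table is a small, incomplete subset of the Cyrillic to (currently official) Latin correspondences, and the only contextual rule shown is word-initial "е" becoming "ye". The New Latin alphabet and the fine-tuning step are not modelled here.

```python
import re

# Illustrative, incomplete subset of Uzbek Cyrillic -> Latin correspondences.
# This table is an assumption made for the example, not the tool's rule set.
CYR_TO_LAT = {
    "а": "a", "б": "b", "в": "v", "г": "g", "д": "d", "е": "e",
    "з": "z", "и": "i", "к": "k", "л": "l", "м": "m", "н": "n",
    "о": "o", "р": "r", "с": "s", "т": "t", "у": "u", "х": "x",
    "ч": "ch", "ш": "sh",
    "\u045e": "o\u02bb",  # Cyrillic short u  -> o + modifier turned comma
    "\u049b": "q",        # ka with descender
    "\u0493": "g\u02bb",  # ghe with stroke   -> g + modifier turned comma
    "\u04b3": "h",        # ha with descender
}


def transliterate_word(word: str) -> str:
    """Map one word letter by letter, with a single contextual rule."""
    out = []
    for i, ch in enumerate(word):
        lower = ch.lower()
        if lower == "е" and i == 0:
            lat = "ye"  # contextual rule: word-initial "е" is written "ye"
        else:
            lat = CYR_TO_LAT.get(lower, ch)  # unknown characters pass through
        # Carry the case of the source letter over to the first output letter.
        out.append(lat.capitalize() if ch.isupper() else lat)
    return "".join(out)


def transliterate(text: str) -> str:
    """Transliterate every word in text, leaving punctuation and spacing as-is."""
    return re.sub(r"\w+", lambda m: transliterate_word(m.group()), text)


if __name__ == "__main__":
    print(transliterate("\u049bишло\u049b, мактаб"))  # prints "qishloq, maktab"
```

A table lookup alone cannot cover context-dependent letters, which is why even this small sketch needs a positional rule. A complete rule set would need many more such cases.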

Submission history

From: Elmurod Kuriyozov
[v1] Thu, 19 May 2022 14:19:42 GMT (251kb,D)
