We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

cs.CL

Change to browse by:

cs

References & Citations

DBLP - CS Bibliography

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo

Computer Science > Computation and Language

Title: For the Purpose of Curry: A UD Treebank for Ashokan Prakrit

Abstract: We present the first linguistically annotated treebank of Ashokan Prakrit, an early Middle Indo-Aryan dialect continuum attested through Emperor Ashoka Maurya's 3rd century BCE rock and pillar edicts. For annotation, we used the multilingual Universal Dependencies (UD) formalism, following recent UD work on Sanskrit and other Indo-Aryan languages. We touch on some interesting linguistic features that posed issues in annotation: regnal names and other nominal compounds, "proto-ergative" participial constructions, and possible grammaticalizations evidenced by sandhi (phonological assimilation across morpheme boundaries). Eventually, we plan for a complete annotation of all attested Ashokan texts, towards the larger goals of improving UD coverage of different diachronic stages of Indo-Aryan and studying language change in Indo-Aryan using computational methods.
Comments: To be presented at Universal Dependencies Workshop 2021 (UDW 2021)
Subjects: Computation and Language (cs.CL)
MSC classes: 68T50
ACM classes: I.2.7
Cite as: arXiv:2111.12783 [cs.CL]
  (or arXiv:2111.12783v2 [cs.CL] for this version)

Submission history

From: Aryaman Arora [view email]
[v1] Wed, 24 Nov 2021 20:30:09 GMT (5828kb,D)
[v2] Sat, 11 Dec 2021 19:48:44 GMT (9498kb,D)

Link back to: arXiv, form interface, contact.