We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

cs.CL

Change to browse by:

cs

References & Citations

DBLP - CS Bibliography

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo

Computer Science > Computation and Language

Title: Building an Endangered Language Resource in the Classroom: Universal Dependencies for Kakataibo

Abstract: In this paper, we launch a new Universal Dependencies treebank for an endangered language from Amazonia: Kakataibo, a Panoan language spoken in Peru. We first discuss the collaborative methodology implemented, which proved effective to create a treebank in the context of a Computational Linguistic course for undergraduates. Then, we describe the general details of the treebank and the language-specific considerations implemented for the proposed annotation. We finally conduct some experiments on part-of-speech tagging and syntactic dependency parsing. We focus on monolingual and transfer learning settings, where we study the impact of a Shipibo-Konibo treebank, another Panoan language resource.
Comments: Accepted to LREC 2022
Subjects: Computation and Language (cs.CL)
Cite as: arXiv:2206.10343 [cs.CL]
  (or arXiv:2206.10343v1 [cs.CL] for this version)

Submission history

From: Arturo Oncevay [view email]
[v1] Tue, 21 Jun 2022 12:58:56 GMT (988kb,D)

Link back to: arXiv, form interface, contact.