We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

cs.CL

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo

Computer Science > Computation and Language

Title: Number Theory Meets Linguistics: Modelling Noun Pluralisation Across 1497 Languages Using 2-adic Metrics

Abstract: A simple machine learning model of pluralisation as a linear regression problem minimising a p-adic metric substantially outperforms even the most robust of Euclidean-space regressors on languages in the Indo-European, Austronesian, Trans New-Guinea, Sino-Tibetan, Nilo-Saharan, Oto-Meanguean and Atlantic-Congo language families. There is insufficient evidence to support modelling distinct noun declensions as a p-adic neighbourhood even in Indo-European languages.
Comments: Accepted into AACL-IJCNLP 2022
Subjects: Computation and Language (cs.CL); Number Theory (math.NT)
Cite as: arXiv:2211.13124 [cs.CL]
  (or arXiv:2211.13124v1 [cs.CL] for this version)

Submission history

From: Greg Baker [view email]
[v1] Sat, 8 Oct 2022 09:37:43 GMT (1061kb,D)

Link back to: arXiv, form interface, contact.