We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

cs.CL

Change to browse by:

cs

References & Citations

DBLP - CS Bibliography

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo

Computer Science > Computation and Language

Title: Normalization of Different Swedish Dialects Spoken in Finland

Abstract: Our study presents a dialect normalization method for different Finland Swedish dialects covering six regions. We tested 5 different models, and the best model improved the word error rate from 76.45 to 28.58. Contrary to results reported in earlier research on Finnish dialects, we found that training the model with one word at a time gave best results. We believe this is due to the size of the training data available for the model. Our models are accessible as a Python package. The study provides important information about the adaptability of these methods in different contexts, and gives important baselines for further study.
Comments: In Proceedings of the 4th ACM SIGSPATIAL Workshop on Geospatial Humanities (GeoHumanities'20)
Subjects: Computation and Language (cs.CL)
DOI: 10.1145/3423337.3429435
Cite as: arXiv:2012.05318 [cs.CL]
  (or arXiv:2012.05318v1 [cs.CL] for this version)

Submission history

From: Mika Hämäläinen [view email]
[v1] Wed, 9 Dec 2020 20:59:31 GMT (1541kb,D)

Link back to: arXiv, form interface, contact.