Title: Improved Language Identification Through Cross-Lingual Self-Supervised Learning

Abstract: Language identification greatly impacts the success of downstream tasks such as automatic speech recognition. Recently, self-supervised speech representations learned by wav2vec 2.0 have been shown to be very effective for a range of speech tasks. We extend previous self-supervised work on language identification by experimenting with pre-trained models that were learned on real-world, unconstrained speech in multiple languages rather than on English alone. We show that models pre-trained on many languages perform better and enable language identification systems that need very little labeled data to perform well. Results on a 26-language setup show that, with only 10 minutes of labeled data per language, a cross-lingually pre-trained model can achieve over 89.2% accuracy.
Subjects: Computation and Language (cs.CL); Sound (cs.SD); Audio and Speech Processing (eess.AS)
Cite as: arXiv:2107.04082 [cs.CL]
  (or arXiv:2107.04082v4 [cs.CL] for this version)

Submission history

From: Andros Tjandra
[v1] Thu, 8 Jul 2021 19:37:06 GMT (486kb,D)
[v2] Sat, 24 Jul 2021 03:24:21 GMT (486kb,D)
[v3] Wed, 4 Aug 2021 20:04:24 GMT (486kb,D)
[v4] Mon, 18 Oct 2021 03:49:06 GMT (1218kb,D)