Current browse context:
cs.CL
Change to browse by:
References & Citations
Computer Science > Computation and Language
Title: Pseudo-Labeling for Massively Multilingual Speech Recognition
(Submitted on 30 Oct 2021 (v1), last revised 8 Mar 2022 (this version, v3))
Abstract: Semi-supervised learning through pseudo-labeling has become a staple of state-of-the-art monolingual speech recognition systems. In this work, we extend pseudo-labeling to massively multilingual speech recognition with 60 languages. We propose a simple pseudo-labeling recipe that works well even with low-resource languages: train a supervised multilingual model, fine-tune it with semi-supervised learning on a target language, generate pseudo-labels for that language, and train a final model using pseudo-labels for all languages, either from scratch or by fine-tuning. Experiments on the labeled Common Voice and unlabeled VoxPopuli datasets show that our recipe can yield a model with better performance for many languages that also transfers well to LibriSpeech.
Submission history
From: Loren Lugosch [view email][v1] Sat, 30 Oct 2021 03:30:17 GMT (1475kb,D)
[v2] Fri, 4 Mar 2022 21:20:10 GMT (1475kb,D)
[v3] Tue, 8 Mar 2022 14:48:41 GMT (1475kb,D)
Link back to: arXiv, form interface, contact.