We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:


Current browse context:


Change to browse by:

References & Citations

DBLP - CS Bibliography


(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo ScienceWISE logo

Computer Science > Computation and Language

Title: User-friendly automatic transcription of low-resource languages: Plugging ESPnet into Elpis

Abstract: This paper reports on progress integrating the speech recognition toolkit ESPnet into Elpis, a web front-end originally designed to provide access to the Kaldi automatic speech recognition toolkit. The goal of this work is to make end-to-end speech recognition models available to language workers via a user-friendly graphical interface. Encouraging results are reported on (i) development of an ESPnet recipe for use in Elpis, with preliminary results on data sets previously used for training acoustic models with the Persephone toolkit along with a new data set that had not previously been used in speech recognition, and (ii) incorporating ESPnet into Elpis along with UI enhancements and a CUDA-supported Dockerfile.
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Signal Processing (eess.SP)
Cite as: arXiv:2101.03027 [cs.CL]
  (or arXiv:2101.03027v2 [cs.CL] for this version)

Submission history

From: Alexis Michaud [view email]
[v1] Tue, 15 Dec 2020 09:06:21 GMT (265kb,D)
[v2] Mon, 22 Feb 2021 07:23:37 GMT (1004kb,D)

Link back to: arXiv, form interface, contact.