We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

cs.SD

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo

Computer Science > Sound

Title: Adaptive Frequency Cepstral Coefficients for Word Mispronunciation Detection

Abstract: Systems based on automatic speech recognition (ASR) technology can provide important functionality in computer assisted language learning applications. This is a young but growing area of research motivated by the large number of students studying foreign languages. Here we propose a Hidden Markov Model (HMM)-based method to detect mispronunciations. Exploiting the specific dialog scripting employed in language learning software, HMMs are trained for different pronunciations. New adaptive features have been developed and obtained through an adaptive warping of the frequency scale prior to computing the cepstral coefficients. The optimization criterion used for the warping function is to maximize separation of two major groups of pronunciations (native and non-native) in terms of classification rate. Experimental results show that the adaptive frequency scale yields a better coefficient representation leading to higher classification rates in comparison with conventional HMMs using Mel-frequency cepstral coefficients.
Comments: 4th International Congress on Image and Signal Processing (CISP) 2011
Subjects: Sound (cs.SD); Computer Vision and Pattern Recognition (cs.CV)
DOI: 10.1109/CISP.2011.6100685
Cite as: arXiv:1602.08132 [cs.SD]
  (or arXiv:1602.08132v1 [cs.SD] for this version)

Submission history

From: Zhenhao Ge [view email]
[v1] Thu, 25 Feb 2016 22:17:31 GMT (285kb)

Link back to: arXiv, form interface, contact.