Automatic measurement of vowel duration via structured prediction

Adi, Yossi; Keshet, Joseph; Cibelli, Emily; Gustafson, Erin; Clopper, Cynthia; Goldrick, Matthew

doi:10.1121/1.4972527

Full-text links:

Download:

Current browse context:

stat.ML

< prev | next >

new | recent | 1610

Statistics > Machine Learning

Title: Automatic measurement of vowel duration via structured prediction

Authors: Yossi Adi, Joseph Keshet, Emily Cibelli, Erin Gustafson, Cynthia Clopper, Matthew Goldrick

(Submitted on 26 Oct 2016)

Abstract: A key barrier to making phonetic studies scalable and replicable is the need to rely on subjective, manual annotation. To help meet this challenge, a machine learning algorithm was developed for automatic measurement of a widely used phonetic measure: vowel duration. Manually-annotated data were used to train a model that takes as input an arbitrary length segment of the acoustic signal containing a single vowel that is preceded and followed by consonants and outputs the duration of the vowel. The model is based on the structured prediction framework. The input signal and a hypothesized set of a vowel's onset and offset are mapped to an abstract vector space by a set of acoustic feature functions. The learning algorithm is trained in this space to minimize the difference in expectations between predicted and manually-measured vowel durations. The trained model can then automatically estimate vowel durations without phonetic or orthographic transcription. Results comparing the model to three sets of manually annotated data suggest it out-performed the current gold standard for duration measurement, an HMM-based forced aligner (which requires orthographic or phonetic transcription as an input).

Subjects:	Machine Learning (stat.ML); Machine Learning (cs.LG); Sound (cs.SD)
DOI:	10.1121/1.4972527
Cite as:	arXiv:1610.08166 [stat.ML]
	(or arXiv:1610.08166v1 [stat.ML] for this version)

Submission history

From: Yossi Adi [view email]
[v1] Wed, 26 Oct 2016 04:50:35 GMT (378kb,D)

Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)

Link back to: arXiv, form interface, contact.

> stat > arXiv:1610.08166

Download:

Current browse context:

Change to browse by:

References & Citations

Bookmark

Statistics > Machine Learning

Title: Automatic measurement of vowel duration via structured prediction

Submission history