Title: Using instantaneous frequency and aperiodicity detection to estimate F0 for high-quality speech synthesis

Abstract: This paper introduces a general and flexible framework for F0 and aperiodicity (additive non-periodic component) analysis, intended specifically for high-quality speech synthesis and modification applications. The proposed framework consists of three subsystems: an instantaneous frequency estimator and initial aperiodicity detector, an F0 trajectory tracker, and an F0 refinement and aperiodicity extractor. A preliminary implementation of the proposed framework substantially outperformed existing F0 extractors (by a factor of 10 in terms of RMS F0 estimation error) in its ability to track temporally varying F0 trajectories. The front-end aperiodicity detector consists of a complex-valued wavelet analysis filter with a highly selective temporal and spectral envelope. It uses a new measure that quantifies the deviation from periodicity. This measure is less sensitive to slow FM and AM and correlates closely with the signal-to-noise ratio.
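
As a rough illustration of the first subsystem's underlying idea, the sketch below estimates instantaneous frequency from the phase advance of a complex-valued band-pass filter output. It is not the paper's wavelet or its aperiodicity measure: the Gaussian-envelope (Gabor-like) kernel, the function name `instantaneous_frequency`, and the parameters `fc` and `cycles` are all assumptions chosen for illustration.

```python
import numpy as np

def instantaneous_frequency(x, fs, fc, cycles=4.0):
    """Estimate instantaneous frequency (Hz) of x near centre frequency fc.

    Hypothetical helper: a generic complex Gabor-like filter, not the
    filter design described in the paper.
    """
    # Gaussian envelope whose width scales with the centre period.
    sigma = cycles / (2.0 * np.pi * fc)       # envelope std-dev in seconds
    half = int(np.ceil(4.0 * sigma * fs))     # truncate kernel at +/- 4 sigma
    t = np.arange(-half, half + 1) / fs
    kernel = np.exp(-0.5 * (t / sigma) ** 2) * np.exp(2j * np.pi * fc * t)

    # Complex band-pass output; the kernel passes positive frequencies near fc.
    y = np.convolve(x, kernel, mode="same")

    # Phase advance between adjacent samples, converted from radians to Hz.
    dphi = np.angle(y[1:] * np.conj(y[:-1]))
    return dphi * fs / (2.0 * np.pi)

# Usage: a 120 Hz cosine analysed with a filter centred at 110 Hz.
fs = 8000
time = np.arange(0, 0.5, 1.0 / fs)
signal = np.cos(2.0 * np.pi * 120.0 * time)
f_inst = instantaneous_frequency(signal, fs, fc=110.0)
```

Away from the signal edges, `f_inst` should sit close to the true frequency of the sinusoid even though the filter is centred slightly off it, which is the property that makes instantaneous frequency useful as a starting point for F0 estimation. A full analyzer would run a bank of such filters and then select and refine candidates, which this sketch does not attempt.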
Comments: Accepted for presentation in ISCA workshop SSW9
Subjects: Sound (cs.SD); Audio and Speech Processing (eess.AS); Signal Processing (eess.SP)
Journal reference: 9th ISCA Speech Synthesis Workshop, 2016, pp.221-228
DOI: 10.21437/SSW.2016-36
Cite as: arXiv:1605.07809 [cs.SD]
  (or arXiv:1605.07809v2 [cs.SD] for this version)

Submission history

From: Hideki Kawahara
[v1] Wed, 25 May 2016 10:20:07 GMT (476kb)
[v2] Fri, 22 Jul 2016 20:56:20 GMT (1098kb)
