Computer Science > Sound
Title: Repeat after me: Self-supervised learning of acoustic-to-articulatory mapping by vocal imitation
(Submitted on 5 Apr 2022)
Abstract: We propose a computational model of speech production that combines three components: a pre-trained neural articulatory synthesizer able to reproduce complex speech stimuli from a limited set of interpretable articulatory parameters, a DNN-based internal forward model that predicts the sensory consequences of articulatory commands, and an internal inverse model, based on a recurrent neural network, that recovers articulatory commands from the acoustic speech input. The forward and inverse models are trained jointly, in a self-supervised way, from raw acoustic-only speech data from different speakers. The imitation simulations are evaluated both objectively and subjectively and show encouraging performance.
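As a rough illustration of how the three components described in the abstract could fit together, here is a minimal NumPy sketch of one pass through the loop. It is not the authors' implementation. The feature sizes, the linear-plus-tanh toy models, the names (`synthesizer`, `inverse_model`, `forward_model`, `imitation_losses`) and the two mean-squared-error terms are all assumptions made for illustration; the abstract specifies neither the architectures nor the training objective.

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical sizes, chosen only for this sketch.
N_ACOUSTIC = 8   # acoustic features per frame
N_ARTIC = 3      # interpretable articulatory parameters
N_HIDDEN = 5     # hidden units in both toy models

# Stand-in for the pre-trained articulatory synthesizer: fixed weights,
# never updated during imitation learning.
W_SYNTH = rng.standard_normal((N_ARTIC, N_ACOUSTIC))


def synthesizer(artic):
    """Frozen toy synthesizer: articulatory commands -> acoustic frames."""
    return np.tanh(artic @ W_SYNTH)


def init_params():
    """Random initial weights for the two trainable toy models."""
    return {
        "inv_in": 0.1 * rng.standard_normal((N_ACOUSTIC, N_HIDDEN)),
        "inv_rec": 0.1 * rng.standard_normal((N_HIDDEN, N_HIDDEN)),
        "inv_out": 0.1 * rng.standard_normal((N_HIDDEN, N_ARTIC)),
        "fwd_1": 0.1 * rng.standard_normal((N_ARTIC, N_HIDDEN)),
        "fwd_2": 0.1 * rng.standard_normal((N_HIDDEN, N_ACOUSTIC)),
    }


def inverse_model(acoustic, p):
    """Toy recurrent inverse model: acoustic frames -> articulatory commands."""
    h = np.zeros(N_HIDDEN)
    commands = []
    for frame in acoustic:
        # Simple Elman-style recurrence over time.
        h = np.tanh(frame @ p["inv_in"] + h @ p["inv_rec"])
        commands.append(h @ p["inv_out"])
    return np.stack(commands)


def forward_model(artic, p):
    """Toy feed-forward model predicting the acoustics of the commands."""
    return np.tanh(artic @ p["fwd_1"]) @ p["fwd_2"]


def imitation_losses(acoustic, p):
    """Compute two assumed self-supervised error terms for one utterance."""
    artic = inverse_model(acoustic, p)    # hear -> infer commands
    predicted = forward_model(artic, p)   # internal prediction
    produced = synthesizer(artic)         # actual (toy) vocal output
    # Assumed term 1: the forward model should match the synthesizer.
    forward_loss = np.mean((predicted - produced) ** 2)
    # Assumed term 2: the predicted imitation should match the input.
    imitation_loss = np.mean((predicted - acoustic) ** 2)
    return artic, forward_loss, imitation_loss


# Usage: one random "utterance" of 20 frames (placeholder data, not speech).
params = init_params()
acoustic = rng.standard_normal((20, N_ACOUSTIC))
artic, fwd_loss, imit_loss = imitation_losses(acoustic, params)
print(artic.shape, float(fwd_loss), float(imit_loss))
```

In this sketch only acoustic input is needed to compute both terms, which is what makes the setup self-supervised: no recorded articulatory data appears anywhere. A training step that minimizes the two terms with respect to `params` is omitted here.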
Submission history
From: Marc-Antoine Georges
[v1] Tue, 5 Apr 2022 15:02:49 GMT (3350kb,D)