VANI: Very-lightweight Accent-controllable TTS for Native and Non-native speakers with Identity Preservation

Badlani, Rohan; Arora, Akshit; Ghosh, Subhankar; Valle, Rafael; Shih, Kevin J.; Santos, João Felipe; Ginsburg, Boris; Catanzaro, Bryan

(Submitted on 14 Mar 2023)

Abstract: We introduce VANI, a very lightweight, multilingual, accent-controllable speech synthesis system. Our model builds upon the disentanglement strategies proposed in RADMMM and supports explicit control of accent, language, speaker, and fine-grained $F_0$ and energy features for speech synthesis. We use the Indic languages dataset, released for LIMMITS 2023 as part of the ICASSP Signal Processing Grand Challenge, to synthesize speech in 3 different languages. Our model supports transferring the language of a speaker while retaining their voice and the native accent of the target language. We use the large-parameter RADMMM model for Track 1 and the lightweight VANI model for Tracks 2 and 3 of the competition.

Comments:	Presentation accepted at ICASSP 2023
Subjects:	Sound (cs.SD); Machine Learning (cs.LG); Audio and Speech Processing (eess.AS)
Cite as:	arXiv:2303.07578 [cs.SD]
	(or arXiv:2303.07578v1 [cs.SD] for this version)

Submission history

From: Rohan Badlani
[v1] Tue, 14 Mar 2023 01:55:41 GMT (3854kb)
