IMaSC -- ICFOSS Malayalam Speech Corpus

Gopinath, Deepa P; K, Thennal D; Nair, Vrinda V; S, Swaraj K; G, Sachin

Full-text links:

Download:

Current browse context:

cs.SD

< prev | next >

new | recent | 2211

Computer Science > Sound

Title: IMaSC -- ICFOSS Malayalam Speech Corpus

Authors: Deepa P Gopinath, Thennal D K, Vrinda V Nair, Swaraj K S, Sachin G

(Submitted on 23 Nov 2022)

Abstract: Modern text-to-speech (TTS) systems use deep learning to synthesize speech increasingly approaching human quality, but they require a database of high quality audio-text sentence pairs for training. Malayalam, the official language of the Indian state of Kerala and spoken by 35+ million people, is a low resource language in terms of available corpora for TTS systems. In this paper, we present IMaSC, a Malayalam text and speech corpora containing approximately 50 hours of recorded speech. With 8 speakers and a total of 34,473 text-audio pairs, IMaSC is larger than every other publicly available alternative. We evaluated the database by using it to train TTS models for each speaker based on a modern deep learning architecture. Via subjective evaluation, we show that our models perform significantly better in terms of naturalness compared to previous studies and publicly available models, with an average mean opinion score of 4.50, indicating that the synthesized speech is close to human quality.

Comments:	18 pages, 8 figures
Subjects:	Sound (cs.SD); Computation and Language (cs.CL); Audio and Speech Processing (eess.AS)
Cite as:	arXiv:2211.12796 [cs.SD]
	(or arXiv:2211.12796v1 [cs.SD] for this version)

Submission history

From: Deepa Gopinath PhD [view email]
[v1] Wed, 23 Nov 2022 09:21:01 GMT (968kb,D)

Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)

Link back to: arXiv, form interface, contact.

> cs > arXiv:2211.12796

Download:

Current browse context:

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

Computer Science > Sound

Title: IMaSC -- ICFOSS Malayalam Speech Corpus

Submission history