Unified Mandarin TTS Front-end Based on Distilled BERT Model

Zhang, Yang; Deng, Liqun; Wang, Yasheng

Full-text links:

Download:

Current browse context:

cs.SD

< prev | next >

new | recent | 2012

Computer Science > Sound

Title: Unified Mandarin TTS Front-end Based on Distilled BERT Model

Authors: Yang Zhang, Liqun Deng, Yasheng Wang

(Submitted on 31 Dec 2020)

Abstract: The front-end module in a typical Mandarin text-to-speech system (TTS) is composed of a long pipeline of text processing components, which requires extensive efforts to build and is prone to large accumulative model size and cascade errors. In this paper, a pre-trained language model (PLM) based model is proposed to simultaneously tackle the two most important tasks in TTS front-end, i.e., prosodic structure prediction (PSP) and grapheme-to-phoneme (G2P) conversion. We use a pre-trained Chinese BERT[1] as the text encoder and employ multi-task learning technique to adapt it to the two TTS front-end tasks. Then, the BERT encoder is distilled into a smaller model by employing a knowledge distillation technique called TinyBERT[2], making the whole model size 25% of that of benchmark pipeline models while maintaining competitive performance on both tasks. With the proposed the methods, we are able to run the whole TTS front-end module in a light and unified manner, which is more friendly to deployment on mobile devices.

Comments:	5 pages
Subjects:	Sound (cs.SD); Computation and Language (cs.CL)
Cite as:	arXiv:2012.15404 [cs.SD]
	(or arXiv:2012.15404v1 [cs.SD] for this version)

Submission history

From: Yang Zhang [view email]
[v1] Thu, 31 Dec 2020 02:34:57 GMT (817kb,D)

Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)

Link back to: arXiv, form interface, contact.

> cs > arXiv:2012.15404

Download:

Current browse context:

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

Computer Science > Sound

Title: Unified Mandarin TTS Front-end Based on Distilled BERT Model

Submission history