The CUHK-TUDELFT System for The SLT 2021 Children Speech Recognition Challenge

Ng, Si-Ioi; Liu, Wei; Peng, Zhiyuan; Feng, Siyuan; Huang, Hing-Pang; Scharenborg, Odette; Lee, Tan

Full-text links:

Download:

Current browse context:

eess.AS

< prev | next >

new | recent | 2011

Electrical Engineering and Systems Science > Audio and Speech Processing

Title: The CUHK-TUDELFT System for The SLT 2021 Children Speech Recognition Challenge

Authors: Si-Ioi Ng, Wei Liu, Zhiyuan Peng, Siyuan Feng, Hing-Pang Huang, Odette Scharenborg, Tan Lee

(Submitted on 12 Nov 2020)

Abstract: This technical report describes our submission to the 2021 SLT Children Speech Recognition Challenge (CSRC) Track 1. Our approach combines the use of a joint CTC-attention end-to-end (E2E) speech recognition framework, transfer learning, data augmentation and development of various language models. Procedures of data pre-processing, the background and the course of system development are described. The analysis of the experiment results, as well as the comparison between the E2E and DNN-HMM hybrid system are discussed in detail. Our system achieved a character error rate (CER) of 20.1% in our designated test set, and 23.6% in the official evaluation set, which is placed at 10-th overall.

Comments:	Submitted to 2021 SLT Children Speech Recognition Challenge (CSRC)
Subjects:	Audio and Speech Processing (eess.AS); Sound (cs.SD)
Cite as:	arXiv:2011.06239 [eess.AS]
	(or arXiv:2011.06239v1 [eess.AS] for this version)

Submission history

From: Si-Ioi Ng [view email]
[v1] Thu, 12 Nov 2020 07:31:19 GMT (86kb,D)

Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)

Link back to: arXiv, form interface, contact.

> eess > arXiv:2011.06239

Download:

Current browse context:

Change to browse by:

References & Citations

Bookmark

Electrical Engineering and Systems Science > Audio and Speech Processing

Title: The CUHK-TUDELFT System for The SLT 2021 Children Speech Recognition Challenge

Submission history