Improving the Performance of Online Neural Transducer Models

Sainath, Tara N.; Chiu, Chung-Cheng; Prabhavalkar, Rohit; Kannan, Anjuli; Wu, Yonghui; Nguyen, Patrick; Chen, Zhifeng

Full-text links:

Download:

Current browse context:

cs.CL

< prev | next >

new | recent | 1712

Computer Science > Computation and Language

Title: Improving the Performance of Online Neural Transducer Models

Authors: Tara N. Sainath, Chung-Cheng Chiu, Rohit Prabhavalkar, Anjuli Kannan, Yonghui Wu, Patrick Nguyen, Zhifeng Chen

(Submitted on 5 Dec 2017)

Abstract: Having a sequence-to-sequence model which can operate in an online fashion is important for streaming applications such as Voice Search. Neural transducer is a streaming sequence-to-sequence model, but has shown a significant degradation in performance compared to non-streaming models such as Listen, Attend and Spell (LAS). In this paper, we present various improvements to NT. Specifically, we look at increasing the window over which NT computes attention, mainly by looking backwards in time so the model still remains online. In addition, we explore initializing a NT model from a LAS-trained model so that it is guided with a better alignment. Finally, we explore including stronger language models such as using wordpiece models, and applying an external LM during the beam search. On a Voice Search task, we find with these improvements we can get NT to match the performance of LAS.

Subjects:	Computation and Language (cs.CL); Audio and Speech Processing (eess.AS); Machine Learning (stat.ML)
Cite as:	arXiv:1712.01807 [cs.CL]
	(or arXiv:1712.01807v1 [cs.CL] for this version)

Submission history

From: Chung-Cheng Chiu [view email]
[v1] Tue, 5 Dec 2017 18:34:56 GMT (133kb,D)

Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)

Link back to: arXiv, form interface, contact.

> cs > arXiv:1712.01807

Download:

Current browse context:

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

Computer Science > Computation and Language

Title: Improving the Performance of Online Neural Transducer Models

Submission history