The Microsoft 2017 Conversational Speech Recognition System

Xiong, W.; Wu, L.; Alleva, F.; Droppo, J.; Huang, X.; Stolcke, A.

doi:10.1109/ICASSP.2018.8461870

Full-text links:

Download:

Current browse context:

cs.CL

< prev | next >

new | recent | 1708

Change to browse by:

Computer Science > Computation and Language

Title: The Microsoft 2017 Conversational Speech Recognition System

Authors: W. Xiong, L. Wu, F. Alleva, J. Droppo, X. Huang, A. Stolcke

(Submitted on 21 Aug 2017 (v1), last revised 24 Aug 2017 (this version, v2))

Abstract: We describe the 2017 version of Microsoft's conversational speech recognition system, in which we update our 2016 system with recent developments in neural-network-based acoustic and language modeling to further advance the state of the art on the Switchboard speech recognition task. The system adds a CNN-BLSTM acoustic model to the set of model architectures we combined previously, and includes character-based and dialog session aware LSTM language models in rescoring. For system combination we adopt a two-stage approach, whereby subsets of acoustic models are first combined at the senone/frame level, followed by a word-level voting via confusion networks. We also added a confusion network rescoring step after system combination. The resulting system yields a 5.1\% word error rate on the 2000 Switchboard evaluation set.

Subjects:	Computation and Language (cs.CL)
Journal reference:	Proc. IEEE ICASSP, April 2018, pp. 5934-5938
DOI:	10.1109/ICASSP.2018.8461870
Report number:	Microsoft Technical Report MSR-TR-2017-39
Cite as:	arXiv:1708.06073 [cs.CL]
	(or arXiv:1708.06073v2 [cs.CL] for this version)

Submission history

From: Andreas Stolcke [view email]
[v1] Mon, 21 Aug 2017 03:17:23 GMT (49kb,D)
[v2] Thu, 24 Aug 2017 23:30:37 GMT (50kb,D)

Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)

Link back to: arXiv, form interface, contact.

> cs > arXiv:1708.06073

Download:

Current browse context:

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

Computer Science > Computation and Language

Title: The Microsoft 2017 Conversational Speech Recognition System

Submission history