The USTC-NELSLIP Systems for Simultaneous Speech Translation Task at IWSLT 2021

Liu, Dan; Du, Mengge; Li, Xiaoxi; Hu, Yuchen; Dai, Lirong

(Submitted on 1 Jul 2021 (v1), last revised 9 Jul 2021 (this version, v2))

Abstract: This paper describes USTC-NELSLIP's submissions to the IWSLT 2021 Simultaneous Speech Translation task. We propose a novel simultaneous translation model, the Cross Attention Augmented Transducer (CAAT), which extends the conventional RNN-T to sequence-to-sequence tasks without monotonic constraints, e.g., simultaneous translation. Experiments on speech-to-text (S2T) and text-to-text (T2T) simultaneous translation tasks show that CAAT achieves better quality-latency trade-offs than wait-k, one of the previous state-of-the-art approaches. Based on the CAAT architecture and data augmentation, we build S2T and T2T simultaneous translation systems for this evaluation campaign. Compared to last year's optimal systems, our S2T simultaneous translation system improves by an average of 11.3 BLEU across all latency regimes, and our T2T simultaneous translation system improves by an average of 4.6 BLEU.

Subjects:	Computation and Language (cs.CL); Sound (cs.SD); Audio and Speech Processing (eess.AS)
Cite as:	arXiv:2107.00279 [cs.CL]
	(or arXiv:2107.00279v2 [cs.CL] for this version)

Submission history

From: Dan Liu
[v1] Thu, 1 Jul 2021 08:09:00 GMT (7170kb,D)
[v2] Fri, 9 Jul 2021 09:34:51 GMT (7170kb,D)
