Electrical Engineering and Systems Science > Audio and Speech Processing
Title: Fast Contextual Adaptation with Neural Associative Memory for On-Device Personalized Speech Recognition
(Submitted on 5 Oct 2021 (v1), last revised 7 Oct 2021 (this version, v2))
Abstract: Fast contextual adaptation has been shown to be effective in improving Automatic Speech Recognition (ASR) of rare words, and when combined with on-device personalized training, it can yield even better recognition results. However, traditional re-scoring approaches based on an external language model are prone to diverge during personalized training. In this work, we introduce a model-based end-to-end contextual adaptation approach that is decoder-agnostic and amenable to on-device personalization. Our on-device simulation experiments demonstrate that the proposed approach outperforms the traditional re-scoring technique by 12% relative WER and 15.7% entity-mention-specific F1-score in a continuous personalization scenario.
Submission history
From: Tsendsuren Munkhdalai
[v1] Tue, 5 Oct 2021 00:33:09 GMT (237kb,D)
[v2] Thu, 7 Oct 2021 00:12:51 GMT (236kb,D)