Title: Speech Prediction using an Adaptive Recurrent Neural Network with Application to Packet Loss Concealment

Abstract: This paper proposes a novel approach for speech signal prediction based on a recurrent neural network (RNN). Unlike existing RNN-based predictors, which operate on parametric features and are trained offline on a large collection of such features, the proposed predictor operates directly on speech samples and is trained online on the recent past of the speech signal. Optionally, the network can be pre-trained offline to speed up convergence at start-up. The proposed predictor is a single end-to-end network that captures all sorts of dependencies between samples, and therefore has the potential to outperform classical linear/non-linear and short-term/long-term speech predictor structures. We apply it to the packet loss concealment (PLC) problem and show that it outperforms the standard ITU G.711 Appendix I PLC technique.
Subjects: Audio and Speech Processing (eess.AS)
DOI: 10.1109/ICASSP.2018.8462185
Cite as: arXiv:2111.08116 [eess.AS]
  (or arXiv:2111.08116v1 [eess.AS] for this version)

Submission history

From: Reza Lotfidereshgi
[v1] Mon, 15 Nov 2021 22:33:48 GMT
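
Illustrative sketch (not the paper's method): the abstract describes a predictor that is adapted online on the recent past of the signal and then used to fill in a lost packet. The toy example below shows that general workflow only. It deliberately swaps the RNN for a plain linear predictor adapted with normalized LMS (NLMS), and it uses a synthetic sinusoid rather than speech. The function names, the predictor order, the step size, and the 10 ms packet length are all assumptions chosen for illustration.

```python
import math

def nlms_train(signal, order=8, mu=0.5, eps=1e-8):
    """Adapt linear-prediction weights sample by sample (online) with NLMS.

    This is a simplified stand-in for the online-trained RNN in the abstract.
    """
    w = [0.0] * order
    for n in range(order, len(signal)):
        x = signal[n - order:n][::-1]          # most recent sample first
        pred = sum(wi * xi for wi, xi in zip(w, x))
        err = signal[n] - pred                 # prediction error on the new sample
        norm = eps + sum(xi * xi for xi in x)  # input energy for normalization
        w = [wi + mu * err * xi / norm for wi, xi in zip(w, x)]
    return w

def conceal(history, w, n_lost):
    """Extrapolate n_lost samples by feeding predictions back as inputs."""
    buf = list(history[-len(w):])
    out = []
    for _ in range(n_lost):
        x = buf[::-1]                          # same ordering as in training
        pred = sum(wi * xi for wi, xi in zip(w, x))
        out.append(pred)
        buf = buf[1:] + [pred]                 # slide the window forward
    return out

# Toy signal (assumption): a 200 Hz sinusoid sampled at 8 kHz.
fs = 8000
signal = [math.sin(2 * math.pi * 200 * n / fs) for n in range(2000)]

# Pretend the last 80 samples (10 ms) were lost in transmission.
received, lost = signal[:1920], signal[1920:]

w = nlms_train(received)                       # adapt on the recent past only
concealed = conceal(received, w, len(lost))    # fill the gap

mse = sum((a - b) ** 2 for a, b in zip(concealed, lost)) / len(lost)
print(f"concealment MSE on the toy signal: {mse:.3e}")
```

A linear predictor is enough for a stationary sinusoid; real speech is neither stationary nor purely linear, which is the gap the abstract's sample-domain RNN is meant to address.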