We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

eess

Change to browse by:

References & Citations

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo ScienceWISE logo

Computer Science > Computation and Language

Title: Improving Gated Recurrent Unit Based Acoustic Modeling with Batch Normalization and Enlarged Context

Abstract: The use of future contextual information is typically shown to be helpful for acoustic modeling. Recently, we proposed a RNN model called minimal gated recurrent unit with input projection (mGRUIP), in which a context module namely temporal convolution, is specifically designed to model the future context. This model, mGRUIP with context module (mGRUIP-Ctx), has been shown to be able of utilizing the future context effectively, meanwhile with quite low model latency and computation cost. In this paper, we continue to improve mGRUIP-Ctx with two revisions: applying BN methods and enlarging model context. Experimental results on two Mandarin ASR tasks (8400 hours and 60K hours) show that, the revised mGRUIP-Ctx outperform LSTM with a large margin (11% to 38%). It even performs slightly better than a superior BLSTM on the 8400h task, with 33M less parameters and just 290ms model latency.
Comments: ISCSLP 2018
Subjects: Computation and Language (cs.CL); Audio and Speech Processing (eess.AS)
Cite as: arXiv:1811.10169 [cs.CL]
  (or arXiv:1811.10169v1 [cs.CL] for this version)

Submission history

From: Jie Li [view email]
[v1] Mon, 26 Nov 2018 04:00:03 GMT (70kb)

Link back to: arXiv, form interface, contact.