Learning Compact Recurrent Neural Networks

Lu, Zhiyun; Sindhwani, Vikas; Sainath, Tara N.

Full-text links:

Download:

Current browse context:

cs.NE

< prev | next >

new | recent | 1604

Computer Science > Machine Learning

Title: Learning Compact Recurrent Neural Networks

Authors: Zhiyun Lu, Vikas Sindhwani, Tara N. Sainath

(Submitted on 9 Apr 2016)

Abstract: Recurrent neural networks (RNNs), including long short-term memory (LSTM) RNNs, have produced state-of-the-art results on a variety of speech recognition tasks. However, these models are often too large in size for deployment on mobile devices with memory and latency constraints. In this work, we study mechanisms for learning compact RNNs and LSTMs via low-rank factorizations and parameter sharing schemes. Our goal is to investigate redundancies in recurrent architectures where compression can be admitted without losing performance. A hybrid strategy of using structured matrices in the bottom layers and shared low-rank factors on the top layers is found to be particularly effective, reducing the parameters of a standard LSTM by 75%, at a small cost of 0.3% increase in WER, on a 2,000-hr English Voice Search task.

Subjects:	Machine Learning (cs.LG); Computation and Language (cs.CL); Neural and Evolutionary Computing (cs.NE)
Cite as:	arXiv:1604.02594 [cs.LG]
	(or arXiv:1604.02594v1 [cs.LG] for this version)

Submission history

From: Zhiyun Lu [view email]
[v1] Sat, 9 Apr 2016 19:09:22 GMT (106kb)

Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)

Link back to: arXiv, form interface, contact.

> cs > arXiv:1604.02594

Download:

Current browse context:

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

Computer Science > Machine Learning

Title: Learning Compact Recurrent Neural Networks

Submission history