Compressing Recurrent Neural Networks Using Hierarchical Tucker Tensor Decomposition

Yin, Miao; Liao, Siyu; Liu, Xiao-Yang; Wang, Xiaodong; Yuan, Bo

(Submitted on 9 May 2020)

Abstract: Recurrent Neural Networks (RNNs) have been widely used in sequence analysis and modeling. However, when processing high-dimensional data, RNNs typically require very large model sizes, which creates a series of deployment challenges. Although state-of-the-art tensor decomposition approaches can provide good model compression performance, these existing methods still suffer from inherent limitations, such as restricted representation capability and insufficient model complexity reduction. To overcome these limitations, in this paper we propose to develop compact RNN models using Hierarchical Tucker (HT) decomposition. HT decomposition brings a strong hierarchical structure to the decomposed RNN models, which is useful for enhancing their representation capability. Meanwhile, HT decomposition provides greater storage and computational cost reduction than the existing tensor decomposition approaches for RNN compression. Our experimental results show that, compared with state-of-the-art compressed RNN models such as TT-LSTM, TR-LSTM and BT-LSTM, our proposed HT-based LSTM (HT-LSTM) consistently achieves simultaneous and significant increases in both compression ratio and test accuracy on different datasets.
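
To give a rough sense of the storage reduction the abstract refers to, the sketch below counts parameters for a dense weight matrix and for a generic Hierarchical Tucker format of the same matrix. It is an illustration under stated assumptions, not the paper's exact layer layout: the matrix is assumed to be reshaped into d modes of size m_i * n_i, the dimension tree is assumed to be binary with a single uniform rank, and the function names, factor sizes, and rank are all hypothetical.

```python
from math import prod


def dense_params(in_factors, out_factors):
    """Parameter count of an uncompressed weight matrix of shape
    (prod(in_factors), prod(out_factors))."""
    return prod(in_factors) * prod(out_factors)


def ht_params(in_factors, out_factors, rank):
    """Parameter count of a generic HT format with one uniform rank.

    Assumptions (not taken from the paper): mode i has size
    in_factors[i] * out_factors[i], the dimension tree is binary,
    and the rank at the root is 1.
    """
    if len(in_factors) != len(out_factors):
        raise ValueError("in_factors and out_factors must have equal length")
    d = len(in_factors)
    if d < 2:
        raise ValueError("need at least two modes")
    # One leaf matrix per mode, of shape (m_i * n_i) x rank.
    leaf = sum(m * n * rank for m, n in zip(in_factors, out_factors))
    # A binary tree with d leaves has d - 1 interior nodes: d - 2 transfer
    # tensors of shape rank x rank x rank, plus a rank x rank root.
    transfer = (d - 2) * rank ** 3 + rank ** 2
    return leaf + transfer


# Hypothetical sizes: a 4096 x 256 matrix factored as (8,8,8,8) x (4,4,4,4).
in_factors, out_factors, rank = (8, 8, 8, 8), (4, 4, 4, 4), 4
dense = dense_params(in_factors, out_factors)
compact = ht_params(in_factors, out_factors, rank)
print(f"dense: {dense}, HT: {compact}, ratio: {dense / compact:.1f}x")
```

The leaf term grows linearly with the number of modes and the transfer term grows with the cube of the rank, so under these assumptions the count stays small as long as the rank is modest. Accuracy at a given rank is an empirical question that this count does not address.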

Subjects:	Machine Learning (cs.LG); Machine Learning (stat.ML)
Cite as:	arXiv:2005.04366 [cs.LG]
	(or arXiv:2005.04366v1 [cs.LG] for this version)

Submission history

From: Miao Yin
[v1] Sat, 9 May 2020 05:15:20 GMT (728kb,D)
