Learning Low-Rank Approximation for CNNs

Lee, Dongsoo; Kwon, Se Jung; Kim, Byeongwook; Wei, Gu-Yeon

Full-text links:

Download:

Current browse context:

cs.LG

< prev | next >

new | recent | 1905

Computer Science > Machine Learning

Title: Learning Low-Rank Approximation for CNNs

Authors: Dongsoo Lee, Se Jung Kwon, Byeongwook Kim, Gu-Yeon Wei

(Submitted on 24 May 2019)

Abstract: Low-rank approximation is an effective model compression technique to not only reduce parameter storage requirements, but to also reduce computations. For convolutional neural networks (CNNs), however, well-known low-rank approximation methods, such as Tucker or CP decomposition, result in degraded model accuracy because decomposed layers hinder training convergence. In this paper, we propose a new training technique that finds a flat minimum in the view of low-rank approximation without a decomposed structure during training. By preserving the original model structure, 2-dimensional low-rank approximation demanding lowering (such as im2col) is available in our proposed scheme. We show that CNN models can be compressed by low-rank approximation with much higher compression ratio than conventional training methods while maintaining or even enhancing model accuracy. We also discuss various 2-dimensional low-rank approximation techniques for CNNs.

Subjects:	Machine Learning (cs.LG); Machine Learning (stat.ML)
Cite as:	arXiv:1905.10145 [cs.LG]
	(or arXiv:1905.10145v1 [cs.LG] for this version)

Submission history

From: Se Jung Kwon [view email]
[v1] Fri, 24 May 2019 10:56:02 GMT (544kb)

Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)

Link back to: arXiv, form interface, contact.

> cs > arXiv:1905.10145

Download:

Current browse context:

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

Computer Science > Machine Learning

Title: Learning Low-Rank Approximation for CNNs

Submission history