Chain of Compression: A Systematic Approach to Combinationally Compress Convolutional Neural Networks

Shen, Yingtao; Sun, Minqing; Zhao, Jie; Zou, An

Full-text links:

Download:

Current browse context:

cs.LG

< prev | next >

new | recent | 2403

Computer Science > Machine Learning

Title: Chain of Compression: A Systematic Approach to Combinationally Compress Convolutional Neural Networks

Authors: Yingtao Shen, Minqing Sun, Jie Zhao, An Zou

(Submitted on 26 Mar 2024)

Abstract: Convolutional neural networks (CNNs) have achieved significant popularity, but their computational and memory intensity poses challenges for resource-constrained computing systems, particularly with the prerequisite of real-time performance. To release this burden, model compression has become an important research focus. Many approaches like quantization, pruning, early exit, and knowledge distillation have demonstrated the effect of reducing redundancy in neural networks. Upon closer examination, it becomes apparent that each approach capitalizes on its unique features to compress the neural network, and they can also exhibit complementary behavior when combined. To explore the interactions and reap the benefits from the complementary features, we propose the Chain of Compression, which works on the combinational sequence to apply these common techniques to compress the neural network. Validated on the image-based regression and classification networks across different data sets, our proposed Chain of Compression can significantly compress the computation cost by 100-1000 times with ignorable accuracy loss compared with the baseline model.

Comments:	10 pages, 15 figures
Subjects:	Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Neural and Evolutionary Computing (cs.NE)
Cite as:	arXiv:2403.17447 [cs.LG]
	(or arXiv:2403.17447v1 [cs.LG] for this version)

Submission history

From: Yingtao Shen [view email]
[v1] Tue, 26 Mar 2024 07:26:00 GMT (436kb,D)

Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)

Link back to: arXiv, form interface, contact.

> cs > arXiv:2403.17447

Download:

Current browse context:

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

Computer Science > Machine Learning

Title: Chain of Compression: A Systematic Approach to Combinationally Compress Convolutional Neural Networks

Submission history