On the Acceleration of Deep Neural Network Inference using Quantized Compressed Sensing

Oveneke, Meshia Cédric

Full-text links:

Download:

Current browse context:

cs.LG

< prev | next >

new | recent | 2108

Computer Science > Machine Learning

Title: On the Acceleration of Deep Neural Network Inference using Quantized Compressed Sensing

Authors: Meshia Cédric Oveneke

(Submitted on 23 Aug 2021)

Abstract: Accelerating deep neural network (DNN) inference on resource-limited devices is one of the most important barriers to ensuring a wider and more inclusive adoption. To alleviate this, DNN binary quantization for faster convolution and memory savings is one of the most promising strategies despite its serious drop in accuracy. The present paper therefore proposes a novel binary quantization function based on quantized compressed sensing (QCS). Theoretical arguments conjecture that our proposal preserves the practical benefits of standard methods, while reducing the quantization error and the resulting drop in accuracy.

Comments:	3 pages, no figures, paper accepted at Black In AI at the 34th Conference on Neural Information Processing Systems (NeurIPS 2020), Vancouver, Canada
Subjects:	Machine Learning (cs.LG); Neural and Evolutionary Computing (cs.NE); Signal Processing (eess.SP); Numerical Analysis (math.NA); Machine Learning (stat.ML)
Cite as:	arXiv:2108.10101 [cs.LG]
	(or arXiv:2108.10101v1 [cs.LG] for this version)

Submission history

From: Meshia Cédric Oveneke [view email]
[v1] Mon, 23 Aug 2021 12:03:24 GMT (9kb)

Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)

Link back to: arXiv, form interface, contact.

> cs > arXiv:2108.10101

Download:

Current browse context:

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

Computer Science > Machine Learning

Title: On the Acceleration of Deep Neural Network Inference using Quantized Compressed Sensing

Submission history