Title: On the Acceleration of Deep Neural Network Inference using Quantized Compressed Sensing

Abstract: Accelerating deep neural network (DNN) inference on resource-limited devices is one of the most important challenges in ensuring wider and more inclusive adoption. To address it, binary quantization of DNNs, which enables faster convolution and memory savings, is one of the most promising strategies, despite the serious drop in accuracy it causes. The present paper therefore proposes a novel binary quantization function based on quantized compressed sensing (QCS). Theoretical arguments suggest that the proposal preserves the practical benefits of standard methods while reducing the quantization error and the resulting drop in accuracy.
Comments: 3 pages, no figures, paper accepted at Black In AI at the 34th Conference on Neural Information Processing Systems (NeurIPS 2020), Vancouver, Canada
Subjects: Machine Learning (cs.LG); Neural and Evolutionary Computing (cs.NE); Signal Processing (eess.SP); Numerical Analysis (math.NA); Machine Learning (stat.ML)
Cite as: arXiv:2108.10101 [cs.LG]
  (or arXiv:2108.10101v1 [cs.LG] for this version)

Submission history

From: Meshia Cédric Oveneke
[v1] Mon, 23 Aug 2021 12:03:24 GMT (9kb)
