Efficient Neural Network Deployment for Microcontroller

Unlu, Hasan

(Submitted on 2 Jul 2020)

Abstract: Edge computing for neural networks is becoming important, especially for low-power applications and offline devices. TensorFlow Lite and PyTorch Mobile were released for this purpose, but they mainly support mobile devices rather than microcontrollers. Microcontroller support is an emerging area. There are many approaches to reducing network size and compute load, such as pruning, binarization, and layer manipulation, i.e. operator reordering. This paper explores and generalizes convolutional neural network deployment for microcontrollers with two novel optimization proposals that offer memory savings and compute efficiency in 2D convolutions as well as fully connected layers. The first is in-place max-pooling, applicable when the stride is greater than or equal to the pooling kernel size. The second is the use of ping-pong buffers between layers to reduce memory consumption significantly. The memory savings and performance will be compared with the CMSIS-NN framework developed for ARM Cortex-M CPUs. The final goal is to develop a tool that consumes a PyTorch model with trained network weights and turns it into an optimized inference engine (forward pass) in C/C++ for microcontrollers with kilobyte-level memory and limited compute capability.
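
The abstract names the two optimizations but does not show how they are implemented. The C sketch below is only an illustration of the general ideas, not the paper's code. It assumes a single row-major float channel for pooling and a chain of small fully connected layers for the buffer scheme. All names (maxpool_inplace, pingpong_t, dense_t, dense_relu, run_pingpong, PP_MAX) are hypothetical.

```c
#include <assert.h>
#include <stddef.h>

/* In-place max-pooling over one row-major h x w channel (illustrative).
 * Pooled values are compacted to the front of the same buffer, so no
 * separate output buffer is needed. Returns the number of outputs. */
static size_t maxpool_inplace(float *buf, size_t h, size_t w,
                              size_t k, size_t stride)
{
    /* Condition stated in the abstract: stride >= pooling kernel size. */
    assert(k > 0 && stride >= k && h >= k && w >= k);
    size_t out_h = (h - k) / stride + 1;
    size_t out_w = (w - k) / stride + 1;
    for (size_t oy = 0; oy < out_h; ++oy) {
        for (size_t ox = 0; ox < out_w; ++ox) {
            float m = buf[(oy * stride) * w + ox * stride];
            for (size_t ky = 0; ky < k; ++ky) {
                for (size_t kx = 0; kx < k; ++kx) {
                    float v = buf[(oy * stride + ky) * w + ox * stride + kx];
                    if (v > m) m = v;
                }
            }
            /* The write index never exceeds the index of the first element
             * of the window just read, so unread inputs stay intact. */
            buf[oy * out_w + ox] = m;
        }
    }
    return out_h * out_w;
}

/* Ping-pong activation buffers (illustrative). PP_MAX is an assumed upper
 * bound on the activation count of any layer. */
#define PP_MAX 16

typedef struct {
    float a[PP_MAX];
    float b[PP_MAX];
} pingpong_t;

typedef struct {
    size_t n_in, n_out;
    const float *w;    /* n_out x n_in weights, row-major */
    const float *bias; /* n_out biases */
} dense_t;

/* Toy fully connected layer followed by ReLU. */
static void dense_relu(const float *in, float *out, const dense_t *layer)
{
    for (size_t j = 0; j < layer->n_out; ++j) {
        float acc = layer->bias[j];
        for (size_t i = 0; i < layer->n_in; ++i)
            acc += layer->w[j * layer->n_in + i] * in[i];
        out[j] = acc > 0.0f ? acc : 0.0f;
    }
}

/* Runs the layers in sequence. The input is expected in pp->a. Each layer
 * reads one buffer and writes the other, then the roles swap. Returns a
 * pointer to the buffer holding the final activations. */
static const float *run_pingpong(pingpong_t *pp, const dense_t *layers,
                                 size_t n_layers)
{
    float *src = pp->a;
    float *dst = pp->b;
    for (size_t l = 0; l < n_layers; ++l) {
        assert(layers[l].n_in <= PP_MAX && layers[l].n_out <= PP_MAX);
        dense_relu(src, dst, &layers[l]);
        float *tmp = src;
        src = dst;
        dst = tmp;
    }
    return src;
}
```

In this sketch, activation storage for the chain stays at two buffers of PP_MAX floats regardless of the number of layers, instead of one buffer per layer output. A caller would copy the input into pp->a, call run_pingpong, and read the result through the returned pointer. A pooling layer could call maxpool_inplace directly on whichever buffer currently holds the activations.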

Subjects:	Hardware Architecture (cs.AR); Machine Learning (cs.LG)
Cite as:	arXiv:2007.01348 [cs.AR]
	(or arXiv:2007.01348v1 [cs.AR] for this version)

Submission history

From: Hasan Unlu
[v1] Thu, 2 Jul 2020 19:21:05 GMT (1108kb,D)
