PokeBNN: A Binary Pursuit of Lightweight Accuracy

Zhang, Yichi; Zhang, Zhiru; Lew, Lukasz

Full-text links:

Download:

Current browse context:

cs.LG

< prev | next >

new | recent | 2112

Computer Science > Machine Learning

Title: PokeBNN: A Binary Pursuit of Lightweight Accuracy

Authors: Yichi Zhang, Zhiru Zhang, Lukasz Lew

(Submitted on 30 Nov 2021 (v1), last revised 28 Apr 2022 (this version, v2))

Abstract: Optimization of Top-1 ImageNet promotes enormous networks that may be impractical in inference settings. Binary neural networks (BNNs) have the potential to significantly lower the compute intensity but existing models suffer from low quality. To overcome this deficiency, we propose PokeConv, a binary convolution block which improves quality of BNNs by techniques such as adding multiple residual paths, and tuning the activation function. We apply it to ResNet-50 and optimize ResNet's initial convolutional layer which is hard to binarize. We name the resulting network family PokeBNN. These techniques are chosen to yield favorable improvements in both top-1 accuracy and the network's cost. In order to enable joint optimization of the cost together with accuracy, we define arithmetic computation effort (ACE), a hardware- and energy-inspired cost metric for quantized and binarized networks. We also identify a need to optimize an under-explored hyper-parameter controlling the binarization gradient approximation.
We establish a new, strong state-of-the-art (SOTA) on top-1 accuracy together with commonly-used CPU64 cost, ACE cost and network size metrics. ReActNet-Adam, the previous SOTA in BNNs, achieved a 70.5% top-1 accuracy with 7.9 ACE. A small variant of PokeBNN achieves 70.5% top-1 with 2.6 ACE, more than 3x reduction in cost; a larger PokeBNN achieves 75.6% top-1 with 7.8 ACE, more than 5% improvement in accuracy without increasing the cost. PokeBNN implementation in JAX/Flax and reproduction instructions are available in AQT repository: this https URL

Comments:	Accepted to CVPR 2022
Subjects:	Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2112.00133 [cs.LG]
	(or arXiv:2112.00133v2 [cs.LG] for this version)

Submission history

From: Lukasz Lew [view email]
[v1] Tue, 30 Nov 2021 22:05:59 GMT (135kb,D)
[v2] Thu, 28 Apr 2022 19:58:34 GMT (150kb,D)

Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)

Link back to: arXiv, form interface, contact.

> cs > arXiv:2112.00133

Download:

Current browse context:

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

Computer Science > Machine Learning

Title: PokeBNN: A Binary Pursuit of Lightweight Accuracy

Submission history