SGQuant: Squeezing the Last Bit on Graph Neural Networks with Specialized Quantization

Feng, Boyuan; Wang, Yuke; Li, Xu; Yang, Shu; Peng, Xueqiao; Ding, Yufei

(Submitted on 9 Jul 2020 (v1), last revised 16 Sep 2020 (this version, v2))

Abstract: With the increasing popularity of graph-based learning, Graph Neural Networks (GNNs) have attracted considerable attention from both research and industry because of their high accuracy. However, existing GNNs suffer from high memory footprints (e.g., from node embedding features). This high memory footprint hinders potential applications on memory-constrained devices, such as widely deployed IoT devices. To this end, we propose a specialized GNN quantization scheme, SGQuant, to systematically reduce GNN memory consumption. Specifically, we first propose a GNN-tailored quantization algorithm design and a GNN quantization fine-tuning scheme to reduce memory consumption while maintaining accuracy. Then, we investigate a multi-granularity quantization strategy that operates at different levels (components, graph topology, and layers) of GNN computation. Moreover, we offer an automatic bit-selecting (ABS) method to pinpoint the most appropriate quantization bits for the above multi-granularity quantizations. Extensive experiments show that SGQuant can effectively reduce the memory footprint by 4.25x to 31.9x compared with the original full-precision GNNs while limiting the accuracy drop to 0.4% on average.
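
To make the memory argument in the abstract concrete, here is a minimal sketch of generic uniform (min-max) quantization applied to one vector of node features. It is not SGQuant's algorithm, which the abstract does not spell out; the function names `quantize`, `dequantize`, and `compression_ratio`, and the assumption of a 32-bit full-precision baseline, are illustrative only.

```python
def quantize(values, bits):
    """Map floats to integer codes in [0, 2**bits - 1] (uniform min-max scheme)."""
    if bits < 1:
        raise ValueError("bits must be >= 1")
    lo, hi = min(values), max(values)
    levels = (1 << bits) - 1
    # Guard against a constant vector, where hi == lo.
    scale = (hi - lo) / levels if hi > lo else 1.0
    # Clamp so floating-point rounding can never produce an out-of-range code.
    codes = [min(levels, max(0, round((v - lo) / scale))) for v in values]
    return codes, lo, scale


def dequantize(codes, lo, scale):
    """Reconstruct approximate floats from integer codes."""
    return [lo + c * scale for c in codes]


def compression_ratio(bits, full_precision_bits=32):
    """Storage reduction for the codes alone (assumed 32-bit baseline)."""
    return full_precision_bits / bits


# Hypothetical node-feature values, chosen only for illustration.
features = [0.0, 0.25, 0.5, 1.0]
codes, lo, scale = quantize(features, bits=4)
restored = dequantize(codes, lo, scale)
max_error = max(abs(a - b) for a, b in zip(features, restored))
```

Under these assumptions, storing 4-bit codes instead of 32-bit floats shrinks the feature storage by a factor of 8, ignoring the small overhead of keeping `lo` and `scale`, and the reconstruction error of each value is bounded by half a quantization step.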

Subjects:	Machine Learning (cs.LG); Machine Learning (stat.ML)
Cite as:	arXiv:2007.05100 [cs.LG]
	(or arXiv:2007.05100v2 [cs.LG] for this version)

Submission history

From: Yuke Wang
[v1] Thu, 9 Jul 2020 22:42:34 GMT (2451kb,D)
[v2] Wed, 16 Sep 2020 07:13:58 GMT (1882kb,D)
