The Pitfall of Evaluating Performance on Emerging AI Accelerators

Jiang, Zihan; Li, Jiansong; Zhan, Jiangfeng

Full-text links:

Download:

Current browse context:

cs.PF

< prev | next >

new | recent | 1911

Computer Science > Performance

Title: The Pitfall of Evaluating Performance on Emerging AI Accelerators

Authors: Zihan Jiang, Jiansong Li, Jiangfeng Zhan

(Submitted on 8 Nov 2019)

Abstract: In recent years, domain-specific hardware has brought significant performance improvements in deep learning (DL). Both industry and academia only focus on throughput when evaluating these AI accelerators, which usually are custom ASICs deployed in datacenter to speed up the inference phase of DL workloads. Pursuing higher hardware throughput such as OPS (Operation Per Second) using various optimizations seems to be their main design target. However, they ignore the importance of accuracy in the DL nature. Motivated by this, this paper argue that a single throughput metric can not comprehensively reflect the real-world performance of AI accelerators. To reveal this pitfall, we evaluates several frequently-used optimizations on a typical AI accelerator and quantifies their impact on accuracy and throughout under representative DL inference workloads. Based on our experimental results, we find that some optimizations cause significant loss on accuracy in some workloads, although it can improves the throughout. Furthermore, our results show the importance of end-to-end evaluation in DL.

Subjects:	Performance (cs.PF); Machine Learning (cs.LG)
Cite as:	arXiv:1911.02987 [cs.PF]
	(or arXiv:1911.02987v1 [cs.PF] for this version)

Submission history

From: Zihan Jiang [view email]
[v1] Fri, 8 Nov 2019 02:13:21 GMT (639kb,D)

Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)

Link back to: arXiv, form interface, contact.

> cs > arXiv:1911.02987v1

Download:

Current browse context:

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

Computer Science > Performance

Title: The Pitfall of Evaluating Performance on Emerging AI Accelerators

Submission history