Dissecting the Graphcore IPU Architecture via Microbenchmarking

Jia, Zhe; Tillman, Blake; Maggioni, Marco; Scarpazza, Daniele Paolo

Full-text links:

Download:

Current browse context:

cs.DC

< prev | next >

new | recent | 1912

Computer Science > Distributed, Parallel, and Cluster Computing

Title: Dissecting the Graphcore IPU Architecture via Microbenchmarking

Authors: Zhe Jia, Blake Tillman, Marco Maggioni, Daniele Paolo Scarpazza

(Submitted on 7 Dec 2019)

Abstract: This report focuses on the architecture and performance of the Intelligence Processing Unit (IPU), a novel, massively parallel platform recently introduced by Graphcore and aimed at Artificial Intelligence/Machine Learning (AI/ML) workloads. We dissect the IPU's performance behavior using microbenchmarks that we crafted for the purpose. We study the IPU's memory organization and performance. We study the latency and bandwidth that the on-chip and off-chip interconnects offer, both in point-to-point transfers and in a spectrum of collective operations, under diverse loads. We evaluate the IPU's compute power over matrix multiplication, convolution, and AI/ML primitives. We discuss actual performance in comparison with its theoretical limits. Our findings reveal how the IPU's architectural design affects its performance. Moreover, they offer simple mental models to predict an application's performance on the IPU, on the basis of the computation and communication steps it involves. This report is the natural extension to a novel architecture of a continuing effort of ours that focuses on the microbenchmark-based discovery of massively parallel architectures.

Comments:	91 pages, 21 figures
Subjects:	Distributed, Parallel, and Cluster Computing (cs.DC); Hardware Architecture (cs.AR); Performance (cs.PF)
Cite as:	arXiv:1912.03413 [cs.DC]
	(or arXiv:1912.03413v1 [cs.DC] for this version)

Submission history

From: Daniele Scarpazza [view email]
[v1] Sat, 7 Dec 2019 02:10:19 GMT (4633kb,D)

Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)

Link back to: arXiv, form interface, contact.

> cs > arXiv:1912.03413

Download:

Current browse context:

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

Computer Science > Distributed, Parallel, and Cluster Computing

Title: Dissecting the Graphcore IPU Architecture via Microbenchmarking

Submission history