Porting of the DBCSR library for Sparse Matrix-Matrix Multiplications to Intel Xeon Phi systems

Bethune, Iain; Gloess, Andreas; Hutter, Juerg; Lazzaro, Alfio; Pabst, Hans; Reid, Fiona

doi:10.3233/978-1-61499-843-3-47

(Submitted on 11 Aug 2017 (v1), last revised 21 Aug 2017 (this version, v2))

Abstract: Multiplication of two sparse matrices is a key operation in the simulation of the electronic structure of systems containing thousands of atoms and electrons. The highly optimized sparse linear algebra library DBCSR (Distributed Block Compressed Sparse Row) has been specifically designed to perform such sparse matrix-matrix multiplications efficiently. This library is the basic building block for linear-scaling electronic structure theory and low-scaling correlated methods in CP2K. It is parallelized using MPI and OpenMP, and can exploit GPU accelerators by means of CUDA. We compare the performance of DBCSR on systems with Intel Xeon Phi Knights Landing (KNL) processors against systems with Intel Xeon CPUs (including systems with GPUs). We find that DBCSR on Cray XC40 KNL-based systems is 11%-14% slower than on a hybrid Cray XC50 with Nvidia P100 cards, for the same number of nodes. When compared to a Cray XC40 system equipped with dual-socket Intel Xeon CPUs, the KNL is up to 24% faster.

Comments:	Submitted to the ParCo2017 conference, Bologna, Italy 12-15 September 2017
Subjects:	Distributed, Parallel, and Cluster Computing (cs.DC)
Journal reference:	Advances in Parallel Computing, Volume 32: Parallel Computing is Everywhere, pp 47 - 56, 2018, IOS Press
DOI:	10.3233/978-1-61499-843-3-47
Cite as:	arXiv:1708.03604 [cs.DC]
	(or arXiv:1708.03604v2 [cs.DC] for this version)

Submission history

From: Alfio Lazzaro
[v1] Fri, 11 Aug 2017 16:35:59 GMT (871kb,D)
[v2] Mon, 21 Aug 2017 11:55:20 GMT (871kb,D)
