We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:


Current browse context:


Change to browse by:


References & Citations

DBLP - CS Bibliography


(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo ScienceWISE logo

Computer Science > Distributed, Parallel, and Cluster Computing

Title: Bit-GraphBLAS: Bit-Level Optimizations of Matrix-Centric Graph Processing on GPU

Abstract: In a general graph data structure like an adjacency matrix, when edges are homogeneous, the connectivity of two nodes can be sufficiently represented using a single bit. This insight has, however, not yet been adequately exploited by the existing matrix-centric graph processing frameworks. This work fills the void by systematically exploring the bit-level representation of graphs and the corresponding optimizations to the graph operations. It proposes a two-level representation named Bit-Block Compressed Sparse Row (B2SR) and presents a series of optimizations to the graph operations on B2SR by leveraging the intrinsics of modern GPUs. Evaluations on NVIDIA Pascal and Volta GPUs show that the optimizations bring up to $40\times$ and $6555\times$ for essential GraphBLAS kernels SpMV and SpGEMM, respectively, making GraphBLAS-based BFS accelerate up to $433\times$, SSSP, PR, and CC up to $35\times$, and TC up to $52\times$.
Comments: To appear in 36th IEEE International Parallel & Distributed Processing Symposium (IPDPS 2022)
Subjects: Distributed, Parallel, and Cluster Computing (cs.DC)
Cite as: arXiv:2201.08560 [cs.DC]
  (or arXiv:2201.08560v2 [cs.DC] for this version)

Submission history

From: Jou-An Chen [view email]
[v1] Fri, 21 Jan 2022 06:35:12 GMT (3479kb,D)
[v2] Tue, 22 Feb 2022 03:32:18 GMT (412kb,D)

Link back to: arXiv, form interface, contact.