We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

cs.DC

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo

Computer Science > Distributed, Parallel, and Cluster Computing

Title: GraphChallenge.org Triangle Counting Performance

Abstract: The rise of graph analytic systems has created a need for new ways to measure and compare the capabilities of graph processing systems. The MIT/Amazon/IEEE Graph Challenge has been developed to provide a well-defined community venue for stimulating research and highlighting innovations in graph analysis software, hardware, algorithms, and systems. GraphChallenge.org provides a wide range of pre-parsed graph data sets, graph generators, mathematically defined graph algorithms, example serial implementations in a variety of languages, and specific metrics for measuring performance. The triangle counting component of GraphChallenge.org tests the performance of graph processing systems to count all the triangles in a graph and exercises key graph operations found in many graph algorithms. In 2017, 2018, and 2019 many triangle counting submissions were received from a wide range of authors and organizations. This paper presents a performance analysis of the best performers of these submissions. These submissions show that their state-of-the-art triangle counting execution time, $T_{\rm tri}$, is a strong function of the number of edges in the graph, $N_e$, which improved significantly from 2017 ($T_{\rm tri} \approx (N_e/10^8)^{4/3}$) to 2018 ($T_{\rm tri} \approx N_e/10^9$) and remained comparable from 2018 to 2019. Graph Challenge provides a clear picture of current graph analysis systems and underscores the need for new innovations to achieve high performance on very large graphs.
Comments: 10 pages, 8 figures, 121 references, to be submitted to IEEE HPEC 2020. This work reports new updated results on prior work reported in arXiv:1805.09675 & arXiv:1708.06866
Subjects: Distributed, Parallel, and Cluster Computing (cs.DC); Performance (cs.PF)
DOI: 10.1109/HPEC43674.2020.9286166
Cite as: arXiv:2003.09269 [cs.DC]
  (or arXiv:2003.09269v1 [cs.DC] for this version)

Submission history

From: Jeremy Kepner [view email]
[v1] Wed, 18 Mar 2020 20:36:29 GMT (415kb,D)

Link back to: arXiv, form interface, contact.