Title: NumaPerf: Predictive and Full NUMA Profiling

Authors: Xin Zhao (University of Massachusetts Amherst), Jin Zhou (University of Massachusetts Amherst), Hui Guan (University of Massachusetts Amherst), Wei Wang (University of Texas at San Antonio), Xu Liu (North Carolina State University), Tongping Liu (University of Massachusetts Amherst)
Abstract: Achieving optimal performance for parallel applications on NUMA architectures is extremely challenging and requires the assistance of profiling tools. However, existing NUMA-profiling tools share similar shortcomings in portability, effectiveness, and helpfulness. This paper proposes a novel profiling tool, NumaPerf, that overcomes these issues. NumaPerf aims to identify potential performance issues for any NUMA architecture, rather than only for the hardware it runs on. To achieve this, NumaPerf focuses on memory sharing patterns between threads instead of actual remote accesses. NumaPerf further detects potential thread migrations and load imbalance issues that could significantly affect performance but are omitted by existing profilers. NumaPerf also reports cache coherence issues separately, since they may require different fix strategies. Based on our extensive evaluation, NumaPerf identifies more performance issues than any existing tool, and fixing these issues leads to a speedup of up to 5.94x.
Subjects: Performance (cs.PF)
Cite as: arXiv:2102.05204 [cs.PF]
  (or arXiv:2102.05204v1 [cs.PF] for this version)

Submission history

From: Tongping Liu
[v1] Wed, 10 Feb 2021 01:07:41 GMT (622kb,D)
