We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:


Current browse context:


Change to browse by:

References & Citations

DBLP - CS Bibliography


(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo ScienceWISE logo

Computer Science > Distributed, Parallel, and Cluster Computing

Title: Early Application Experiences on a Modern GPU-Accelerated Arm-based HPC Platform

Abstract: This paper assesses and reports the experience of eleven application teams working to build, validate, and benchmark several HPC applications on a novel GPU-accerated Arm testbed. The testbed consists of the latest, at time of writing, Arm Devkits from NVIDIA with server-class Arm CPUs and NVIDIA A100 GPUs. The applications and mini-apps are written using multiple parallel programming models, including C, CUDA, Fortran, OpenACC, and OpenMP. Each application builds extensively on the other tools available in the programming environment, including scientific libraries, compilers, and other tooling. Our goal is to evaluate application readiness for the next generation of Arm and GPU-based HPC systems and determine the tooling readiness for future application developers. On both accounts, the reported case studies demonstrate that the diversity of software and tools available for GPU-accelerated Arm systems are prepared for production, even before NVIDIA deploys their next-generation such platform: Grace.
Subjects: Distributed, Parallel, and Cluster Computing (cs.DC); Hardware Architecture (cs.AR)
Cite as: arXiv:2209.09731 [cs.DC]
  (or arXiv:2209.09731v1 [cs.DC] for this version)

Submission history

From: Wael Elwasif [view email]
[v1] Tue, 20 Sep 2022 14:01:52 GMT (681kb)
[v2] Wed, 21 Sep 2022 19:58:26 GMT (679kb)
[v3] Wed, 28 Sep 2022 18:52:18 GMT (681kb)
[v4] Mon, 19 Dec 2022 20:32:21 GMT (679kb)

Link back to: arXiv, form interface, contact.