We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

cs.DC

Change to browse by:

cs

References & Citations

DBLP - CS Bibliography

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo

Computer Science > Distributed, Parallel, and Cluster Computing

Title: RFaaS: RDMA-Enabled FaaS Platform for Serverless High-Performance Computing

Abstract: The rigid MPI programming model and batch scheduling dominate high-performance computing. While clouds brought new levels of elasticity into the world of computing, supercomputers still suffer from low resource utilization rates. To enhance supercomputing clusters with the benefits of serverless computing, a modern cloud programming paradigm for pay-as-you-go execution of stateless functions, we present rFaaS, the first RDMA-aware Function-as-a-Service (FaaS) platform. With hot invocations and decentralized function placement, we overcome the major performance limitations of FaaS systems and provide low-latency remote invocations in multi-tenant environments. We evaluate the new serverless system through a series of microbenchmarks and show that remote functions execute with negligible performance overheads. We demonstrate how serverless computing can bring elastic resource management into MPI-based high-performance applications. Overall, our results show that MPI applications can benefit from modern cloud programming paradigms to guarantee high performance at lower resource costs.
Subjects: Distributed, Parallel, and Cluster Computing (cs.DC)
Cite as: arXiv:2106.13859 [cs.DC]
  (or arXiv:2106.13859v1 [cs.DC] for this version)

Submission history

From: Marcin Copik [view email]
[v1] Fri, 25 Jun 2021 19:36:02 GMT (847kb,D)
[v2] Mon, 28 Mar 2022 16:24:41 GMT (1409kb,D)
[v3] Mon, 15 May 2023 04:12:15 GMT (2528kb,D)

Link back to: arXiv, form interface, contact.