We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

cs.DC

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo

Computer Science > Distributed, Parallel, and Cluster Computing

Title: Ray: A Distributed Framework for Emerging AI Applications

Abstract: The next generation of AI applications will continuously interact with the environment and learn from these interactions. These applications impose new and demanding systems requirements, both in terms of performance and flexibility. In this paper, we consider these requirements and present Ray---a distributed system to address them. Ray implements a unified interface that can express both task-parallel and actor-based computations, supported by a single dynamic execution engine. To meet the performance requirements, Ray employs a distributed scheduler and a distributed and fault-tolerant store to manage the system's control state. In our experiments, we demonstrate scaling beyond 1.8 million tasks per second and better performance than existing specialized systems for several challenging reinforcement learning applications.
Comments: 17 pages, 14 figures, 13th USENIX Symposium on Operating Systems Design and Implementation, 2018
Subjects: Distributed, Parallel, and Cluster Computing (cs.DC); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Machine Learning (stat.ML)
Cite as: arXiv:1712.05889 [cs.DC]
  (or arXiv:1712.05889v2 [cs.DC] for this version)

Submission history

From: Robert Nishihara [view email]
[v1] Sat, 16 Dec 2017 01:29:49 GMT (681kb,D)
[v2] Sun, 30 Sep 2018 03:14:16 GMT (5765kb,D)

Link back to: arXiv, form interface, contact.