Title: Optimal Server Selection for Straggler Mitigation

Abstract: Stragglers degrade the performance of large-scale distributed compute systems when the execution time of a job is uncertain. To manage stragglers, we consider a multi-fork approach to job scheduling, in which additional parallel servers are started at forking instants. Assuming that task processing times follow a shifted exponential distribution, we compute the job completion time and the cost of server utilization as functions of the forking instants and the number of additional servers. We use this analysis to provide insights into how to schedule the forking instants and how many additional servers to start at each. Numerical results demonstrate orders-of-magnitude improvement in cost over prior work in the regime of low completion times.
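
The trade-off described in the abstract can be illustrated with a small Monte Carlo sketch. It is not the paper's analysis, only a simulation under assumptions the abstract does not state: the job is a single task replicated on every server, the job completes when the first replica finishes, all other replicas are cancelled at that moment, and cost is the total server time consumed. The names `simulate_multi_fork` and `estimate` and all parameter values are hypothetical.

```python
import random


def simulate_multi_fork(fork_times, servers_added, shift, rate, rng):
    """Draw one (completion_time, server_cost) sample.

    Assumed model (not taken from the paper): a server started at time s
    finishes at s + shift + Exp(rate); the job ends at the earliest finish.
    """
    starts = []
    finishes = []
    for start, count in zip(fork_times, servers_added):
        for _ in range(count):
            starts.append(start)
            # Shifted exponential: constant shift plus an exponential tail.
            finishes.append(start + shift + rng.expovariate(rate))
    if not finishes:
        raise ValueError("at least one server must be started")
    completion = min(finishes)
    # Each server is billed from its start until the job completes.
    # Servers scheduled after completion never start and cost nothing.
    cost = sum(max(0.0, completion - s) for s in starts)
    return completion, cost


def estimate(fork_times, servers_added, shift, rate, samples, seed=0):
    """Average completion time and cost over independent samples."""
    rng = random.Random(seed)
    total_time = 0.0
    total_cost = 0.0
    for _ in range(samples):
        t, c = simulate_multi_fork(fork_times, servers_added, shift, rate, rng)
        total_time += t
        total_cost += c
    return total_time / samples, total_cost / samples


if __name__ == "__main__":
    # Compare starting 4 servers at once against 1 server plus 3 added later.
    print(estimate([0.0], [4], shift=1.0, rate=1.0, samples=20000))
    print(estimate([0.0, 1.5], [1, 3], shift=1.0, rate=1.0, samples=20000))
```

Varying `fork_times` and `servers_added` in this sketch gives a rough feel for how later forking trades longer completion time against lower server cost under the assumed model.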
Subjects: Networking and Internet Architecture (cs.NI); Distributed, Parallel, and Cluster Computing (cs.DC)
Journal reference: IEEE/ACM Transactions on Networking 2020
DOI: 10.1109/TNET.2020.2973224
Report number: Volume 28, Issue 2, pp. 709--721, April 2020
Cite as: arXiv:1911.05918 [cs.NI]
  (or arXiv:1911.05918v1 [cs.NI] for this version)

Submission history

From: Vaneet Aggarwal
[v1] Thu, 14 Nov 2019 03:18:20 GMT (174kb)
