References & Citations
Computer Science > Distributed, Parallel, and Cluster Computing
Title: Fault Tolerant QR Factorization for General Matrices
(Submitted on 9 Apr 2016 (v1), last revised 14 Apr 2016 (this version, v2))
Abstract: This paper presents a fault-tolerant algorithm for the QR factorization of general matrices. It relies on the communication-avoiding algorithm, and uses the structure of the reduction of each part of the computation to introduce redundancies that are sufficient to recover the state of a failed process. After a process has failed, its state can be recovered based on the data held by one process only. Besides, it does not add any significant operation in the critical path during failure-free execution.
Submission history
From: Camille Coti [view email][v1] Sat, 9 Apr 2016 00:25:10 GMT (394kb,D)
[v2] Thu, 14 Apr 2016 13:10:14 GMT (11kb,D)
Link back to: arXiv, form interface, contact.