We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

cs.DC

Change to browse by:

cs

References & Citations

DBLP - CS Bibliography

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo

Computer Science > Distributed, Parallel, and Cluster Computing

Title: Efficient AlphaFold2 Training using Parallel Evoformer and Branch Parallelism

Abstract: The accuracy of AlphaFold2, a frontier end-to-end structure prediction system, is already close to that of the experimental determination techniques. Due to the complex model architecture and large memory consumption, it requires lots of computational resources and time to train AlphaFold2 from scratch. Efficient AlphaFold2 training could accelerate the development of life science. In this paper, we propose a Parallel Evoformer and Branch Parallelism to speed up the training of AlphaFold2. We conduct sufficient experiments on UniFold implemented in PyTorch and HelixFold implemented in PaddlePaddle, and Branch Parallelism can improve the training performance by 38.67% and 36.93%, respectively. We also demonstrate that the accuracy of Parallel Evoformer could be on par with AlphaFold2 on the CASP14 and CAMEO datasets. The source code is available on this https URL
Comments: arXiv admin note: substantial text overlap with arXiv:2207.05477
Subjects: Distributed, Parallel, and Cluster Computing (cs.DC)
Cite as: arXiv:2211.00235 [cs.DC]
  (or arXiv:2211.00235v1 [cs.DC] for this version)

Submission history

From: Guoxia Wang [view email]
[v1] Tue, 1 Nov 2022 02:59:35 GMT (1064kb,D)

Link back to: arXiv, form interface, contact.