We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

cs.DC

Change to browse by:

cs

References & Citations

DBLP - CS Bibliography

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo

Computer Science > Distributed, Parallel, and Cluster Computing

Title: The BioExcel methodology for developing dynamic, scalable, reliable and portable computational biomolecular workflows

Abstract: Developing complex biomolecular workflows is not always straightforward. It requires tedious developments to enable the interoperability between the different biomolecular simulation and analysis tools. Moreover, the need to execute the pipelines on distributed systems increases the complexity of these developments. To address these issues, we propose a methodology to simplify the implementation of these workflows on HPC infrastructures. It combines a library, the BioExcel Building Blocks (BioBBs), that allows scientists to implement biomolecular pipelines as Python scripts, and the PyCOMPSs programming framework which allows to easily convert Python scripts into task-based parallel workflows executed in distributed computing systems such as HPC clusters, clouds, containerized platforms, etc. Using this methodology, we have implemented a set of computational molecular workflows and we have performed several experiments to validate its portability, scalability, reliability and malleability.
Comments: Accepted in IEEE eScience conference 2022
Subjects: Distributed, Parallel, and Cluster Computing (cs.DC)
ACM classes: D.1; D.2; J.3
Journal reference: 2022 IEEE 18th International Conference on e-Science (e-Science)
DOI: 10.1109/eScience55777.2022.00049
Cite as: arXiv:2208.14130 [cs.DC]
  (or arXiv:2208.14130v1 [cs.DC] for this version)

Submission history

From: Jorge Ejarque [view email]
[v1] Tue, 30 Aug 2022 10:27:19 GMT (7771kb,D)

Link back to: arXiv, form interface, contact.