
Title: AnySeq: A High Performance Sequence Alignment Library based on Partial Evaluation

Authors: André Müller (1), Bertil Schmidt (1), Andreas Hildebrandt (1), Richard Membarth (2 and 3), Roland Leißa (3), Matthis Kruse (3), Sebastian Hack (3) ((1) Johannes Gutenberg University, (2) DFKI, (3) Saarland University)
Abstract: Sequence alignments are fundamental to bioinformatics, which has resulted in a variety of optimized implementations. Unfortunately, the vast majority of them are hand-tuned and specific to certain architectures and execution models. This makes them not only challenging to understand and extend, but also difficult to port to other platforms. We present AnySeq, a novel library for computing different types of pairwise alignments of DNA sequences. Our approach combines high performance with an intuitively understandable implementation, which is achieved through the concept of partial evaluation. Using the AnyDSL compiler framework, AnySeq enables the compilation of algorithmic variants that are highly optimized for specific usage scenarios and hardware targets from a single, uniform codebase. The resulting domain-specific library thus allows the variation of alignment parameters (such as alignment type, scoring scheme, and traceback vs. plain score) by simple function composition rather than by metaprogramming techniques, which are often hard to understand. Our implementation supports multithreading and SIMD vectorization on CPUs, CUDA-enabled GPUs, and FPGAs. AnySeq is at most 7% slower and in many cases faster (up to 12%) than state-of-the-art manually optimized alignment libraries on CPUs (SeqAn) and on GPUs (NVBio).
Comments: To be published in IPDPS 2020. This work is supported by the Federal Ministry of Education and Research (BMBF) as part of the MetaDL, Metacca, and ProThOS projects as well as by the Intel Visual Computing Institute (IVCI) and Cluster of Excellence on Multimodal Computing and Interaction (MMCI) at Saarland University
Subjects: Distributed, Parallel, and Cluster Computing (cs.DC); Performance (cs.PF)
Cite as: arXiv:2002.04561 [cs.DC]
  (or arXiv:2002.04561v1 [cs.DC] for this version)
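
Illustration: the abstract's idea of varying alignment parameters "by simple function composition" can be sketched roughly as follows. This is a minimal Python sketch, not AnySeq's actual API: AnySeq is built on AnyDSL, and the names `simple_scoring` and `make_aligner` are hypothetical, introduced here only for illustration. The sketch computes a plain score with a linear gap penalty and does not perform partial evaluation, SIMD vectorization, or traceback.

```python
from typing import Callable

# Hypothetical type alias: substitution score for one pair of characters.
Score = Callable[[str, str], int]


def simple_scoring(match: int, mismatch: int) -> Score:
    """Build a scoring function from two constants (assumed scheme)."""
    return lambda a, b: match if a == b else mismatch


def make_aligner(score: Score, gap: int, local: bool) -> Callable[[str, str], int]:
    """Compose a scoring scheme and an alignment type into one aligner.

    gap is the (negative) linear penalty per gap character.
    local=True gives Smith-Waterman-style scores, local=False gives
    Needleman-Wunsch-style scores.
    """
    # The alignment type only changes how each cell value is post-processed
    # and which cell is reported as the final score.
    clamp = (lambda v: max(0, v)) if local else (lambda v: v)

    def align(query: str, subject: str) -> int:
        # First row of the dynamic programming matrix.
        prev = [clamp(j * gap) for j in range(len(subject) + 1)]
        best = max(prev)
        for i, a in enumerate(query, start=1):
            curr = [clamp(i * gap)]
            for j, b in enumerate(subject, start=1):
                value = max(
                    prev[j - 1] + score(a, b),  # match or mismatch
                    prev[j] + gap,              # gap in subject
                    curr[j - 1] + gap,          # gap in query
                )
                curr.append(clamp(value))
            best = max(best, max(curr))
            prev = curr
        # Local: best cell anywhere; global: bottom-right cell.
        return best if local else prev[-1]

    return align


# Two variants built from the same pieces by composition.
scoring = simple_scoring(match=2, mismatch=-1)
global_align = make_aligner(scoring, gap=-1, local=False)
local_align = make_aligner(scoring, gap=-1, local=True)
```

In this sketch, swapping the scoring scheme or the alignment type means passing a different argument to `make_aligner`; the dynamic programming loop itself stays unchanged.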

Submission history

From: Roland Leißa
[v1] Tue, 11 Feb 2020 17:34:12 GMT (1749kb)
