We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

q-bio.GN

Change to browse by:

References & Citations

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo ScienceWISE logo

Quantitative Biology > Genomics

Title: Computer Architecture-Aware Optimisation of DNA Analysis Systems

Abstract: DNA sequencing is revolutionising the field of medicine. DNA sequencers, the machines which perform DNA sequencing, have evolved from the size of a fridge to that of a mobile phone over the last two decades. The cost of sequencing a human genome also has reduced from billions of dollars to hundreds of dollars. Despite these improvements, DNA sequencers output hundreds or thousands of gigabytes of data that must be analysed on computers to discover meaningful information with biological implications. Unfortunately, the analysis techniques have not kept the pace with rapidly improving sequencing technologies. Consequently, even today, the process of DNA analysis is performed on high-performance computers, just as it was a couple of decades ago. Such high-performance computers are not portable. Consequently, the full utility of an ultra-portable sequencer for sequencing in-the-field or at the point-of-care is limited by the lack of portable lightweight analytic techniques. This thesis proposes computer architecture-aware optimisation of DNA analysis software. DNA analysis software is inevitably convoluted due to the complexity associated with biological data. Modern computer architectures are also complex. Performing architecture-aware optimisations requires the synergistic use of knowledge from both domains, (i.e, DNA sequence analysis and computer architecture). This thesis aims to draw the two domains together. In this thesis, gold-standard DNA sequence analysis workflows are systematically examined for algorithmic components that cause performance bottlenecks. Identified bottlenecks are resolved through architecture-aware optimisations at different levels, i.e., memory, cache, register and processor. The optimised software tools are used in complete end-to-end analysis workflows and their efficacy is demonstrated by running on prototypical embedded systems.
Comments: Supervisors: Parameswaran, Sri , Computer Science & Engineering, Faculty of Engineering, UNSW; Ignjatovic, Aleksandar , Computer Science & Engineering, Faculty of Engineering, UNSW; Smith, Martin A., Garvan Institute of Medical Research, Faculty of Medicine, UNSW unsworks: this http URL
Subjects: Genomics (q-bio.GN); Computational Engineering, Finance, and Science (cs.CE)
ACM classes: J.3; C.3; C.4
Cite as: arXiv:2101.05012 [q-bio.GN]
  (or arXiv:2101.05012v1 [q-bio.GN] for this version)

Submission history

From: Hasindu Gamaarachchi [view email]
[v1] Wed, 13 Jan 2021 11:29:12 GMT (35575kb,D)

Link back to: arXiv, form interface, contact.