Title: LRez: C++ API and toolkit for analyzing and managing Linked-Reads data

Abstract: Linked-Reads technologies, such as 10x Genomics, combine the high quality and low cost of short-read sequencing with long-range information, through the use of barcodes that tag reads originating from a common long DNA fragment. This technology has been employed in a broad range of applications, including genome assembly, phasing, and structural variant calling. However, to date, no tool or API dedicated to the manipulation of Linked-Reads data exists. We introduce LRez, a C++ API and toolkit that allows easy management of Linked-Reads data. LRez provides functionality for computing the number of barcodes shared between genomic regions, extracting barcodes from BAM files, and indexing and querying both BAM and FASTQ files to quickly fetch reads or alignments sharing one or more barcodes. LRez can thus be used in a broad range of applications requiring barcode processing, to improve their performance. LRez is implemented in C++, supported on Linux platforms, and available under AGPL-3.0 License at this https URL
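
To make the "common barcodes between genomic regions" operation concrete, here is a minimal C++ sketch of the underlying idea. It does not use the LRez API: the function name countCommonBarcodes is hypothetical, and it assumes the barcodes of each region have already been extracted into plain string vectors (for instance from the alignments overlapping each region).

```cpp
#include <cstddef>
#include <string>
#include <unordered_set>
#include <vector>

// Hypothetical helper, NOT part of the LRez API.
// Counts the distinct barcodes that occur in both regions.
// Each input vector holds the barcodes observed in one genomic region
// and may contain duplicates (several reads can carry the same barcode).
std::size_t countCommonBarcodes(const std::vector<std::string>& regionA,
                                const std::vector<std::string>& regionB) {
    // Deduplicate the barcodes of the first region for O(1) average lookups.
    const std::unordered_set<std::string> seenA(regionA.begin(), regionA.end());

    // Collect barcodes of the second region that also occur in the first.
    // Using a set here ensures each shared barcode is counted only once.
    std::unordered_set<std::string> common;
    for (const std::string& barcode : regionB) {
        if (seenA.count(barcode) > 0) {
            common.insert(barcode);
        }
    }
    return common.size();
}
```

A large count for two distant regions suggests that reads from both regions originate from the same long DNA fragments, which is the kind of signal the applications listed in the abstract rely on.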
Comments: 4 pages, 1 table
Subjects: Genomics (q-bio.GN)
Cite as: arXiv:2103.14419 [q-bio.GN]
  (or arXiv:2103.14419v2 [q-bio.GN] for this version)

Submission history

From: Pierre Morisse
[v1] Fri, 26 Mar 2021 11:59:52 GMT (15kb)
[v2] Mon, 29 Mar 2021 07:23:09 GMT (15kb)
