We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

q-bio

Change to browse by:

References & Citations

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo ScienceWISE logo

Quantitative Biology > Genomics

Title: snpQT: flexible, reproducible, and comprehensive quality control and imputation of genomic data

Abstract: Motivation: Quality control of genomic data is an essential but complicated multi-step procedure, often requiring separate installation and expert familiarity with a combination of disparate bioinformatics tools. Results: To provide an automated solution that retains comprehensive quality checks and flexible workflow architecture, we have developed snpQT, a scalable, stand-alone software pipeline, offering some 36 discrete quality filters or correction steps, with plots before-and-after user-modifiable thresholding. This includes build conversion, population stratification against 1,000 Genomes data, population outlier removal, and built-in imputation with its own pre- and post- quality controls. Common input formats are used and users need not be superusers nor have any prior coding experience. A comprehensive online tutorial and installation guide is provided through to GWAS (this https URL), introducing snpQT using a synthetic demonstration dataset and a real-world Amyotrophic Lateral Sclerosis SNP-array dataset. Availability: snpQT is open source and freely available at this https URL Contact: Vasilopoulou-C@ulster.ac.uk, w.duddy@ulster.ac.uk
Subjects: Genomics (q-bio.GN)
Cite as: arXiv:2105.01923 [q-bio.GN]
  (or arXiv:2105.01923v1 [q-bio.GN] for this version)

Submission history

From: William Duddy [view email]
[v1] Wed, 5 May 2021 08:22:40 GMT (628kb,D)

Link back to: arXiv, form interface, contact.