Title: Removal of Batch Effects using Distribution-Matching Residual Networks

Abstract: Sources of variability in experimentally derived data include measurement error in addition to the physical phenomena of interest. This measurement error is a combination of systematic components, originating from the measuring instrument, and random measurement errors. Several novel biological technologies, such as mass cytometry and single-cell RNA-seq, are plagued with systematic errors that may severely affect statistical analysis if the data is not properly calibrated. We propose a novel deep learning approach for removing systematic batch effects. Our method is based on a residual network, trained to minimize the Maximum Mean Discrepancy (MMD) between the multivariate distributions of two replicates, measured in different batches. We apply our method to mass cytometry and single-cell RNA-seq datasets, and demonstrate that it effectively attenuates batch effects.
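To make the training objective concrete, here is a minimal sketch of a squared-MMD estimate between two samples. It is an illustration only, not the authors' implementation: the abstract does not say which kernel or bandwidth is used, so the Gaussian kernel and `sigma=1.0` are assumptions, as are the simulated data and all function names. The learned residual network is replaced by a hypothetical stand-in, a constant shift added to the source batch, simply to show the quantity that calibration is meant to reduce.

```python
import math
import random


def gaussian_kernel(u, v, sigma):
    """Gaussian (RBF) kernel between two points given as lists of floats."""
    sq_dist = sum((a - b) ** 2 for a, b in zip(u, v))
    return math.exp(-sq_dist / (2.0 * sigma ** 2))


def mean_kernel(xs, ys, sigma):
    """Average kernel value over all pairs (x, y) with x in xs, y in ys."""
    total = sum(gaussian_kernel(u, v, sigma) for u in xs for v in ys)
    return total / (len(xs) * len(ys))


def mmd2(xs, ys, sigma=1.0):
    """Biased (V-statistic) estimate of the squared MMD between two samples."""
    return (
        mean_kernel(xs, xs, sigma)
        + mean_kernel(ys, ys, sigma)
        - 2.0 * mean_kernel(xs, ys, sigma)
    )


def column_means(rows):
    """Per-coordinate mean of a list of equal-length rows."""
    n = len(rows)
    return [sum(col) / n for col in zip(*rows)]


# Simulated data (assumption): two batches that differ by a constant offset.
rng = random.Random(0)
dim, n = 3, 150
source = [[rng.gauss(0.0, 1.0) for _ in range(dim)] for _ in range(n)]
target = [[rng.gauss(0.5, 1.0) for _ in range(dim)] for _ in range(n)]

before = mmd2(source, target)

# Stand-in for a learned residual map x -> x + delta(x). Here delta is just
# the difference of batch means, not a trained network.
shift = [t - s for s, t in zip(column_means(source), column_means(target))]
calibrated = [[x + d for x, d in zip(row, shift)] for row in source]

after = mmd2(calibrated, target)
print(f"MMD^2 before calibration: {before:.4f}")
print(f"MMD^2 after calibration:  {after:.4f}")
```

The additive form `x + delta(x)` mirrors the residual structure named in the abstract: when `delta` is zero the map is the identity, so calibration starts from the uncorrected data and only learns the correction.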
Comments: fixed typo
Subjects: Machine Learning (stat.ML)
DOI: 10.1093/bioinformatics/btx196
Cite as: arXiv:1610.04181 [stat.ML]
  (or arXiv:1610.04181v6 [stat.ML] for this version)

Submission history

From: Uri Shaham
[v1] Thu, 13 Oct 2016 17:14:33 GMT (1065kb,D)
[v2] Sun, 16 Oct 2016 22:02:57 GMT (1065kb,D)
[v3] Mon, 28 Nov 2016 22:10:28 GMT (916kb,D)
[v4] Wed, 7 Dec 2016 03:20:42 GMT (629kb,D)
[v5] Fri, 23 Dec 2016 18:19:04 GMT (1895kb,D)
[v6] Mon, 8 Jan 2018 22:51:40 GMT (1895kb,D)
