References & Citations
Statistics > Machine Learning
Title: Removal of Batch Effects using Distribution-Matching Residual Networks
(Submitted on 13 Oct 2016 (v1), last revised 8 Jan 2018 (this version, v6))
Abstract: Sources of variability in experimentally derived data include measurement error in addition to the physical phenomena of interest. This measurement error is a combination of systematic components, originating from the measuring instrument, and random measurement errors. Several novel biological technologies, such as mass cytometry and single-cell RNA-seq, are plagued with systematic errors that may severely affect statistical analysis if the data is not properly calibrated. We propose a novel deep learning approach for removing systematic batch effects. Our method is based on a residual network, trained to minimize the Maximum Mean Discrepancy (MMD) between the multivariate distributions of two replicates, measured in different batches. We apply our method to mass cytometry and single-cell RNA-seq datasets, and demonstrate that it effectively attenuates batch effects.
Submission history
From: Uri Shaham [view email][v1] Thu, 13 Oct 2016 17:14:33 GMT (1065kb,D)
[v2] Sun, 16 Oct 2016 22:02:57 GMT (1065kb,D)
[v3] Mon, 28 Nov 2016 22:10:28 GMT (916kb,D)
[v4] Wed, 7 Dec 2016 03:20:42 GMT (629kb,D)
[v5] Fri, 23 Dec 2016 18:19:04 GMT (1895kb,D)
[v6] Mon, 8 Jan 2018 22:51:40 GMT (1895kb,D)
Link back to: arXiv, form interface, contact.