Computer Science > Machine Learning
Title: A deep generative model for gene expression profiles from single-cell RNA sequencing
(Submitted on 7 Sep 2017 (v1), last revised 16 Jan 2018 (this version, v4))
Abstract: We propose a probabilistic model for interpreting gene expression levels that are observed through single-cell RNA sequencing. In the model, each cell has a low-dimensional latent representation. Additional latent variables account for technical effects that may erroneously set some observations of gene expression levels to zero. Conditional distributions are specified by neural networks, giving the proposed model enough flexibility to fit the data well. We use variational inference and stochastic optimization to approximate the posterior distribution. The inference procedure scales to over one million cells, whereas competing algorithms do not. Even for smaller datasets, for several tasks, the proposed procedure outperforms state-of-the-art methods like ZIFA and ZINB-WaVE. We also extend our framework to account for batch effects and other confounding factors, and propose a Bayesian hypothesis test for differential expression that outperforms DESeq2.
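The generative story in the abstract (a low-dimensional latent representation per cell, plus extra latent variables that can erroneously zero out observations) can be illustrated with a small simulation. This is a minimal sketch under stated assumptions, not the authors' model: the log-linear decoder stands in for the neural networks mentioned in the abstract, the Poisson count distribution and the single shared dropout probability are simplifications chosen here, and the names `simulate_cell`, `sample_poisson`, `weights`, `biases`, and `dropout_prob` are hypothetical.

```python
import math
import random


def sample_poisson(rate, rng):
    """Draw one Poisson sample with Knuth's method (fine for small rates)."""
    threshold = math.exp(-rate)
    count, product = 0, rng.random()
    while product > threshold:
        count += 1
        product *= rng.random()
    return count


def simulate_cell(weights, biases, dropout_prob, rng):
    """Simulate observed counts for one cell.

    weights: one list of latent_dim floats per gene (hypothetical decoder).
    biases: one float per gene.
    dropout_prob: assumed probability that a technical effect zeroes a gene.
    Returns (latent vector, observed counts, dropout indicators).
    """
    latent_dim = len(weights[0])
    # Low-dimensional latent representation of the cell.
    z = [rng.gauss(0.0, 1.0) for _ in range(latent_dim)]

    counts, dropped = [], []
    for w_g, b_g in zip(weights, biases):
        # Log-linear decoder: a stand-in for a neural network.
        log_rate = b_g + sum(w * z_k for w, z_k in zip(w_g, z))
        rate = math.exp(min(log_rate, 3.0))  # cap keeps Knuth's method stable
        expressed = sample_poisson(rate, rng)
        # Extra latent variable: a technical effect that zeroes the observation.
        is_dropout = rng.random() < dropout_prob
        counts.append(0 if is_dropout else expressed)
        dropped.append(is_dropout)
    return z, counts, dropped


if __name__ == "__main__":
    rng = random.Random(0)
    n_genes, latent_dim = 5, 2
    weights = [[rng.gauss(0.0, 0.5) for _ in range(latent_dim)]
               for _ in range(n_genes)]
    biases = [1.0] * n_genes
    z, counts, dropped = simulate_cell(weights, biases, 0.3, rng)
    print("latent:", z)
    print("counts:", counts)
    print("dropout:", dropped)
```

Inference in the paper runs in the opposite direction: given only the observed counts, variational inference approximates the posterior over the latent variables. That step is not shown here.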
Submission history
From: Romain Lopez
[v1] Thu, 7 Sep 2017 05:59:49 GMT (8kb)
[v2] Tue, 17 Oct 2017 01:41:27 GMT (30kb,D)
[v3] Wed, 18 Oct 2017 00:37:51 GMT (30kb,D)
[v4] Tue, 16 Jan 2018 22:44:59 GMT (30kb,D)