We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

cs.LG

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo

Computer Science > Machine Learning

Title: A deep generative model for single-cell RNA sequencing with application to detecting differentially expressed genes

Abstract: We propose a probabilistic model for interpreting gene expression levels that are observed through single-cell RNA sequencing. In the model, each cell has a low-dimensional latent representation. Additional latent variables account for technical effects that may erroneously set some observations of gene expression levels to zero. Conditional distributions are specified by neural networks, giving the proposed model enough flexibility to fit the data well. We use variational inference and stochastic optimization to approximate the posterior distribution. The inference procedure scales to over one million cells, whereas competing algorithms do not. Even for smaller datasets, for several tasks, the proposed procedure outperforms state-of-the-art methods like ZIFA and ZINB-WaVE. We also extend our framework to take into account batch effects and other confounding factors and propose a natural Bayesian hypothesis framework for differential expression that outperforms tradition DESeq2.
Comments: Updated a previous submission instead. See arXiv:1709.02082
Subjects: Machine Learning (cs.LG); Genomics (q-bio.GN); Machine Learning (stat.ML)
Cite as: arXiv:1710.05086 [cs.LG]
  (or arXiv:1710.05086v2 [cs.LG] for this version)

Submission history

From: Romain Lopez [view email]
[v1] Fri, 13 Oct 2017 21:47:48 GMT (30kb,D)
[v2] Tue, 17 Oct 2017 01:42:35 GMT (0kb,I)

Link back to: arXiv, form interface, contact.