
Title: Factorized linear discriminant analysis and its application in computational biology

Abstract: A fundamental problem in computational biology is to find a suitable representation of high-dimensional gene expression data that is consistent with the structural and functional properties of cell types, collectively called their phenotypes. This representation is often sought as a linear transformation of the original data, for reasons of model interpretability and computational simplicity. Here we propose a novel method of linear dimensionality reduction to address this problem. This method, which we call factorized linear discriminant analysis (FLDA), seeks a linear transformation of gene expression that varies highly with only one phenotypic feature and minimally with the others. We further extend our approach with a sparsity-based regularization algorithm, which selects a few genes important to a specific phenotypic feature or feature combination. We illustrate this approach by applying it to a single-cell transcriptome dataset of Drosophila T4/T5 neurons. A representation from FLDA captures structures in the data aligned with phenotypic features and reveals critical genes for each phenotype.
Subjects: Quantitative Methods (q-bio.QM); Machine Learning (cs.LG)
Cite as: arXiv:2010.02171 [q-bio.QM]
  (or arXiv:2010.02171v4 [q-bio.QM] for this version)
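
The abstract describes FLDA only in words: a linear projection that varies strongly with one phenotypic feature and weakly with the others. The sketch below shows one plausible way to realize that idea for two categorical features, as an LDA-style generalized eigenproblem. It is an illustration under stated assumptions, not the paper's exact objective: the function name `factorized_axis`, the choice of denominator (scatter across the other feature plus within-cell residual scatter), and the `ridge` term are all hypothetical, and the sparsity-based regularization is not included.

```python
import numpy as np
from scipy.linalg import eigh


def factorized_axis(X, a, b, ridge=1e-6):
    """Hypothetical sketch: find a unit vector u that maximizes
    (scatter across levels of feature a) /
    (scatter across levels of feature b + residual scatter).

    X : (n_cells, n_genes) expression matrix
    a, b : length-n_cells arrays of categorical labels
    ridge : small diagonal term keeping the denominator positive definite
    """
    X = np.asarray(X, dtype=float)
    a = np.asarray(a)
    b = np.asarray(b)
    Xc = X - X.mean(axis=0)  # center each gene
    n, d = Xc.shape

    def between_scatter(labels):
        # Scatter of the per-level mean vectors, weighted by level size.
        S = np.zeros((d, d))
        for level in np.unique(labels):
            mask = labels == level
            m = Xc[mask].mean(axis=0)
            S += mask.sum() * np.outer(m, m)
        return S / n

    S_a = between_scatter(a)  # variation we want the axis to capture
    S_b = between_scatter(b)  # variation we want the axis to ignore

    # Residual scatter within each (a, b) cell.
    S_res = np.zeros((d, d))
    for level_a in np.unique(a):
        for level_b in np.unique(b):
            mask = (a == level_a) & (b == level_b)
            if mask.any():
                R = Xc[mask] - Xc[mask].mean(axis=0)
                S_res += R.T @ R
    S_res /= n

    denom = S_b + S_res + ridge * np.eye(d)
    # Generalized symmetric eigenproblem S_a u = lambda * denom u;
    # eigh returns eigenvalues in ascending order, so take the last vector.
    _, vecs = eigh(S_a, denom)
    u = vecs[:, -1]
    return u / np.linalg.norm(u)


# Toy data: gene 0 tracks feature a, gene 1 tracks feature b, gene 2 is noise.
rng = np.random.default_rng(0)
a = np.repeat([0, 1], 100)
b = np.tile([0, 1], 100)
X = rng.normal(size=(200, 3))
X[:, 0] += 3 * a
X[:, 1] += 3 * b
u = factorized_axis(X, a, b)
print(np.round(np.abs(u), 2))
```

On this toy data the recovered axis should load mainly on gene 0, the only gene driven by feature `a`. Swapping the roles of `a` and `b` gives the corresponding axis for the other feature.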

Submission history

From: Mu Qiao
[v1] Mon, 5 Oct 2020 17:18:56 GMT (3049kb,D)
[v2] Sun, 29 Nov 2020 13:16:56 GMT (3284kb,D)
[v3] Mon, 22 Feb 2021 05:35:54 GMT (3317kb,D)
[v4] Sat, 27 Mar 2021 05:53:43 GMT (3317kb,D)
