Quantitative Biology > Quantitative Methods
Title: Factorized linear discriminant analysis and its application in computational biology
(Submitted on 5 Oct 2020 (v1), last revised 27 Mar 2021 (this version, v4))
Abstract: A fundamental problem in computational biology is to find a suitable representation of high-dimensional gene expression data that is consistent with the structural and functional properties of cell types, collectively called their phenotypes. This representation is often sought from a linear transformation of the original data, for reasons of model interpretability and computational simplicity. Here we propose a novel method of linear dimensionality reduction to address this problem. This method, which we call factorized linear discriminant analysis (FLDA), seeks a linear transformation of gene expression that varies highly with only one phenotypic feature and minimally with the others. We further extend our approach with a sparsity-based regularization algorithm, which selects a few genes important to a specific phenotypic feature or feature combination. We illustrate this approach by applying it to a single-cell transcriptome dataset of Drosophila T4/T5 neurons. A representation from FLDA captured structures in the data aligned with phenotypic features and revealed critical genes for each phenotype.
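The idea of an axis that varies strongly with one phenotypic feature and weakly with the others can be sketched as a generalized eigenvalue problem. The following is a minimal illustration only, not the paper's objective: it assumes two categorical features `a` and `b`, and the scatter matrices `M_a`, `M_b`, `M_e`, the trade-off weight `lam`, the `ridge` term, and the function names are all hypothetical choices made for this sketch.

```python
import numpy as np
from scipy.linalg import eigh


def factor_scatter(X, labels):
    """Scatter of the per-level mean expression around the grand mean."""
    grand = X.mean(axis=0)
    levels = np.unique(labels)
    S = np.zeros((X.shape[1], X.shape[1]))
    for lv in levels:
        d = X[labels == lv].mean(axis=0) - grand
        S += np.outer(d, d)
    return S / max(len(levels) - 1, 1)


def residual_scatter(X, a, b):
    """Pooled covariance within each (a, b) cell type."""
    S = np.zeros((X.shape[1], X.shape[1]))
    dof = 0
    for i in np.unique(a):
        for j in np.unique(b):
            cell = X[(a == i) & (b == j)]
            if len(cell) > 1:
                c = cell - cell.mean(axis=0)
                S += c.T @ c
                dof += len(cell) - 1
    return S / max(dof, 1)


def flda_like_axis(X, a, b, lam=1.0, ridge=1e-6):
    """Unit vector u maximizing u'(M_a - lam*M_b)u / u'M_e u (assumed objective)."""
    M_a = factor_scatter(X, a)
    M_b = factor_scatter(X, b)
    # The ridge keeps M_e positive definite, as eigh requires.
    M_e = residual_scatter(X, a, b) + ridge * np.eye(X.shape[1])
    # Generalized symmetric eigenproblem; eigenvalues come back in ascending order.
    _, V = eigh(M_a - lam * M_b, M_e)
    u = V[:, -1]
    return u / np.linalg.norm(u)


# Synthetic data: gene 0 tracks feature a, gene 1 tracks feature b, gene 2 is noise.
rng = np.random.default_rng(0)
a = np.repeat([0, 1], 200)
b = np.tile(np.repeat([0, 1], 100), 2)
X = rng.normal(size=(400, 3))
X[:, 0] += 3 * a
X[:, 1] += 3 * b

u = flda_like_axis(X, a, b)
print(np.round(np.abs(u), 3))
```

On this toy data the returned axis should load mostly on gene 0, the only gene tied to feature `a`. Swapping the roles of `a` and `b` in the call would give the corresponding axis for feature `b`.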
Submission history
From: Mu Qiao
[v1] Mon, 5 Oct 2020 17:18:56 GMT (3049kb,D)
[v2] Sun, 29 Nov 2020 13:16:56 GMT (3284kb,D)
[v3] Mon, 22 Feb 2021 05:35:54 GMT (3317kb,D)
[v4] Sat, 27 Mar 2021 05:53:43 GMT (3317kb,D)