Title: Interpretable Single-Cell Set Classification with Kernel Mean Embeddings

Abstract: Modern single-cell flow and mass cytometry technologies measure the expression of several proteins of the individual cells within a blood or tissue sample. Each profiled biological sample is thus represented by a set of hundreds of thousands of multidimensional cell feature vectors, which incurs a high computational cost to predict each biological sample's associated phenotype with machine learning models. Such a large set cardinality also limits the interpretability of machine learning models due to the difficulty in tracking how each individual cell influences the ultimate prediction. Using Kernel Mean Embedding to encode the cellular landscape of each profiled biological sample, we can train a simple linear classifier and achieve state-of-the-art classification accuracy on 3 flow and mass cytometry datasets. Our model contains few parameters but still performs similarly to deep learning models with millions of parameters. In contrast with deep learning approaches, the linearity and sub-selection step of our model make it easy to interpret classification results. Clustering analysis further shows that our method admits rich biological interpretability for linking cellular heterogeneity to clinical phenotype.
Comments: 13 pages
Subjects: Machine Learning (cs.LG); Quantitative Methods (q-bio.QM)
Cite as: arXiv:2201.07322 [cs.LG]
  (or arXiv:2201.07322v1 [cs.LG] for this version)
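
The pipeline the abstract describes (encode each sample's set of cells as a kernel mean embedding, then fit a linear classifier) can be illustrated with a minimal sketch. Everything specific here is an assumption for illustration and is not taken from the paper: the Gaussian kernel approximated with random Fourier features, the ridge-regression classifier, the synthetic data, and all names (`random_fourier_features`, `mean_embedding`, `make_toy_sample`, `bandwidth`, `ridge`). The sub-selection step mentioned in the abstract is not modeled.

```python
import numpy as np

def random_fourier_features(cells, freqs, phases):
    """Map each cell (row) to features whose inner products approximate a Gaussian kernel."""
    n_features = freqs.shape[1]
    return np.sqrt(2.0 / n_features) * np.cos(cells @ freqs + phases)

def mean_embedding(cells, freqs, phases):
    """Approximate kernel mean embedding: average the per-cell feature vectors."""
    return random_fourier_features(cells, freqs, phases).mean(axis=0)

def make_toy_sample(rng, label, n_cells=1000, n_markers=4):
    """Hypothetical synthetic sample: in class 1, 30% of cells have a shifted first marker."""
    cells = rng.normal(size=(n_cells, n_markers))
    if label == 1:
        cells[: n_cells * 3 // 10, 0] += 3.0
    return cells

rng = np.random.default_rng(0)
n_markers, n_features, bandwidth = 4, 256, 2.0  # assumed hyperparameters

# Frequencies drawn from N(0, 1/bandwidth^2) correspond to a Gaussian kernel of that bandwidth.
freqs = rng.normal(scale=1.0 / bandwidth, size=(n_markers, n_features))
phases = rng.uniform(0.0, 2.0 * np.pi, size=n_features)

# Each sample (a set of 1000 cells) becomes one fixed-length vector.
labels = np.array([0, 1] * 20)
samples = [make_toy_sample(rng, y) for y in labels]
X = np.stack([mean_embedding(s, freqs, phases) for s in samples])

# Linear classifier: ridge regression onto +/-1 targets, fitted on centered features.
train, test = slice(0, 30), slice(30, 40)
center = X[train].mean(axis=0)
targets = 2.0 * labels[train] - 1.0
Xc = X[train] - center
ridge = 1e-3
weights = np.linalg.solve(Xc.T @ Xc + ridge * np.eye(n_features), Xc.T @ targets)

decision = (X[test] - center) @ weights
predictions = (decision > 0).astype(int)
accuracy = float((predictions == labels[test]).mean())
print(f"held-out accuracy on toy data: {accuracy:.2f}")

# Because the embedding is a mean and the model is linear, the sample-level
# decision value is the average of per-cell scores.
cell_scores = (random_fourier_features(samples[30], freqs, phases) - center) @ weights
```

In this sketch the per-cell scores average exactly to the sample's decision value, which is one way a linear model on a mean embedding lets each cell's contribution to a prediction be read off directly.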

Submission history

From: Siyuan Shan
[v1] Tue, 18 Jan 2022 21:40:36 GMT (2547kb,D)
[v2] Fri, 28 Jan 2022 18:12:16 GMT (2549kb,D)
[v3] Mon, 31 Jan 2022 05:43:47 GMT (2548kb,D)
[v4] Thu, 10 Feb 2022 08:10:14 GMT (2549kb,D)
[v5] Tue, 28 Jun 2022 15:39:38 GMT (10304kb,D)