
Title: Image and graph convolution networks improve microbiome-based machine learning accuracy

Abstract: The human gut microbiome is associated with a large number of disease etiologies. As such, it is a natural candidate for machine learning-based biomarker development for multiple diseases and conditions. The microbiome is often analyzed using 16S rRNA gene sequencing. However, several properties of microbial 16S rRNA gene sequencing hinder machine learning, including non-uniform representation, a small number of samples compared with the dimension of each sample, and sparsity of the data, with the majority of bacteria present in a small subset of samples. We suggest two novel methods to combine information from different bacteria and improve data representation for machine learning using bacterial taxonomy. iMic and gMic translate the microbiome to images and graphs respectively, and convolutional neural networks are then applied to the graph or image. We show that both algorithms improve performance of static 16S rRNA gene sequence-based machine learning compared to the best state-of-the-art methods. Furthermore, these methods ease the interpretation of the classifiers. iMic is then extended to dynamic microbiome samples, and an iMic explainable AI algorithm is proposed to detect bacteria relevant to each condition.
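
The abstract does not spell out how a microbiome sample becomes an image, so the following is only a minimal sketch of one way a taxonomy could be used to lay out abundances as a 2D array, not the authors' implementation. It assumes each taxon is given as a full taxonomy path of equal depth, that every internal node takes the mean of its descendant leaves, and that siblings are placed in adjacent columns by sorting the paths. The function name `taxonomy_to_image`, the toy taxa, and the mean aggregation are all illustrative assumptions.

```python
import numpy as np


def taxonomy_to_image(abundances):
    """Lay out leaf abundances as a (levels x leaves) array.

    `abundances` maps a taxonomy path (tuple of names, root to leaf) to a
    number. All paths are assumed to have the same depth. Row i holds, for
    each leaf column, the mean abundance of all leaves sharing that leaf's
    ancestor at level i (an assumed aggregation rule).
    """
    leaves = sorted(abundances)  # lexicographic sort keeps siblings adjacent
    depth = len(leaves[0])
    if any(len(path) != depth for path in leaves):
        raise ValueError("all taxonomy paths must have the same depth")

    image = np.zeros((depth, len(leaves)))
    for level in range(depth):
        # Group leaf columns by their ancestor at this taxonomic level.
        groups = {}
        for col, path in enumerate(leaves):
            groups.setdefault(path[: level + 1], []).append(col)
        # Every column in a group receives the group's mean abundance.
        for cols in groups.values():
            image[level, cols] = np.mean([abundances[leaves[c]] for c in cols])
    return image, leaves


# Toy sample (made-up values, three levels: phylum, class, genus).
sample = {
    ("Firmicutes", "Clostridia", "Roseburia"): 4.0,
    ("Firmicutes", "Clostridia", "Blautia"): 2.0,
    ("Firmicutes", "Bacilli", "Lactobacillus"): 0.0,
    ("Bacteroidetes", "Bacteroidia", "Bacteroides"): 6.0,
}
image, leaves = taxonomy_to_image(sample)
print(image)
```

An array of this kind places related taxa next to each other, which is the property a convolutional network would need in order to combine information from neighboring bacteria. A graph-based variant would instead keep the taxonomy tree itself as the adjacency structure.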
Comments: 19 pages of manuscript, 3 figures, and 4 pages of supplementary material
Subjects: Quantitative Methods (q-bio.QM)
MSC classes: 92-08
ACM classes: J.3
Cite as: arXiv:2205.06525 [q-bio.QM]
  (or arXiv:2205.06525v1 [q-bio.QM] for this version)

Submission history

From: Oshrit Shtossel
[v1] Fri, 13 May 2022 09:17:12 GMT (1005kb)
