
Quantitative Methods

New submissions

[ total of 6 entries: 1-6 ]

New submissions for Thu, 26 Jan 23

[1]  arXiv:2301.10709 [pdf, other]
Title: The Clinical Trials Puzzle: How Network Effects Limit Drug Discovery
Comments: manuscript + SI
Subjects: Quantitative Methods (q-bio.QM); Social and Information Networks (cs.SI)

The depth of knowledge offered by post-genomic medicine has carried the promise of new drugs and cures for multiple diseases. To explore the degree to which this capability has materialized, we extract metadata from 356,403 clinical trials spanning four decades, aiming to offer mechanistic insights into the innovation practices in drug discovery. We find that convention dominates over innovation, as over 96% of the recorded trials focus on previously tested drug targets, and the tested drugs target only 12% of the human interactome. If current patterns persist, it would take 170 years to target all druggable proteins. We uncover two fundamental network-based mechanisms that currently limit target discovery: preferential attachment, leading to the repeated exploration of previously targeted proteins; and local network effects, limiting exploration to proteins interacting with highly explored proteins. We build on these insights to develop a quantitative network-based model of drug discovery. We demonstrate that the model is able to accurately recreate the exploration patterns observed in clinical trials. Most importantly, we show that a network-based search strategy can widen the scope of drug discovery by guiding exploration to novel proteins that are part of underexplored regions in the human interactome.
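The two mechanisms named in the abstract can be illustrated with a toy simulation. This is a minimal sketch and not the authors' model: the function name `simulate_target_exploration`, the parameter `p_new`, and the path-shaped interaction network are all assumptions made for illustration. Each simulated trial either revisits an already tested protein with probability proportional to its past trial count (preferential attachment) or, with small probability, tests an untested protein that interacts with a tested one (local network effect).

```python
import random


def simulate_target_exploration(neighbors, seed_target, n_trials, p_new=0.04, rng=None):
    """Toy simulation of how trials pick protein targets (illustrative only).

    neighbors:   dict mapping each protein to the set of proteins it interacts with
    seed_target: the first protein ever tested
    p_new:       hypothetical probability that a trial tests an untested protein
    Returns a dict mapping each tested protein to its number of trials.
    """
    rng = rng or random.Random(0)
    counts = {seed_target: 1}
    for _ in range(n_trials):
        tested = set(counts)
        # Local network effect: only untested neighbours of tested proteins
        # are candidates for a novel target.
        frontier = sorted({q for p in tested for q in neighbors[p]} - tested)
        if frontier and rng.random() < p_new:
            counts[rng.choice(frontier)] = 1
        else:
            # Preferential attachment: revisit a tested protein with
            # probability proportional to how often it was tested before.
            names = sorted(counts)
            weights = [counts[p] for p in names]
            chosen = rng.choices(names, weights=weights, k=1)[0]
            counts[chosen] += 1
    return counts


# Hypothetical interaction network: 20 proteins arranged in a chain.
chain = {i: {j for j in (i - 1, i + 1) if 0 <= j < 20} for i in range(20)}
trial_counts = simulate_target_exploration(chain, seed_target=0, n_trials=500)
print(len(trial_counts), "of", len(chain), "proteins tested")
```

In this sketch exploration can only advance one interaction step at a time from the already tested set, which is the qualitative behaviour the abstract attributes to local network effects.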

[2]  arXiv:2301.10748 [pdf]
Title: Individualised prescriptive inference in ischaemic stroke
Comments: 124 pages
Subjects: Quantitative Methods (q-bio.QM)

The gold standard in the treatment of ischaemic stroke is set by evidence from randomised controlled trials. Yet the manifest complexities of the brain's functional, connective and vascular architectures introduce heterogeneity in treatment susceptibility that violates the premises of the underlying statistical framework, plausibly leading to substantial errors at both individual and population levels. The counterfactual nature of therapeutic inference has made quantifying the impact of this defect difficult. Employing large-scale lesion, connective, functional, genetic expression, and receptor distribution data, here we conduct a comprehensive series of semi-synthetic virtual interventional trials, quantifying the fidelity of the traditional approach in inferring individual treatment effects against biologically plausible, empirically informed ground truths. We compare the performance of machine learning models flexible enough to capture the observed heterogeneity, and find that the richness of the modelled lesion representation is decisive in determining individual-level fidelity, even where freedom from treatment allocation bias cannot be guaranteed. We are compelled to conclude that complex modelling of richly represented data is critical to individualised prescriptive inference in ischaemic stroke.
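Why heterogeneity in treatment susceptibility can mislead a population-level estimate is easy to see in a toy example. The sketch below is an illustration under stated assumptions and not the paper's semi-synthetic framework: the hidden subgroup, the effect sizes of +1 and -1, and the function names are all hypothetical. Half of the simulated patients benefit from treatment and half are harmed, so a randomised trial reports an average effect near zero even though no individual has a zero effect.

```python
import random
import statistics


def simulate_trial(n, rng):
    """Simulate a randomised trial with a hidden responder subgroup (hypothetical)."""
    records = []
    for _ in range(n):
        responder = rng.random() < 0.5            # hidden subgroup membership
        true_effect = 1.0 if responder else -1.0  # assumed individual effect
        treated = rng.random() < 0.5              # random treatment allocation
        baseline = rng.gauss(0.0, 1.0)            # outcome without treatment
        outcome = baseline + (true_effect if treated else 0.0)
        records.append((responder, treated, outcome, true_effect))
    return records


def average_treatment_effect(records):
    """Difference in mean outcome between treated and control patients."""
    treated = [y for _, t, y, _ in records if t]
    control = [y for _, t, y, _ in records if not t]
    return statistics.mean(treated) - statistics.mean(control)


records = simulate_trial(20000, random.Random(0))
ate = average_treatment_effect(records)
responder_ate = average_treatment_effect([r for r in records if r[0]])
# Error made when the population average is used as every individual's effect.
individual_error = statistics.mean(abs(ate - effect) for _, _, _, effect in records)
print(f"ATE={ate:.2f} responders={responder_ate:.2f} mean |error|={individual_error:.2f}")
```

A model that can represent the subgroup recovers the effect within it, which is the kind of gain the abstract associates with richer lesion representations.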

Cross-lists for Thu, 26 Jan 23

[3]  arXiv:2301.10351 (cross-list from cs.CV) [pdf, other]
Title: Few-Shot Learning Enables Population-Scale Analysis of Leaf Traits in Populus trichocarpa
Subjects: Computer Vision and Pattern Recognition (cs.CV); Quantitative Methods (q-bio.QM)

Plant phenotyping is typically a time-consuming and expensive endeavor, requiring large groups of researchers to meticulously measure biologically relevant plant traits, and is the main bottleneck in understanding plant adaptation and the genetic architecture underlying complex traits at population scale. In this work, we address these challenges by leveraging few-shot learning with convolutional neural networks (CNNs) to segment the leaf body and visible venation of 2,906 P. trichocarpa leaf images obtained in the field. In contrast to previous methods, our approach (i) does not require experimental or image pre-processing, (ii) uses the raw RGB images at full resolution, and (iii) requires very few samples for training (e.g., just eight images for vein segmentation). Traits relating to leaf morphology and vein topology are extracted from the resulting segmentations using traditional open-source image-processing tools, validated using real-world physical measurements, and used to conduct a genome-wide association study to identify genes controlling the traits. In this way, the current work is designed to provide the plant phenotyping community with (i) methods for fast and accurate image-based feature extraction that require minimal training data, and (ii) a new population-scale data set, including 68 different leaf phenotypes, for domain scientists and machine learning researchers. All of the few-shot learning code, data, and results are made publicly available.

[4]  arXiv:2301.10450 (cross-list from cs.LG) [pdf]
Title: HealthEdge: A Machine Learning-Based Smart Healthcare Framework for Prediction of Type 2 Diabetes in an Integrated IoT, Edge, and Cloud Computing System
Comments: arXiv admin note: text overlap with arXiv:2211.07643
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Quantitative Methods (q-bio.QM)

Diabetes mellitus has no permanent cure to date and is one of the leading causes of death globally. The alarming increase in diabetes calls for precautionary measures to predict and avoid its occurrence. This paper proposes HealthEdge, a machine learning-based smart healthcare framework for type 2 diabetes prediction in an integrated IoT-edge-cloud computing system. Numerical experiments and a comparative analysis were carried out between the two machine learning algorithms most used in the literature, Random Forest (RF) and Logistic Regression (LR), using two real-life diabetes datasets. The results show that RF predicts diabetes with 6% higher accuracy on average than LR.

Replacements for Thu, 26 Jan 23

[5]  arXiv:2207.00812 (replaced) [pdf, other]
Title: A systematic review of biologically-informed deep learning models for cancer: fundamental trends for encoding and interpreting oncology data
Comments: 25 pages, 5 figures
Subjects: Quantitative Methods (q-bio.QM); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)

[6]  arXiv:2209.14631 (replaced) [pdf, other]
Title: Working With Convex Responses: Antifragility From Finance to Oncology
Comments: arXiv admin note: text overlap with arXiv:1808.00065 Final accepted version in Entropy
Subjects: Quantitative Methods (q-bio.QM); Statistical Finance (q-fin.ST)
