Quantitative Methods
New submissions
[ showing up to 2000 entries per page: fewer | more ]
New submissions for Thu, 26 Jan 23
- [1] arXiv:2301.10709 [pdf, other]
-
Title: The Clinical Trials Puzzle: How Network Effects Limit Drug DiscoveryComments: manuscript + SISubjects: Quantitative Methods (q-bio.QM); Social and Information Networks (cs.SI)
The depth of knowledge offered by post-genomic medicine has carried the promise of new drugs, and cures for multiple diseases. To explore the degree to which this capability has materialized, we extract meta-data from 356,403 clinical trials spanning four decades, aiming to offer mechanistic insights into the innovation practices in drug discovery. We find that convention dominates over innovation, as over 96% of the recorded trials focus on previously tested drug targets, and the tested drugs target only 12% of the human interactome. If current patterns persist, it would take 170 years to target all druggable proteins. We uncover two network-based fundamental mechanisms that currently limit target discovery: preferential attachment, leading to the repeated exploration of previously targeted proteins; and local network effects, limiting exploration to proteins interacting with highly explored proteins. We build on these insights to develop a quantitative network-based model of drug discovery. We demonstrate that the model is able to accurately recreate the exploration patterns observed in clinical trials. Most importantly, we show that a network-based search strategy can widen the scope of drug discovery by guiding exploration to novel proteins that are part of under explored regions in the human interactome.
- [2] arXiv:2301.10748 [pdf]
-
Title: Individualised prescriptive inference in ischaemic strokeAuthors: Dominic Giles, Tianbo Xu, Chris Foulon, Robert Gray, Sebastien Ourselin, Jorge Cardoso, Hans Rolf Jäger, Geraint Rees, Ashwani Jha, Parashkev NachevComments: 124 pagesSubjects: Quantitative Methods (q-bio.QM)
The gold standard in the treatment of ischaemic stroke is set by evidence from randomised controlled trials. Yet the manifest complexities of the brain's functional, connective and vascular architectures introduce heterogeneity in treatment susceptibility that violates the premises of the underlying statistical framework, plausibly leading to substantial errors at both individual and population levels. The counterfactual nature of therapeutic inference has made quantifying the impact of this defect difficult. Employing large-scale lesion, connective, functional, genetic expression, and receptor distribution data, here we conduct a comprehensive series of semi-synthetic virtual interventional trials, quantifying the fidelity of the traditional approach in inferring individual treatment effects against biologically plausible, empirically informed ground truths. We compare the performance of machine learning models flexible enough to capture the observed heterogeneity, and find that the richness of the modelled lesion representation is decisive in determining individual-level fidelity, even where freedom from treatment allocation bias cannot be guaranteed. We are compelled to conclude that complex modelling of richly represented data is critical to individualised prescriptive inference in ischaemic stroke.
Cross-lists for Thu, 26 Jan 23
- [3] arXiv:2301.10351 (cross-list from cs.CV) [pdf, other]
-
Title: Few-Shot Learning Enables Population-Scale Analysis of Leaf Traits in Populus trichocarpaAuthors: John Lagergren, Mirko Pavicic, Hari B. Chhetri, Larry M. York, P. Doug Hyatt, David Kainer, Erica M. Rutter, Kevin Flores, Gail Taylor, Daniel Jacobson, Jared StreichSubjects: Computer Vision and Pattern Recognition (cs.CV); Quantitative Methods (q-bio.QM)
Plant phenotyping is typically a time-consuming and expensive endeavor, requiring large groups of researchers to meticulously measure biologically relevant plant traits, and is the main bottleneck in understanding plant adaptation and the genetic architecture underlying complex traits at population scale. In this work, we address these challenges by leveraging few-shot learning with convolutional neural networks (CNNs) to segment the leaf body and visible venation of 2,906 P. trichocarpa leaf images obtained in the field. In contrast to previous methods, our approach (i) does not require experimental or image pre-processing, (ii) uses the raw RGB images at full resolution, and (iii) requires very few samples for training (e.g., just eight images for vein segmentation). Traits relating to leaf morphology and vein topology are extracted from the resulting segmentations using traditional open-source image-processing tools, validated using real-world physical measurements, and used to conduct a genome-wide association study to identify genes controlling the traits. In this way, the current work is designed to provide the plant phenotyping community with (i) methods for fast and accurate image-based feature extraction that require minimal training data, and (ii) a new population-scale data set, including 68 different leaf phenotypes, for domain scientists and machine learning researchers. All of the few-shot learning code, data, and results are made publicly available.
- [4] arXiv:2301.10450 (cross-list from cs.LG) [pdf]
-
Title: HealthEdge: A Machine Learning-Based Smart Healthcare Framework for Prediction of Type 2 Diabetes in an Integrated IoT, Edge, and Cloud Computing SystemComments: arXiv admin note: text overlap with arXiv:2211.07643Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Quantitative Methods (q-bio.QM)
Diabetes Mellitus has no permanent cure to date and is one of the leading causes of death globally. The alarming increase in diabetes calls for the need to take precautionary measures to avoid/predict the occurrence of diabetes. This paper proposes HealthEdge, a machine learning-based smart healthcare framework for type 2 diabetes prediction in an integrated IoT-edge-cloud computing system. Numerical experiments and comparative analysis were carried out between the two most used machine learning algorithms in the literature, Random Forest (RF) and Logistic Regression (LR), using two real-life diabetes datasets. The results show that RF predicts diabetes with 6% more accuracy on average compared to LR.
Replacements for Thu, 26 Jan 23
- [5] arXiv:2207.00812 (replaced) [pdf, other]
-
Title: A systematic review of biologically-informed deep learning models for cancer: fundamental trends for encoding and interpreting oncology dataComments: 25 pages, 5 figuresSubjects: Quantitative Methods (q-bio.QM); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
- [6] arXiv:2209.14631 (replaced) [pdf, other]
-
Title: Working With Convex Responses: Antifragility From Finance to OncologyComments: arXiv admin note: text overlap with arXiv:1808.00065 Final accepted version in EntropySubjects: Quantitative Methods (q-bio.QM); Statistical Finance (q-fin.ST)
[ showing up to 2000 entries per page: fewer | more ]
Disable MathJax (What is MathJax?)
Links to: arXiv, form interface, find, q-bio, recent, 2301, contact, help (Access key information)