We gratefully acknowledge support from
the Simons Foundation and member institutions.

Quantitative Methods

New submissions

[ total of 14 entries: 1-14 ]
[ showing up to 500 entries per page: fewer | more ]

New submissions for Tue, 16 Apr 24

[1]  arXiv:2404.08711 [pdf, ps, other]
Title: Drug Repurposing for Parkinson's Disease Using Random Walk With Restart Algorithm and the Parkinson's Disease Ontology Database
Comments: 5 pages, Final Year Engineering Project on Machine Learning and Healthcare Industry
Subjects: Quantitative Methods (q-bio.QM); Machine Learning (cs.LG); Biomolecules (q-bio.BM)

Parkinson's disease is a progressive and slowly developing neurodegenerative disease, characterized by dopaminergic neuron loss in the substantia nigra region of the brain. Despite extensive research by scientists, there is not yet a cure to this problem and the available therapies mainly help to reduce some of the Parkinson's symptoms. Drug repurposing (that is, the process of finding new uses for existing drugs) receives more appraisals as an efficient way that allows for reducing the time, resources, and risks associated with the development of new drugs. In this research, we design a novel computational platform that integrates gene expression data, biological networks, and the PDOD database to identify possible drug-repositioning agents for PD therapy. By using machine learning approaches like the RWR algorithm and PDOD scoring system we arrange drug-disease conversions and sort our potential sandboxes according to their possible efficacy. We propose gene expression analysis, network prioritization, and drug target data analysis to arrive at a comprehensive evaluation of drug repurposing chances. Our study results highlight such therapies as promising drug candidates to conduct further research on PD treatment. We also provide the rationale for promising drug repurposing ideas by using various sources of data and computational approaches.

[2]  arXiv:2404.08722 [pdf, other]
Title: VADA: a Data-Driven Simulator for Nanopore Sequencing
Subjects: Quantitative Methods (q-bio.QM); Machine Learning (cs.LG)

Nanopore sequencing offers the ability for real-time analysis of long DNA sequences at a low cost, enabling new applications such as early detection of cancer. Due to the complex nature of nanopore measurements and the high cost of obtaining ground truth datasets, there is a need for nanopore simulators. Existing simulators rely on handcrafted rules and parameters and do not learn an internal representation that would allow for analysing underlying biological factors of interest. Instead, we propose VADA, a purely data-driven method for simulating nanopores based on an autoregressive latent variable model. We embed subsequences of DNA and introduce a conditional prior to address the challenge of a collapsing conditioning. We introduce an auxiliary regressor on the latent variable to encourage our model to learn an informative latent representation. We empirically demonstrate that our model achieves competitive simulation performance on experimental nanopore data. Moreover, we show we have learned an informative latent representation that is predictive of the DNA labels. We hypothesize that other biological factors of interest, beyond the DNA labels, can potentially be extracted from such a learned latent representation.

Cross-lists for Tue, 16 Apr 24

[3]  arXiv:2404.08713 (cross-list from eess.IV) [pdf, other]
Title: Survival Prediction Across Diverse Cancer Types Using Neural Networks
Subjects: Image and Video Processing (eess.IV); Machine Learning (cs.LG); Quantitative Methods (q-bio.QM)

Gastric cancer and Colon adenocarcinoma represent widespread and challenging malignancies with high mortality rates and complex treatment landscapes. In response to the critical need for accurate prognosis in cancer patients, the medical community has embraced the 5-year survival rate as a vital metric for estimating patient outcomes. This study introduces a pioneering approach to enhance survival prediction models for gastric and Colon adenocarcinoma patients. Leveraging advanced image analysis techniques, we sliced whole slide images (WSI) of these cancers, extracting comprehensive features to capture nuanced tumor characteristics. Subsequently, we constructed patient-level graphs, encapsulating intricate spatial relationships within tumor tissues. These graphs served as inputs for a sophisticated 4-layer graph convolutional neural network (GCN), designed to exploit the inherent connectivity of the data for comprehensive analysis and prediction. By integrating patients' total survival time and survival status, we computed C-index values for gastric cancer and Colon adenocarcinoma, yielding 0.57 and 0.64, respectively. Significantly surpassing previous convolutional neural network models, these results underscore the efficacy of our approach in accurately predicting patient survival outcomes. This research holds profound implications for both the medical and AI communities, offering insights into cancer biology and progression while advancing personalized treatment strategies. Ultimately, our study represents a significant stride in leveraging AI-driven methodologies to revolutionize cancer prognosis and improve patient outcomes on a global scale.

[4]  arXiv:2404.09059 (cross-list from q-bio.PE) [pdf, other]
Title: Prevalence estimation methods for time-dependent antibody kinetics of infected and vaccinated individuals: a graph-theoretic approach
Comments: 27 pages, 7 figures
Subjects: Populations and Evolution (q-bio.PE); Probability (math.PR); Biological Physics (physics.bio-ph); Quantitative Methods (q-bio.QM); Methodology (stat.ME)

Immune events such as infection, vaccination, and a combination of the two result in distinct time-dependent antibody responses in affected individuals. These responses and event prevalences combine non-trivially to govern antibody levels sampled from a population. Time-dependence and disease prevalence pose considerable modeling challenges that need to be addressed to provide a rigorous mathematical underpinning of the underlying biology. We propose a time-inhomogeneous Markov chain model for event-to-event transitions coupled with a probabilistic framework for anti-body kinetics and demonstrate its use in a setting in which individuals can be infected or vaccinated but not both. We prove the equivalency of this approach to the framework developed in our previous work. Synthetic data are used to demonstrate the modeling process and conduct prevalence estimation via transition probability matrices. This approach is ideal to model sequences of infections and vaccinations, or personal trajectories in a population, making it an important first step towards a mathematical characterization of reinfection, vaccination boosting, and cross-events of infection after vaccination or vice versa.

[5]  arXiv:2404.09606 (cross-list from cs.LG) [pdf, other]
Title: A Self-feedback Knowledge Elicitation Approach for Chemical Reaction Predictions
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Quantitative Methods (q-bio.QM)

The task of chemical reaction predictions (CRPs) plays a pivotal role in advancing drug discovery and material science. However, its effectiveness is constrained by the vast and uncertain chemical reaction space and challenges in capturing reaction selectivity, particularly due to existing methods' limitations in exploiting the data's inherent knowledge. To address these challenges, we introduce a data-curated self-feedback knowledge elicitation approach. This method starts from iterative optimization of molecular representations and facilitates the extraction of knowledge on chemical reaction types (RTs). Then, we employ adaptive prompt learning to infuse the prior knowledge into the large language model (LLM). As a result, we achieve significant enhancements: a 14.2% increase in retrosynthesis prediction accuracy, a 74.2% rise in reagent prediction accuracy, and an expansion in the model's capability for handling multi-task chemical reactions. This research offers a novel paradigm for knowledge elicitation in scientific research and showcases the untapped potential of LLMs in CRPs.

[6]  arXiv:2404.09666 (cross-list from eess.IV) [pdf, other]
Title: Deformable MRI Sequence Registration for AI-based Prostate Cancer Diagnosis
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Quantitative Methods (q-bio.QM)

The PI-CAI (Prostate Imaging: Cancer AI) challenge led to expert-level diagnostic algorithms for clinically significant prostate cancer detection. The algorithms receive biparametric MRI scans as input, which consist of T2-weighted and diffusion-weighted scans. These scans can be misaligned due to multiple factors in the scanning process. Image registration can alleviate this issue by predicting the deformation between the sequences. We investigate the effect of image registration on the diagnostic performance of AI-based prostate cancer diagnosis. First, the image registration algorithm, developed in MeVisLab, is analyzed using a dataset with paired lesion annotations. Second, the effect on diagnosis is evaluated by comparing case-level cancer diagnosis performance between using the original dataset, rigidly aligned diffusion-weighted scans, or deformably aligned diffusion-weighted scans. Rigid registration showed no improvement. Deformable registration demonstrated a substantial improvement in lesion overlap (+10% median Dice score) and a positive yet non-significant improvement in diagnostic performance (+0.3% AUROC, p=0.18). Our investigation shows that a substantial improvement in lesion alignment does not directly lead to a significant improvement in diagnostic performance. Qualitative analysis indicated that jointly developing image registration methods and diagnostic AI algorithms could enhance diagnostic accuracy and patient outcomes.

[7]  arXiv:2404.09738 (cross-list from q-bio.BM) [pdf, ps, other]
Title: AMPCliff: quantitative definition and benchmarking of activity cliffs in antimicrobial peptides
Subjects: Biomolecules (q-bio.BM); Artificial Intelligence (cs.AI); Quantitative Methods (q-bio.QM)

Activity cliff (AC) is a phenomenon that a pair of similar molecules differ by a small structural alternation but exhibit a large difference in their biochemical activities. The AC of small molecules has been extensively investigated but limited knowledge is accumulated about the AC phenomenon in peptides with canonical amino acids. This study introduces a quantitative definition and benchmarking framework AMPCliff for the AC phenomenon in antimicrobial peptides (AMPs) composed by canonical amino acids. A comprehensive analysis of the existing AMP dataset reveals a significant prevalence of AC within AMPs. AMPCliff quantifies the activities of AMPs by the metric minimum inhibitory concentration (MIC), and defines 0.9 as the minimum threshold for the normalized BLOSUM62 similarity score between a pair of aligned peptides with at least two-fold MIC changes. This study establishes a benchmark dataset of paired AMPs in Staphylococcus aureus from the publicly available AMP dataset GRAMPA, and conducts a rigorous procedure to evaluate various AMP AC prediction models, including nine machine learning, four deep learning algorithms, four masked language models, and four generative language models. Our analysis reveals that these models are capable of detecting AMP AC events and the pre-trained protein language ESM2 model demonstrates superior performance across the evaluations. The predictive performance of AMP activity cliffs remains to be further improved, considering that ESM2 with 33 layers only achieves the Spearman correlation coefficient=0.50 for the regression task of the MIC values on the benchmark dataset. Source code and additional resources are available at https://www.healthinformaticslab.org/supp/ or https://github.com/Kewei2023/AMPCliff-generation.

Replacements for Tue, 16 Apr 24

[8]  arXiv:2403.14481 (replaced) [pdf, ps, other]
Title: covSTATIS: a multi-table technique for network neuroscience
Authors: Giulia Baracchini (1), Ju-Chi Yu (2), Jenny Rieck (3), Derek Beaton (4), Vincent Guillemot (5), Cheryl Grady (3 and 6), Herve Abdi (7), R. Nathan Spreng (1) ((1) Montreal Neurological Institute, Department of Neurology and Neurosurgery, McGill University, Montreal, Canada, (2) Campbell Family Mental Health Institute, Centre for Addiction and Mental Health, Toronto, Canada, (3) Rotman Research Institute at Baycrest, Toronto, Canada, (4) Data Science & Advanced Analytics, Unity Health Toronto, Toronto, Canada, (5) Institut Pasteur, Universite Paris Cite, Bioinformatics and Biostatistics Hub, Paris, France, (6) Departments of Psychiatry and Psychology, University of Toronto, Toronto, Canada, (7) School of Behavioral and Brain Sciences, The University of Texas at Dallas, Richardson, USA)
Comments: The first two authors contributed equally to this work
Subjects: Quantitative Methods (q-bio.QM)
[9]  arXiv:2208.02657 (replaced) [pdf, other]
Title: Using Instruments for Selection to Adjust for Selection Bias in Mendelian Randomization
Comments: Main part: 28 pages, 3 figures, 4 tables. Supplement: 24 pages, 8 figures, 10 tables. Paper currently under review
Subjects: Methodology (stat.ME); Quantitative Methods (q-bio.QM)
[10]  arXiv:2304.06819 (replaced) [pdf, other]
Title: Modeling Dense Multimodal Interactions Between Biological Pathways and Histology for Survival Prediction
Comments: Accepted to CVPR 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Genomics (q-bio.GN); Quantitative Methods (q-bio.QM); Tissues and Organs (q-bio.TO)
[11]  arXiv:2310.18351 (replaced) [pdf, ps, other]
Title: BioImage.IO Chatbot: A Community-Driven AI Assistant for Integrative Computational Bioimaging
Comments: 15 pages, 2 figures
Subjects: Artificial Intelligence (cs.AI); Quantitative Methods (q-bio.QM)
[12]  arXiv:2312.05063 (replaced) [pdf, other]
Title: Individualizing Glioma Radiotherapy Planning by Optimization of Data and Physics-Informed Discrete Loss
Comments: 22 pages, 7 figures, 1 table. Associated GitHub: this https URL
Subjects: Medical Physics (physics.med-ph); Numerical Analysis (math.NA); Quantitative Methods (q-bio.QM)
[13]  arXiv:2402.16165 (replaced) [pdf, ps, other]
Title: On the Feasibility of Deep Learning Classification from Raw Signal Data in Radiology, Ultrasonography and Electrophysiology
Authors: Szilard Enyedi
Comments: Updated 2024.04.14 17:49+3: accepted to the conference, so added IEEE copyright notice to the first page, as per "IEEE Publication Services and Products Board Operations Manual 2024"'s subsection 8.1.9. point D. 6 pages, 5 figures, 1 table, 56 references, submitted to AQTR 2024
Subjects: Systems and Control (eess.SY); Neural and Evolutionary Computing (cs.NE); Quantitative Methods (q-bio.QM)
[14]  arXiv:2404.02484 (replaced) [pdf, other]
Title: New methods for drug synergy prediction: a mini-review
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Quantitative Methods (q-bio.QM)
[ total of 14 entries: 1-14 ]
[ showing up to 500 entries per page: fewer | more ]

Disable MathJax (What is MathJax?)

Links to: arXiv, form interface, find, q-bio, recent, 2404, contact, help  (Access key information)