Integrating Single-Cell Foundation Models with Graph Neural Networks for Drug Response Prediction
- URL: http://arxiv.org/abs/2504.14361v1
- Date: Sat, 19 Apr 2025 17:35:54 GMT
- Title: Integrating Single-Cell Foundation Models with Graph Neural Networks for Drug Response Prediction
- Authors: Till Rossner, Ziteng Li, Jonas Balke, Nikoo Salehfard, Tom Seifert, Ming Tang,
- Abstract summary: We propose an innovative methodology for predicting Cancer Drug Response (CDR) through the integration of the scGPT foundation model within the DeepCDR model.<n>Our approach utilizes scGPT to generate embeddings from gene expression data, which are then used as gene expression input data for DeepCDR.
- Score: 3.544038341047844
- License: http://creativecommons.org/licenses/by-nc-nd/4.0/
- Abstract: In this study, we propose an innovative methodology for predicting Cancer Drug Response (CDR) through the integration of the scGPT foundation model within the DeepCDR model. Our approach utilizes scGPT to generate embeddings from gene expression data, which are then used as gene expression input data for DeepCDR. The experimental findings demonstrate the efficacy of this scGPT-based method in outperforming previous related works, including the original DeepCDR model and the scFoundation-based model. This study highlights the potential of scGPT embeddings to enhance the accuracy of CDR predictions and offers a promising alternative to existing approaches.
Related papers
- A Generative Framework for Causal Estimation via Importance-Weighted Diffusion Distillation [55.53426007439564]
Estimating individualized treatment effects from observational data is a central challenge in causal inference.<n>In inverse probability weighting (IPW) is a well-established solution to this problem, but its integration into modern deep learning frameworks remains limited.<n>We propose Importance-Weighted Diffusion Distillation (IWDD), a novel generative framework that combines the pretraining of diffusion models with importance-weighted score distillation.
arXiv Detail & Related papers (2025-05-16T17:00:52Z) - Teaching pathology foundation models to accurately predict gene expression with parameter efficient knowledge transfer [1.5416321520529301]
Efficient Knowledge Adaptation (PEKA) is a novel framework that integrates knowledge distillation and structure alignment losses for cross-modal knowledge transfer.<n>We evaluated PEKA for gene expression prediction using multiple spatial transcriptomics datasets.
arXiv Detail & Related papers (2025-04-09T17:24:41Z) - RL-finetuning LLMs from on- and off-policy data with a single algorithm [53.70731390624718]
We introduce a novel reinforcement learning algorithm (AGRO) for fine-tuning large-language models.<n>AGRO leverages the concept of generation consistency, which states that the optimal policy satisfies the notion of consistency across any possible generation of the model.<n>We derive algorithms that find optimal solutions via the sample-based policy gradient and provide theoretical guarantees on their convergence.
arXiv Detail & Related papers (2025-03-25T12:52:38Z) - MuCoS: Efficient Drug Target Discovery via Multi Context Aware Sampling in Knowledge Graphs [0.0]
Multi Context Aware Sampling (MuCoS) is a novel framework that prioritizes high-density neighbours to capture salient structural patterns.<n>Experiments on the KEGG50k dataset demonstrate that MuCoS outperforms state-of-the-art baselines.
arXiv Detail & Related papers (2025-03-11T06:08:42Z) - Chain-of-Retrieval Augmented Generation [72.06205327186069]
This paper introduces an approach for training o1-like RAG models that retrieve and reason over relevant information step by step before generating the final answer.<n>Our proposed method, CoRAG, allows the model to dynamically reformulate the query based on the evolving state.
arXiv Detail & Related papers (2025-01-24T09:12:52Z) - Modeling COVID-19 spread in the USA using metapopulation SIR models coupled with graph convolutional neural networks [0.0]
Graph convolutional neural networks (GCNs) have shown tremendous promise in addressing data-intensive challenges.<n>Some attempts have been made to improve predictions of Susceptible-Infected-Recovered (SIR) models by incorporating human mobility between metapopulations.<n>Recently, researchers have found that a hybrid GCN-SIR approach outperformed existing methodologies when used on the data collected on a precinct level in Japan.
arXiv Detail & Related papers (2025-01-03T18:06:26Z) - Dumpling GNN: Hybrid GNN Enables Better ADC Payload Activity Prediction Based on Chemical Structure [53.76752789814785]
DumplingGNN is a hybrid Graph Neural Network architecture specifically designed for predicting ADC payload activity based on chemical structure.
We evaluate it on a comprehensive ADC payload dataset focusing on DNA Topoisomerase I inhibitors.
It demonstrates exceptional accuracy (91.48%), sensitivity (95.08%), and specificity (97.54%) on our specialized ADC payload dataset.
arXiv Detail & Related papers (2024-09-23T17:11:04Z) - Assessing Reusability of Deep Learning-Based Monotherapy Drug Response Prediction Models Trained with Omics Data [43.57729817547386]
Cancer drug response prediction models present a promising approach towards precision oncology.
Deep learning (DL) methods have shown great potential in this area.
This highlights the need for reusable and adaptable models that can be improved and tested by the wider scientific community.
arXiv Detail & Related papers (2024-09-18T16:08:28Z) - Generative Principal Component Regression via Variational Inference [2.4415762506639944]
One approach to designing appropriate manipulations is to target key features of predictive models.
We develop a novel objective based on supervised variational autoencoders (SVAEs) that enforces such information is represented in the latent space.
We show in simulations that gPCR dramatically improves target selection in manipulation as compared to standard PCR and SVAEs.
arXiv Detail & Related papers (2024-09-03T22:38:55Z) - Predicting T-Cell Receptor Specificity [7.258321140371502]
We established a TCR generative specificity detection framework consisting of an antigen selector and a TCR classifier based on the Random Forest algorithm.
We used the k-fold validation method to compare the performance of our model with ordinary deep learning methods.
arXiv Detail & Related papers (2024-07-27T23:21:07Z) - Cancer-Net PCa-Gen: Synthesis of Realistic Prostate Diffusion Weighted
Imaging Data via Anatomic-Conditional Controlled Latent Diffusion [68.45407109385306]
In Canada, prostate cancer is the most common form of cancer in men and accounted for 20% of new cancer cases for this demographic in 2022.
There has been significant interest in the development of deep neural networks for prostate cancer diagnosis, prognosis, and treatment planning using diffusion weighted imaging (DWI) data.
In this study, we explore the efficacy of latent diffusion for generating realistic prostate DWI data through the introduction of an anatomic-conditional controlled latent diffusion strategy.
arXiv Detail & Related papers (2023-11-30T15:11:03Z) - Clinical Deterioration Prediction in Brazilian Hospitals Based on
Artificial Neural Networks and Tree Decision Models [56.93322937189087]
An extremely boosted neural network (XBNet) is used to predict clinical deterioration (CD)
The XGBoost model obtained the best results in predicting CD among Brazilian hospitals' data.
arXiv Detail & Related papers (2022-12-17T23:29:14Z) - Predicting Cellular Responses with Variational Causal Inference and Refined Relational Information [28.4434795102787]
We propose a graph variational Bayesian causal inference framework to predict a cell's gene expressions under counterfactual perturbations.<n>We leverage information representing biological knowledge in the form of gene regulatory networks to aid individualized cellular response predictions.
arXiv Detail & Related papers (2022-09-30T22:13:57Z) - An Energy-Based Prior for Generative Saliency [62.79775297611203]
We propose a novel generative saliency prediction framework that adopts an informative energy-based model as a prior distribution.
With the generative saliency model, we can obtain a pixel-wise uncertainty map from an image, indicating model confidence in the saliency prediction.
Experimental results show that our generative saliency model with an energy-based prior can achieve not only accurate saliency predictions but also reliable uncertainty maps consistent with human perception.
arXiv Detail & Related papers (2022-04-19T10:51:00Z) - AugmentedPCA: A Python Package of Supervised and Adversarial Linear
Factor Models [0.2148535041822524]
We present methods that augment the principal component analysis objective with either a supervised or adversarial objective.
We implement these methods in an open-source Python package, AugmentedPCA, that can produce excellent real-world baselines.
We demonstrate the utility of these factor models on an open-source, RNA-seq cancer gene expression dataset.
arXiv Detail & Related papers (2022-01-07T17:08:59Z) - Deceive D: Adaptive Pseudo Augmentation for GAN Training with Limited
Data [125.7135706352493]
Generative adversarial networks (GANs) typically require ample data for training in order to synthesize high-fidelity images.
Recent studies have shown that training GANs with limited data remains formidable due to discriminator overfitting.
This paper introduces a novel strategy called Adaptive Pseudo Augmentation (APA) to encourage healthy competition between the generator and the discriminator.
arXiv Detail & Related papers (2021-11-12T18:13:45Z) - Incorporating Causal Graphical Prior Knowledge into Predictive Modeling
via Simple Data Augmentation [92.96204497841032]
Causal graphs (CGs) are compact representations of the knowledge of the data generating processes behind the data distributions.
We propose a model-agnostic data augmentation method that allows us to exploit the prior knowledge of the conditional independence (CI) relations.
We experimentally show that the proposed method is effective in improving the prediction accuracy, especially in the small-data regime.
arXiv Detail & Related papers (2021-02-27T06:13:59Z) - Ensemble and Random Collaborative Representation-Based Anomaly Detector
for Hyperspectral Imagery [133.83048723991462]
We propose a novel ensemble and random collaborative representation-based detector (ERCRD) for hyperspectral anomaly detection (HAD)
Our experiments on four real hyperspectral datasets exhibit the accuracy and efficiency of this proposed ERCRD method compared with ten state-of-the-art HAD methods.
arXiv Detail & Related papers (2021-01-06T11:23:51Z) - Variational Temporal Deep Generative Model for Radar HRRP Target
Recognition [39.01318281591659]
We develop a recurrent gamma belief network (rGBN) for radar automatic target recognition (RATR) based on high-resolution range profile (HRRP)
The proposed rGBN adopts a hierarchy of gamma distributions to build its temporal deep generative model.
Experimental results on synthetic and measured HRRP data show that the proposed models are efficient in computation, have good classification accuracy and ability, and provide highly interpretable multi-stochastic-layer latent structure.
arXiv Detail & Related papers (2020-09-28T02:03:51Z) - Unlabelled Data Improves Bayesian Uncertainty Calibration under
Covariate Shift [100.52588638477862]
We develop an approximate Bayesian inference scheme based on posterior regularisation.
We demonstrate the utility of our method in the context of transferring prognostic models of prostate cancer across globally diverse populations.
arXiv Detail & Related papers (2020-06-26T13:50:19Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.