GRAM-DTI: adaptive multimodal representation learning for drug target interaction prediction
- URL: http://arxiv.org/abs/2509.21971v2
- Date: Tue, 04 Nov 2025 10:56:24 GMT
- Title: GRAM-DTI: adaptive multimodal representation learning for drug target interaction prediction
- Authors: Feng Jiang, Amina Mollaysa, Hehuan Ma, Tommaso Mansi, Junzhou Huang, Mangal Prakash, Rui Liao,
- Abstract summary: GRAMDTI is a pretraining framework that integrates multimodal molecular and protein inputs into unified representations.<n>GRAMDTI extends volume based contrastive learning to four modalities, capturing higher-order semantic alignment.<n>Our results highlight the benefits of higher order multimodal alignment, adaptive modality utilization, and auxiliary supervision for robust and generalizable DTI prediction.
- Score: 25.25496268607753
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Drug target interaction (DTI) prediction is a cornerstone of computational drug discovery, enabling rational design, repurposing, and mechanistic insights. While deep learning has advanced DTI modeling, existing approaches primarily rely on SMILES protein pairs and fail to exploit the rich multimodal information available for small molecules and proteins. We introduce GRAMDTI, a pretraining framework that integrates multimodal molecular and protein inputs into unified representations. GRAMDTI extends volume based contrastive learning to four modalities, capturing higher-order semantic alignment beyond conventional pairwise approaches. To handle modality informativeness, we propose adaptive modality dropout, dynamically regulating each modality's contribution during pre-training. Additionally, IC50 activity measurements, when available, are incorporated as weak supervision to ground representations in biologically meaningful interaction strengths. Experiments on four publicly available datasets demonstrate that GRAMDTI consistently outperforms state of the art baselines. Our results highlight the benefits of higher order multimodal alignment, adaptive modality utilization, and auxiliary supervision for robust and generalizable DTI prediction.
Related papers
- Tensor-DTI: Enhancing Biomolecular Interaction Prediction with Contrastive Embedding Learning [0.015229507502478598]
We propose a contrastive learning framework that integrates multimodal embeddings from molecular graphs, protein language models, and binding-site predictions to improve interaction modeling.<n>Our findings highlight the benefits of integrating multimodal information with contrastive objectives to enhance interaction-prediction accuracy and to provide more interpretable and reliability-aware models for virtual screening splits.
arXiv Detail & Related papers (2026-01-09T13:39:49Z) - Foundation Model for Skeleton-Based Human Action Understanding [56.89025287217221]
This paper presents a Unified Skeleton-based Dense Representation Learning framework.<n>USDRL consists of a Transformer-based Dense Spatio-Temporal (DSTE), Multi-Grained Feature Decorrelation (MG-FD), and Multi-Perspective Consistency Training (MPCT)
arXiv Detail & Related papers (2025-08-18T02:42:16Z) - SCOPE-DTI: Semi-Inductive Dataset Construction and Framework Optimization for Practical Usability Enhancement in Deep Learning-Based Drug Target Interaction Prediction [15.996998422919111]
SCOPE-DTI is a unified framework combining a large-scale, balanced semi-inductive human DTI dataset with advanced deep learning modeling.<n>The SCOPE dataset expands data volume by up to 100-fold compared to common benchmarks such as the Human dataset.<n>By offering comprehensive data, advanced modeling, and accessible tools, SCOPE-DTI accelerates drug discovery research.
arXiv Detail & Related papers (2025-03-12T10:46:25Z) - MIN: Multi-channel Interaction Network for Drug-Target Interaction with Protein Distillation [64.4838301776267]
Multi-channel Interaction Network (MIN) is a novel framework designed to predict drug-target interaction (DTI)<n>MIN incorporates a representation learning module and a multi-channel interaction module.<n>MIN is not only a potent tool for DTI prediction but also offers fresh insights into the prediction of protein binding sites.
arXiv Detail & Related papers (2024-11-23T05:38:36Z) - When Parameter-efficient Tuning Meets General-purpose Vision-language
Models [65.19127815275307]
PETAL revolutionizes the training process by requiring only 0.5% of the total parameters, achieved through a unique mode approximation technique.
Our experiments reveal that PETAL not only outperforms current state-of-the-art methods in most scenarios but also surpasses full fine-tuning models in effectiveness.
arXiv Detail & Related papers (2023-12-16T17:13:08Z) - PGraphDTA: Improving Drug Target Interaction Prediction using Protein
Language Models and Contact Maps [4.590060921188914]
Key aspect of drug discovery involves identifying novel drug-target (DT) interactions.
Protein-ligand interactions exhibit a continuum of binding strengths, known as binding affinity.
We propose novel enhancements to enhance their performance.
arXiv Detail & Related papers (2023-10-06T05:00:25Z) - Exploiting Modality-Specific Features For Multi-Modal Manipulation
Detection And Grounding [54.49214267905562]
We construct a transformer-based framework for multi-modal manipulation detection and grounding tasks.
Our framework simultaneously explores modality-specific features while preserving the capability for multi-modal alignment.
We propose an implicit manipulation query (IMQ) that adaptively aggregates global contextual cues within each modality.
arXiv Detail & Related papers (2023-09-22T06:55:41Z) - SSM-DTA: Breaking the Barriers of Data Scarcity in Drug-Target Affinity
Prediction [127.43571146741984]
Drug-Target Affinity (DTA) is of vital importance in early-stage drug discovery.
wet experiments remain the most reliable method, but they are time-consuming and resource-intensive.
Existing methods have primarily focused on developing techniques based on the available DTA data, without adequately addressing the data scarcity issue.
We present the SSM-DTA framework, which incorporates three simple yet highly effective strategies.
arXiv Detail & Related papers (2022-06-20T14:53:25Z) - Multiple Similarity Drug-Target Interaction Prediction with Random Walks
and Matrix Factorization [16.41618129467975]
We take a multi-layered network perspective, where different layers correspond to different similarity metrics between drugs and targets.
To fully take advantage of topology information captured in multiple views, we develop an optimization framework, called MDMF, for DTI prediction.
The framework learns vector representations of drugs and targets that not only retain higher-order proximity across all hyper-layers, but also approximates the interactions with their inner product.
arXiv Detail & Related papers (2022-01-24T08:02:05Z) - Drug-Target Interaction Prediction with Graph Attention networks [26.40249934284416]
We present an end-to-end framework, DTI-GAT (Drug-Target Interaction prediction with Graph Attention networks) for DTI predictions.
DTI-GAT incorporates a deep network neural architecture that operates on graph-structured data with the attention mechanism.
Experimental evaluations show that DTI-GAT outperforms various state-of-the-art systems on the binary DTI prediction problem.
arXiv Detail & Related papers (2021-07-10T07:06:36Z) - DeepPurpose: a Deep Learning Library for Drug-Target Interaction
Prediction [69.7424023336611]
DeepPurpose is a comprehensive and easy-to-use deep learning library for DTI prediction.
It supports training of customized DTI prediction models by implementing 15 compound and protein encoders and over 50 neural architectures.
We demonstrate state-of-the-art performance of DeepPurpose on several benchmark datasets.
arXiv Detail & Related papers (2020-04-19T17:31:55Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.