MATE-Pred: Multimodal Attention-based TCR-Epitope interaction Predictor
- URL: http://arxiv.org/abs/2401.08619v1
- Date: Tue, 5 Dec 2023 11:30:00 GMT
- Title: MATE-Pred: Multimodal Attention-based TCR-Epitope interaction Predictor
- Authors: Etienne Goffinet, Raghvendra Mall, Ankita Singh, Rahul Kaushik and
Filippo Castiglione
- Abstract summary: An accurate binding prediction between T-cell receptors ands contributes decisively to successful immunotherapy strategies.
Here, we propose a highly reliable novel method, MATE-Pred, that performs attention-based prediction of T-cell receptors and affinitys binding regimes.
The performance of MATE-Pred projects its potential application in drug discovery.
- Score: 1.933856957193398
- License: http://creativecommons.org/licenses/by-nc-sa/4.0/
- Abstract: An accurate binding affinity prediction between T-cell receptors and epitopes
contributes decisively to develop successful immunotherapy strategies. Some
state-of-the-art computational methods implement deep learning techniques by
integrating evolutionary features to convert the amino acid residues of cell
receptors and epitope sequences into numerical values, while some other methods
employ pre-trained language models to summarize the embedding vectors at the
amino acid residue level to obtain sequence-wise representations.
Here, we propose a highly reliable novel method, MATE-Pred, that performs
multi-modal attention-based prediction of T-cell receptors and epitopes binding
affinity. The MATE-Pred is compared and benchmarked with other deep learning
models that leverage multi-modal representations of T-cell receptors and
epitopes. In the proposed method, the textual representation of proteins is
embedded with a pre-trained bi-directional encoder model and combined with two
additional modalities: a) a comprehensive set of selected physicochemical
properties; b) predicted contact maps that estimate the 3D distances between
amino acid residues in the sequences.
The MATE-Pred demonstrates the potential of multi-modal model in achieving
state-of-the-art performance (+8.4\% MCC, +5.5\% AUC compared to baselines) and
efficiently capturing contextual, physicochemical, and structural information
from amino acid residues. The performance of MATE-Pred projects its potential
application in various drug discovery regimes.
Related papers
- SFM-Protein: Integrative Co-evolutionary Pre-training for Advanced Protein Sequence Representation [97.99658944212675]
We introduce a novel pre-training strategy for protein foundation models.
It emphasizes the interactions among amino acid residues to enhance the extraction of both short-range and long-range co-evolutionary features.
Trained on a large-scale protein sequence dataset, our model demonstrates superior generalization ability.
arXiv Detail & Related papers (2024-10-31T15:22:03Z) - DPLM-2: A Multimodal Diffusion Protein Language Model [75.98083311705182]
We introduce DPLM-2, a multimodal protein foundation model that extends discrete diffusion protein language model (DPLM) to accommodate both sequences and structures.
DPLM-2 learns the joint distribution of sequence and structure, as well as their marginals and conditionals.
Empirical evaluation shows that DPLM-2 can simultaneously generate highly compatible amino acid sequences and their corresponding 3D structures.
arXiv Detail & Related papers (2024-10-17T17:20:24Z) - Autoregressive Enzyme Function Prediction with Multi-scale Multi-modality Fusion [11.278610817877578]
We introduce MAPred, a novel multi-modality and multi-scale model designed to autoregressively predict the EC number of proteins.
MAPred integrates both the primary amino acid sequence and the 3D tokens of proteins, employing a dual-pathway approach to capture comprehensive protein characteristics.
Evaluations on benchmark datasets, including New-392, Price, and New-815, demonstrate that our method outperforms existing models.
arXiv Detail & Related papers (2024-08-11T08:28:43Z) - Multi-Peptide: Multimodality Leveraged Language-Graph Learning of Peptide Properties [5.812284760539713]
Multi-Peptide is an innovative approach that combines transformer-based language models with Graph Neural Networks (GNNs) to predict peptide properties.
Evaluations on hemolysis and nonfouling datasets demonstrate Multi-Peptide's robustness, achieving state-of-the-art 86.185% accuracy in hemolysis prediction.
This study highlights the potential of multimodal learning in bioinformatics, paving the way for accurate and reliable predictions in peptide-based research and applications.
arXiv Detail & Related papers (2024-07-02T20:13:47Z) - Improving Paratope and Epitope Prediction by Multi-Modal Contrastive Learning and Interaction Informativeness Estimation [8.403085770216313]
We propose a novel Multi-modal contrastive learning and Interaction informativeness estimation-based method for Paratope and Epitope prediction, named MIPE.
MIPE implements a multi-modal contrastive learning strategy, which maximizes representations of binding and non-binding residues within each modality.
To exploit the spatial interaction information, MIPE also incorporates an interaction informativeness estimation that computes the estimated interaction matrices between antibodies and antigens.
arXiv Detail & Related papers (2024-05-31T08:09:36Z) - PSC-CPI: Multi-Scale Protein Sequence-Structure Contrasting for
Efficient and Generalizable Compound-Protein Interaction Prediction [63.50967073653953]
Compound-Protein Interaction prediction aims to predict the pattern and strength of compound-protein interactions for rational drug discovery.
Existing deep learning-based methods utilize only the single modality of protein sequences or structures.
We propose a novel multi-scale Protein Sequence-structure Contrasting framework for CPI prediction.
arXiv Detail & Related papers (2024-02-13T03:51:10Z) - Co-modeling the Sequential and Graphical Routes for Peptide
Representation Learning [67.66393016797181]
We propose a peptide co-modeling method, RepCon, to enhance the mutual information of representations from decoupled sequential and graphical end-to-end models.
RepCon learns to enhance the consistency of representations between positive sample pairs and to repel representations between negative pairs.
Our results demonstrate the superiority of the co-modeling approach over independent modeling, as well as the superiority of RepCon over other methods under the co-modeling framework.
arXiv Detail & Related papers (2023-10-04T16:58:25Z) - Efficient Prediction of Peptide Self-assembly through Sequential and
Graphical Encoding [57.89530563948755]
This work provides a benchmark analysis of peptide encoding with advanced deep learning models.
It serves as a guide for a wide range of peptide-related predictions such as isoelectric points, hydration free energy, etc.
arXiv Detail & Related papers (2023-07-17T00:43:33Z) - From Static to Dynamic Structures: Improving Binding Affinity Prediction with Graph-Based Deep Learning [40.83037811977803]
Dynaformer is a graph-based deep learning model developed to predict protein-ligand binding affinities.
It exhibits state-of-the-art scoring and ranking power on the CASF-2016 benchmark dataset.
In a virtual screening on heat shock protein 90 (HSP90), 20 candidates are identified and their binding affinities are experimentally validated.
arXiv Detail & Related papers (2022-08-19T14:55:12Z) - Ranking-based Convolutional Neural Network Models for Peptide-MHC
Binding Prediction [15.932922003001034]
identifying peptides that can bind to MHC class-I molecules plays a vital role in the design of peptide vaccines.
We develop two allele-specific CNN-based methods named ConvM and SpConvM to tackle the binding prediction problem.
arXiv Detail & Related papers (2020-12-04T20:40:36Z) - Confidence-guided Lesion Mask-based Simultaneous Synthesis of Anatomic
and Molecular MR Images in Patients with Post-treatment Malignant Gliomas [65.64363834322333]
Confidence Guided SAMR (CG-SAMR) synthesizes data from lesion information to multi-modal anatomic sequences.
module guides the synthesis based on confidence measure about the intermediate results.
experiments on real clinical data demonstrate that the proposed model can perform better than the state-of-theart synthesis methods.
arXiv Detail & Related papers (2020-08-06T20:20:22Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.