DapPep: Domain Adaptive Peptide-agnostic Learning for Universal T-cell Receptor-antigen Binding Affinity Prediction
- URL: http://arxiv.org/abs/2411.17798v1
- Date: Tue, 26 Nov 2024 18:06:42 GMT
- Title: DapPep: Domain Adaptive Peptide-agnostic Learning for Universal T-cell Receptor-antigen Binding Affinity Prediction
- Authors: Jiangbin Zheng, Qianhui Xu, Ruichen Xia, Stan Z. Li,
- Abstract summary: We introduce a domain-adaptive peptide-agnostic learning framework DapPep for universal TCR-antigen binding affinity prediction.
DapPep consistently outperforms existing tools, showcasing robust generalization capability.
It proves effective in challenging clinical tasks such as sorting reactive T cells in tumor neoantigen therapy and identifying key positions in 3D structures.
- Score: 38.358558338444624
- License:
- Abstract: Identifying T-cell receptors (TCRs) that interact with antigenic peptides provides the technical basis for developing vaccines and immunotherapies. The emergent deep learning methods excel at learning antigen binding patterns from known TCRs but struggle with novel or sparsely represented antigens. However, binding specificity for unseen antigens or exogenous peptides is critical. We introduce a domain-adaptive peptide-agnostic learning framework DapPep for universal TCR-antigen binding affinity prediction to address this challenge. The lightweight self-attention architecture combines a pre-trained protein language model with an inner-loop self-supervised regime to enable robust TCR-peptide representations. Extensive experiments on various benchmarks demonstrate that DapPep consistently outperforms existing tools, showcasing robust generalization capability, especially for data-scarce settings and unseen peptides. Moreover, DapPep proves effective in challenging clinical tasks such as sorting reactive T cells in tumor neoantigen therapy and identifying key positions in 3D structures.
Related papers
- Relation-Aware Equivariant Graph Networks for Epitope-Unknown Antibody Design and Specificity Optimization [61.06622479173572]
We propose a novel Relation-Aware Design (RAAD) framework, which models antigen-antibody interactions for co-designing sequences and structures of antigen-specific CDRs.
Furthermore, we propose a new evaluation metric to better measure antibody specificity and develop a contrasting specificity-enhancing constraint to optimize the specificity of antibodies.
arXiv Detail & Related papers (2024-12-14T03:00:44Z) - TCR-GPT: Integrating Autoregressive Model and Reinforcement Learning for T-Cell Receptor Repertoires Generation [6.920411338236452]
T-cell receptors (TCRs) play a crucial role in the immune system by recognizing and binding to specific antigens presented by infected or cancerous cells.
Language models, such as auto-regressive transformers, offer a powerful solution by learning the probability distributions of TCR repertoires.
We introduce TCR-GPT, a probabilistic model built on a decoder-only transformer architecture, designed to uncover and replicate sequence patterns in TCR repertoires.
arXiv Detail & Related papers (2024-08-02T10:16:28Z) - tcrLM: a lightweight protein language model for predicting T cell receptor and epitope binding specificity [4.120928123714289]
Anti-cancer immune response relies on bindings between T-cell receptors (TCRs) and antigens, which elicits adaptive immunity to eliminate tumor cells.
In this study, we introduce a lightweight masked language model, termed tcrLM, to address this challenge.
We construct the largest TCR CDR3 sequence set with more than 100 million distinct sequences, and pretrain tcrLM on these sequences.
The results demonstrate that tcrLM not only surpasses existing TCR-antigen binding prediction methods, but also outperforms other mainstream protein language models.
arXiv Detail & Related papers (2024-06-24T08:36:40Z) - A unified cross-attention model for predicting antigen binding specificity to both HLA and TCR molecules [4.501817929699959]
The immune checkpoint inhibitors have demonstrated promising clinical efficacy across various tumor types.
The bindings between tumor antigens and HLA-I/TCR molecules determine the antigen presentation and T-cell activation.
We propose UnifyImmun, a unified cross-attention transformer model designed to simultaneously predict the bindings of peptides to both receptors.
arXiv Detail & Related papers (2024-04-08T08:25:25Z) - Transfer Learning for T-Cell Response Prediction [0.1874930567916036]
We study the prediction of T-cell response for specific given peptides.
We show that the danger of inflated predictive performance is not merely theoretical but occurs in practice.
arXiv Detail & Related papers (2024-03-18T17:32:19Z) - A Hierarchical Training Paradigm for Antibody Structure-sequence
Co-design [54.30457372514873]
We propose a hierarchical training paradigm (HTP) for the antibody sequence-structure co-design.
HTP consists of four levels of training stages, each corresponding to a specific protein modality.
Empirical experiments show that HTP sets the new state-of-the-art performance in the co-design problem.
arXiv Detail & Related papers (2023-10-30T02:39:15Z) - Efficient Prediction of Peptide Self-assembly through Sequential and
Graphical Encoding [57.89530563948755]
This work provides a benchmark analysis of peptide encoding with advanced deep learning models.
It serves as a guide for a wide range of peptide-related predictions such as isoelectric points, hydration free energy, etc.
arXiv Detail & Related papers (2023-07-17T00:43:33Z) - T-Cell Receptor Optimization with Reinforcement Learning and Mutation
Policies for Precesion Immunotherapy [21.004878412411053]
T-cell receptors (TCRs) are protein complexes found on the surface of T cells and can bind to peptides.
This process is known as TCR recognition and constitutes a key step for immune response.
In this paper, we formulated the search for optimized TCRs as a reinforcement learning problem and presented a framework TCRPPO with a mutation policy.
arXiv Detail & Related papers (2023-03-02T20:25:14Z) - xTrimoABFold: De novo Antibody Structure Prediction without MSA [77.47606749555686]
We develop a novel model named xTrimoABFold to predict antibody structure from antibody sequence.
The model was trained end-to-end on the antibody structures in PDB by minimizing the ensemble loss of domain-specific focal loss on CDR and the frame-aligned point loss.
arXiv Detail & Related papers (2022-11-30T09:26:08Z) - Incorporating Pre-training Paradigm for Antibody Sequence-Structure
Co-design [134.65287929316673]
Deep learning-based computational antibody design has attracted popular attention since it automatically mines the antibody patterns from data that could be complementary to human experiences.
The computational methods heavily rely on high-quality antibody structure data, which is quite limited.
Fortunately, there exists a large amount of sequence data of antibodies that can help model the CDR and alleviate the reliance on structure data.
arXiv Detail & Related papers (2022-10-26T15:31:36Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.