Simple End-to-end Deep Learning Model for CDR-H3 Loop Structure
Prediction
- URL: http://arxiv.org/abs/2111.10656v1
- Date: Sat, 20 Nov 2021 18:55:09 GMT
- Title: Simple End-to-end Deep Learning Model for CDR-H3 Loop Structure
Prediction
- Authors: Natalia Zenkova, Ekaterina Sedykh, Tatiana Shugaeva, Vladislav
Strashko, Timofei Ermak, Aleksei Shpilman
- Abstract summary: We present an end-to-end model to predict CDR H3 loop structure, that performs on par with state-of-the-art methods.
We also raise an issue with a commonly used RosettaAntibody benchmark that leads to data leaks.
- Score: 0.0
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Predicting a structure of an antibody from its sequence is important since it
allows for a better design process of synthetic antibodies that play a vital
role in the health industry. Most of the structure of an antibody is
conservative. The most variable and hard-to-predict part is the {\it third
complementarity-determining region of the antibody heavy chain} (CDR H3).
Lately, deep learning has been employed to solve the task of CDR H3 prediction.
However, current state-of-the-art methods are not end-to-end, but rather they
output inter-residue distances and orientations to the RosettaAntibody package
that uses this additional information alongside statistical and physics-based
methods to predict the 3D structure. This does not allow a fast screening
process and, therefore, inhibits the development of targeted synthetic
antibodies. In this work, we present an end-to-end model to predict CDR H3 loop
structure, that performs on par with state-of-the-art methods in terms of
accuracy but an order of magnitude faster. We also raise an issue with a
commonly used RosettaAntibody benchmark that leads to data leaks, i.e., the
presence of identical sequences in the train and test datasets.
Related papers
- Efficient Antibody Structure Refinement Using Energy-Guided SE(3) Flow Matching [16.192361788505558]
FlowAB is a novel antibody structure refinement method based on energy-guided flow matching.
It achieves new state-of-the-art performance on the antibody structure prediction task when used in conjunction with an appropriate prior model.
arXiv Detail & Related papers (2024-10-22T04:13:55Z) - AntibodyFlow: Normalizing Flow Model for Designing Antibody Complementarity-Determining Regions [9.427196604657215]
Therapeutic antibodies are specialized protective proteins that bind to antigens in a lock-to-key manner.
The binding strength/affinity between an antibody and a specific antigen is heavily determined by the complementarity-determining regions (CDRs) on the antibodies.
Existing machine learning methods cast in silico development of CDRs as either sequence or 3D graph (with a single chain) generation tasks.
arXiv Detail & Related papers (2024-06-19T02:31:23Z) - A Hierarchical Training Paradigm for Antibody Structure-sequence
Co-design [54.30457372514873]
We propose a hierarchical training paradigm (HTP) for the antibody sequence-structure co-design.
HTP consists of four levels of training stages, each corresponding to a specific protein modality.
Empirical experiments show that HTP sets the new state-of-the-art performance in the co-design problem.
arXiv Detail & Related papers (2023-10-30T02:39:15Z) - Cross-Gate MLP with Protein Complex Invariant Embedding is A One-Shot
Antibody Designer [58.97153056120193]
The specificity of an antibody is determined by its complementarity-determining regions (CDRs)
Previous studies have utilized complex techniques to generate CDRs, but they suffer from inadequate geometric modeling.
We propose a textitsimple yet effective model that can co-design 1D sequences and 3D structures of CDRs in a one-shot manner.
arXiv Detail & Related papers (2023-04-21T13:24:26Z) - End-to-End Full-Atom Antibody Design [28.199522831859998]
There are two major defects in current learning-based methods, making them suboptimal or resource-intensive.
We propose dynamic Multi-channel Equivariant grAph Network (dyMEAN), an end-to-end full-atom model for E(3) antibody design.
Both 1D sequences and 3D structures are updated via an adaptive multi-channel encoder that is able to process protein residues of variable sizes when considering full atoms.
Experiments on CDR-H3 design, complex structure prediction, and affinity optimization demonstrate the superiority of our end-to-end framework and full-atom modeling.
arXiv Detail & Related papers (2023-02-01T03:13:19Z) - Deciphering RNA Secondary Structure Prediction: A Probabilistic K-Rook Matching Perspective [63.3632827588974]
We introduce RFold, a method that learns to predict the most matching K-Rook solution from the given sequence.
RFold achieves competitive performance and about eight times faster inference efficiency than state-of-the-art approaches.
arXiv Detail & Related papers (2022-12-02T16:34:56Z) - xTrimoABFold: De novo Antibody Structure Prediction without MSA [77.47606749555686]
We develop a novel model named xTrimoABFold to predict antibody structure from antibody sequence.
The model was trained end-to-end on the antibody structures in PDB by minimizing the ensemble loss of domain-specific focal loss on CDR and the frame-aligned point loss.
arXiv Detail & Related papers (2022-11-30T09:26:08Z) - Incorporating Pre-training Paradigm for Antibody Sequence-Structure
Co-design [134.65287929316673]
Deep learning-based computational antibody design has attracted popular attention since it automatically mines the antibody patterns from data that could be complementary to human experiences.
The computational methods heavily rely on high-quality antibody structure data, which is quite limited.
Fortunately, there exists a large amount of sequence data of antibodies that can help model the CDR and alleviate the reliance on structure data.
arXiv Detail & Related papers (2022-10-26T15:31:36Z) - Reprogramming Pretrained Language Models for Antibody Sequence Infilling [72.13295049594585]
Computational design of antibodies involves generating novel and diverse sequences, while maintaining structural consistency.
Recent deep learning models have shown impressive results, however the limited number of known antibody sequence/structure pairs frequently leads to degraded performance.
In our work we address this challenge by leveraging Model Reprogramming (MR), which repurposes pretrained models on a source language to adapt to the tasks that are in a different language and have scarce data.
arXiv Detail & Related papers (2022-10-05T20:44:55Z) - Conditional Antibody Design as 3D Equivariant Graph Translation [28.199522831859998]
We propose Multi-channel Equivariant Attention Network (MEAN) to co-design 1D sequences and 3D structures of CDRs.
Our method significantly surpasses state-of-the-art models in sequence and structure modeling, antigen-binding CDR design, and binding affinity optimization.
arXiv Detail & Related papers (2022-08-12T01:00:59Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.