A Hierarchical Training Paradigm for Antibody Structure-sequence
Co-design
- URL: http://arxiv.org/abs/2311.16126v1
- Date: Mon, 30 Oct 2023 02:39:15 GMT
- Title: A Hierarchical Training Paradigm for Antibody Structure-sequence
Co-design
- Authors: Fang Wu, Stan Z. Li
- Abstract summary: We propose a hierarchical training paradigm (HTP) for the antibody sequence-structure co-design.
HTP consists of four levels of training stages, each corresponding to a specific protein modality.
Empirical experiments show that HTP sets the new state-of-the-art performance in the co-design problem.
- Score: 54.30457372514873
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Therapeutic antibodies are an essential and rapidly expanding drug modality.
The binding specificity between antibodies and antigens is decided by
complementarity-determining regions (CDRs) at the tips of these Y-shaped
proteins. In this paper, we propose a hierarchical training paradigm (HTP) for
the antibody sequence-structure co-design. HTP consists of four levels of
training stages, each corresponding to a specific protein modality within a
particular protein domain. Through carefully crafted tasks in different stages,
HTP seamlessly and effectively integrates geometric graph neural networks
(GNNs) with large-scale protein language models to excavate evolutionary
information from not only geometric structures but also vast antibody and
non-antibody sequence databases, which determines ligand binding pose and
strength. Empirical experiments show that HTP sets the new state-of-the-art
performance in the co-design problem as well as the fix-backbone design. Our
research offers a hopeful path to unleash the potential of deep generative
architectures and seeks to illuminate the way forward for the antibody sequence
and structure co-design challenge.
Related papers
- SFM-Protein: Integrative Co-evolutionary Pre-training for Advanced Protein Sequence Representation [97.99658944212675]
We introduce a novel pre-training strategy for protein foundation models.
It emphasizes the interactions among amino acid residues to enhance the extraction of both short-range and long-range co-evolutionary features.
Trained on a large-scale protein sequence dataset, our model demonstrates superior generalization ability.
arXiv Detail & Related papers (2024-10-31T15:22:03Z) - AbODE: Ab Initio Antibody Design using Conjoined ODEs [8.523238510909955]
We develop a new generative model AbODE that extends graph PDEs to accommodate both contextual information and external interactions.
We unravel fundamental connections between AbODE and temporal networks as well as graph-matching networks.
arXiv Detail & Related papers (2023-05-31T14:40:47Z) - Cross-Gate MLP with Protein Complex Invariant Embedding is A One-Shot
Antibody Designer [58.97153056120193]
The specificity of an antibody is determined by its complementarity-determining regions (CDRs)
Previous studies have utilized complex techniques to generate CDRs, but they suffer from inadequate geometric modeling.
We propose a textitsimple yet effective model that can co-design 1D sequences and 3D structures of CDRs in a one-shot manner.
arXiv Detail & Related papers (2023-04-21T13:24:26Z) - End-to-End Full-Atom Antibody Design [28.199522831859998]
There are two major defects in current learning-based methods, making them suboptimal or resource-intensive.
We propose dynamic Multi-channel Equivariant grAph Network (dyMEAN), an end-to-end full-atom model for E(3) antibody design.
Both 1D sequences and 3D structures are updated via an adaptive multi-channel encoder that is able to process protein residues of variable sizes when considering full atoms.
Experiments on CDR-H3 design, complex structure prediction, and affinity optimization demonstrate the superiority of our end-to-end framework and full-atom modeling.
arXiv Detail & Related papers (2023-02-01T03:13:19Z) - xTrimoABFold: De novo Antibody Structure Prediction without MSA [77.47606749555686]
We develop a novel model named xTrimoABFold to predict antibody structure from antibody sequence.
The model was trained end-to-end on the antibody structures in PDB by minimizing the ensemble loss of domain-specific focal loss on CDR and the frame-aligned point loss.
arXiv Detail & Related papers (2022-11-30T09:26:08Z) - Incorporating Pre-training Paradigm for Antibody Sequence-Structure
Co-design [134.65287929316673]
Deep learning-based computational antibody design has attracted popular attention since it automatically mines the antibody patterns from data that could be complementary to human experiences.
The computational methods heavily rely on high-quality antibody structure data, which is quite limited.
Fortunately, there exists a large amount of sequence data of antibodies that can help model the CDR and alleviate the reliance on structure data.
arXiv Detail & Related papers (2022-10-26T15:31:36Z) - Structured Q-learning For Antibody Design [82.78798397798533]
One of the crucial steps involved in antibody design is to find an arrangement of amino acids in a protein sequence that improves its binding with a pathogen.
Combinatorial optimization of antibodies is difficult due to extremely large search spaces and non-linear objectives.
Applying traditional Reinforcement Learning to antibody design optimization results in poor performance.
We propose Q-learning, an extension of Q-learning that incorporates structural priors for optimization.
arXiv Detail & Related papers (2022-09-10T15:36:55Z) - Iterative Refinement Graph Neural Network for Antibody
Sequence-Structure Co-design [35.215029426177004]
We propose a generative model to automatically design antibodies with enhanced binding specificity or neutralization capabilities.
Our method achieves superior log-likelihood on the test set and outperforms previous baselines in designing antibodies capable of neutralizing the SARS-CoV-2 virus.
arXiv Detail & Related papers (2021-10-09T18:23:32Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.