Inverse folding for antibody sequence design using deep learning
- URL: http://arxiv.org/abs/2310.19513v1
- Date: Mon, 30 Oct 2023 13:12:41 GMT
- Title: Inverse folding for antibody sequence design using deep learning
- Authors: Fr\'ed\'eric A. Dreyer, Daniel Cutting, Constantin Schneider, Henry
Kenlay, Charlotte M. Deane
- Abstract summary: We propose a fine-tuned folding inverse model that is specifically optimised for antibody structures.
We study the canonical conformations of complementarity-determining regions and find improved encoding of these loops into known clusters.
- Score: 2.8998926117101367
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: We consider the problem of antibody sequence design given 3D structural
information. Building on previous work, we propose a fine-tuned inverse folding
model that is specifically optimised for antibody structures and outperforms
generic protein models on sequence recovery and structure robustness when
applied on antibodies, with notable improvement on the hypervariable CDR-H3
loop. We study the canonical conformations of complementarity-determining
regions and find improved encoding of these loops into known clusters. Finally,
we consider the applications of our model to drug discovery and binder design
and evaluate the quality of proposed sequences using physics-based methods.
Related papers
- Retrieval Augmented Diffusion Model for Structure-informed Antibody Design and Optimization [8.546688995090491]
Antibodies are essential proteins responsible for immune responses in organisms.
Recent advances in generative models have significantly enhanced rational antibody design.
We propose a retrieval-augmented diffusion framework, termed RADAb, for efficient antibody design.
arXiv Detail & Related papers (2024-10-19T08:53:01Z) - Antigen-Specific Antibody Design via Direct Energy-based Preference Optimization [51.28231365213679]
We tackle antigen-specific antibody sequence-structure co-design as an optimization problem towards specific preferences.
We propose direct energy-based preference optimization to guide the generation of antibodies with both rational structures and considerable binding affinities to given antigens.
arXiv Detail & Related papers (2024-03-25T09:41:49Z) - Decoupled Sequence and Structure Generation for Realistic Antibody Design [45.72237864940556]
We propose an antibody sequence-structure decoupling (ASSD) framework, which separates sequence generation and structure prediction.
We also find that the widely used non-autoregressive generators promote sequences with overly repeating tokens.
Our results demonstrate that ASSD consistently outperforms existing antibody design models.
arXiv Detail & Related papers (2024-02-08T13:02:05Z) - A Hierarchical Training Paradigm for Antibody Structure-sequence
Co-design [54.30457372514873]
We propose a hierarchical training paradigm (HTP) for the antibody sequence-structure co-design.
HTP consists of four levels of training stages, each corresponding to a specific protein modality.
Empirical experiments show that HTP sets the new state-of-the-art performance in the co-design problem.
arXiv Detail & Related papers (2023-10-30T02:39:15Z) - Cross-Gate MLP with Protein Complex Invariant Embedding is A One-Shot
Antibody Designer [58.97153056120193]
The specificity of an antibody is determined by its complementarity-determining regions (CDRs)
Previous studies have utilized complex techniques to generate CDRs, but they suffer from inadequate geometric modeling.
We propose a textitsimple yet effective model that can co-design 1D sequences and 3D structures of CDRs in a one-shot manner.
arXiv Detail & Related papers (2023-04-21T13:24:26Z) - xTrimoABFold: De novo Antibody Structure Prediction without MSA [77.47606749555686]
We develop a novel model named xTrimoABFold to predict antibody structure from antibody sequence.
The model was trained end-to-end on the antibody structures in PDB by minimizing the ensemble loss of domain-specific focal loss on CDR and the frame-aligned point loss.
arXiv Detail & Related papers (2022-11-30T09:26:08Z) - Incorporating Pre-training Paradigm for Antibody Sequence-Structure
Co-design [134.65287929316673]
Deep learning-based computational antibody design has attracted popular attention since it automatically mines the antibody patterns from data that could be complementary to human experiences.
The computational methods heavily rely on high-quality antibody structure data, which is quite limited.
Fortunately, there exists a large amount of sequence data of antibodies that can help model the CDR and alleviate the reliance on structure data.
arXiv Detail & Related papers (2022-10-26T15:31:36Z) - Structured Q-learning For Antibody Design [82.78798397798533]
One of the crucial steps involved in antibody design is to find an arrangement of amino acids in a protein sequence that improves its binding with a pathogen.
Combinatorial optimization of antibodies is difficult due to extremely large search spaces and non-linear objectives.
Applying traditional Reinforcement Learning to antibody design optimization results in poor performance.
We propose Q-learning, an extension of Q-learning that incorporates structural priors for optimization.
arXiv Detail & Related papers (2022-09-10T15:36:55Z) - Conditional Antibody Design as 3D Equivariant Graph Translation [28.199522831859998]
We propose Multi-channel Equivariant Attention Network (MEAN) to co-design 1D sequences and 3D structures of CDRs.
Our method significantly surpasses state-of-the-art models in sequence and structure modeling, antigen-binding CDR design, and binding affinity optimization.
arXiv Detail & Related papers (2022-08-12T01:00:59Z) - Benchmarking deep generative models for diverse antibody sequence design [18.515971640245997]
Deep generative models that learn from sequences alone or from sequences and structures jointly have shown impressive performance on this task.
We consider three recently proposed deep generative frameworks for protein design: (AR) the sequence-based autoregressive generative model, (GVP) the precise structure-based graph neural network, and Fold2Seq that leverages a fuzzy and scale-free representation of a three-dimensional fold.
We benchmark these models on the task of computational design of antibody sequences, which demand designing sequences with high diversity for functional implication.
arXiv Detail & Related papers (2021-11-12T16:23:32Z) - Iterative Refinement Graph Neural Network for Antibody
Sequence-Structure Co-design [35.215029426177004]
We propose a generative model to automatically design antibodies with enhanced binding specificity or neutralization capabilities.
Our method achieves superior log-likelihood on the test set and outperforms previous baselines in designing antibodies capable of neutralizing the SARS-CoV-2 virus.
arXiv Detail & Related papers (2021-10-09T18:23:32Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.