Generative Antibody Design for Complementary Chain Pairing Sequences
through Encoder-Decoder Language Model
- URL: http://arxiv.org/abs/2301.02748v4
- Date: Mon, 20 Nov 2023 19:45:49 GMT
- Title: Generative Antibody Design for Complementary Chain Pairing Sequences
through Encoder-Decoder Language Model
- Authors: Simon K.S. Chu, Kathy Y. Wei
- Abstract summary: We present paired T5 (pAbT5), an encoder-decoder model to generate complementary heavy or light chain from its pairing partner.
Our results showcase the potential of pAbT5 in generative antibody design, incorporating biological constraints from chain pairing preferences.
- Score: 0.0
- License: http://creativecommons.org/licenses/by-nc-sa/4.0/
- Abstract: Current protein language models (pLMs) predominantly focus on single-chain
protein sequences and often have not accounted for constraints on generative
design imposed by protein-protein interactions. To address this gap, we present
paired Antibody T5 (pAbT5), an encoder-decoder model to generate complementary
heavy or light chain from its pairing partner. We show that our model respects
conservation in framework regions and variability in hypervariable domains,
demonstrated by agreement with sequence alignment and variable-length CDR
loops. We also show that our model captures chain pairing preferences through
the recovery of ground-truth chain type and gene families. Our results showcase
the potential of pAbT5 in generative antibody design, incorporating biological
constraints from chain pairing preferences.
Related papers
- IgCraft: A versatile sequence generation framework for antibody discovery and engineering [0.0]
IgCraft is a multi-purpose model for paired human antibody sequence generation, built on Bayesian Flow Networks.
By integrating previously separate tasks into a single scalable generative model, IgCraft provides a versatile platform for sampling human antibody sequences.
arXiv Detail & Related papers (2025-03-25T16:32:03Z) - Steering Masked Discrete Diffusion Models via Discrete Denoising Posterior Prediction [88.65168366064061]
We introduce Discrete Denoising Posterior Prediction (DDPP), a novel framework that casts the task of steering pre-trained MDMs as a problem of probabilistic inference.
Our framework leads to a family of three novel objectives that are all simulation-free, and thus scalable.
We substantiate our designs via wet-lab validation, where we observe transient expression of reward-optimized protein sequences.
arXiv Detail & Related papers (2024-10-10T17:18:30Z) - De novo antibody design with SE(3) diffusion [0.3666125285899499]
We introduce IgDiff, an antibody variable domain diffusion model based on a general protein backbone diffusion framework.
We find that IgDiff produces highly designable antibodies that can contain novel binding regions.
We compare our model with a state-of-the-art generative backbone diffusion model on a range of antibody design tasks.
arXiv Detail & Related papers (2024-05-13T10:27:17Z) - Decoupled Sequence and Structure Generation for Realistic Antibody Design [45.72237864940556]
We propose an antibody sequence-structure decoupling (ASSD) framework, which separates sequence generation and structure prediction.
We also find that the widely used non-autoregressive generators promote sequences with overly repeating tokens.
Our results demonstrate that ASSD consistently outperforms existing antibody design models.
arXiv Detail & Related papers (2024-02-08T13:02:05Z) - A Hierarchical Training Paradigm for Antibody Structure-sequence
Co-design [54.30457372514873]
We propose a hierarchical training paradigm (HTP) for the antibody sequence-structure co-design.
HTP consists of four levels of training stages, each corresponding to a specific protein modality.
Empirical experiments show that HTP sets the new state-of-the-art performance in the co-design problem.
arXiv Detail & Related papers (2023-10-30T02:39:15Z) - AbODE: Ab Initio Antibody Design using Conjoined ODEs [8.523238510909955]
We develop a new generative model AbODE that extends graph PDEs to accommodate both contextual information and external interactions.
We unravel fundamental connections between AbODE and temporal networks as well as graph-matching networks.
arXiv Detail & Related papers (2023-05-31T14:40:47Z) - Cross-Gate MLP with Protein Complex Invariant Embedding is A One-Shot
Antibody Designer [58.97153056120193]
The specificity of an antibody is determined by its complementarity-determining regions (CDRs)
Previous studies have utilized complex techniques to generate CDRs, but they suffer from inadequate geometric modeling.
We propose a textitsimple yet effective model that can co-design 1D sequences and 3D structures of CDRs in a one-shot manner.
arXiv Detail & Related papers (2023-04-21T13:24:26Z) - Incorporating Pre-training Paradigm for Antibody Sequence-Structure
Co-design [134.65287929316673]
Deep learning-based computational antibody design has attracted popular attention since it automatically mines the antibody patterns from data that could be complementary to human experiences.
The computational methods heavily rely on high-quality antibody structure data, which is quite limited.
Fortunately, there exists a large amount of sequence data of antibodies that can help model the CDR and alleviate the reliance on structure data.
arXiv Detail & Related papers (2022-10-26T15:31:36Z) - Reprogramming Pretrained Language Models for Antibody Sequence Infilling [72.13295049594585]
Computational design of antibodies involves generating novel and diverse sequences, while maintaining structural consistency.
Recent deep learning models have shown impressive results, however the limited number of known antibody sequence/structure pairs frequently leads to degraded performance.
In our work we address this challenge by leveraging Model Reprogramming (MR), which repurposes pretrained models on a source language to adapt to the tasks that are in a different language and have scarce data.
arXiv Detail & Related papers (2022-10-05T20:44:55Z) - Iterative Refinement Graph Neural Network for Antibody
Sequence-Structure Co-design [35.215029426177004]
We propose a generative model to automatically design antibodies with enhanced binding specificity or neutralization capabilities.
Our method achieves superior log-likelihood on the test set and outperforms previous baselines in designing antibodies capable of neutralizing the SARS-CoV-2 virus.
arXiv Detail & Related papers (2021-10-09T18:23:32Z) - Conditional Hybrid GAN for Sequence Generation [56.67961004064029]
We propose a novel conditional hybrid GAN (C-Hybrid-GAN) to solve this issue.
We exploit the Gumbel-Softmax technique to approximate the distribution of discrete-valued sequences.
We demonstrate that the proposed C-Hybrid-GAN outperforms the existing methods in context-conditioned discrete-valued sequence generation.
arXiv Detail & Related papers (2020-09-18T03:52:55Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.