Guided Sequence-Structure Generative Modeling for Iterative Antibody Optimization
- URL: http://arxiv.org/abs/2509.16357v1
- Date: Fri, 19 Sep 2025 19:03:37 GMT
- Title: Guided Sequence-Structure Generative Modeling for Iterative Antibody Optimization
- Authors: Aniruddh Raghu, Sebastian Ober, Maxwell Kazman, Hunter Elliott,
- Abstract summary: Therapeutic antibody candidates often require extensive engineering to improve key functional and developability properties before clinical development.<n>This can be achieved through iterative design, where starting molecules are optimized over several rounds of in vitro experiments.<n>We propose a strategy for iterative antibody optimization that leverages both sequence and structure as well as accumulating lab measurements of binding and developability.
- Score: 1.979371144736248
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Therapeutic antibody candidates often require extensive engineering to improve key functional and developability properties before clinical development. This can be achieved through iterative design, where starting molecules are optimized over several rounds of in vitro experiments. While protein structure can provide a strong inductive bias, it is rarely used in iterative design due to the lack of structural data for continually evolving lead molecules over the course of optimization. In this work, we propose a strategy for iterative antibody optimization that leverages both sequence and structure as well as accumulating lab measurements of binding and developability. Building on prior work, we first train a sequence-structure diffusion generative model that operates on antibody-antigen complexes. We then outline an approach to use this model, together with carefully predicted antibody-antigen complexes, to optimize lead candidates throughout the iterative design process. Further, we describe a guided sampling approach that biases generation toward desirable properties by integrating models trained on experimental data from iterative design. We evaluate our approach in multiple in silico and in vitro experiments, demonstrating that it produces high-affinity binders at multiple stages of an active antibody optimization campaign.
Related papers
- Efficient Protein Optimization via Structure-aware Hamiltonian Dynamics [16.336540408998598]
HADES is a Bayesian optimization method utilizing Hamiltonian dynamics to efficiently sample from a structure-aware approximated posterior.<n>A position discretization procedure is introduced to propose discrete protein sequences from such a continuous state system.<n>Experiments demonstrate that our method outperforms state-of-the-art baselines in in-silico evaluations.
arXiv Detail & Related papers (2026-01-16T05:53:53Z) - Protein Inverse Folding From Structure Feedback [78.27854221882572]
We introduce a novel approach to fine-tune an inverse folding model using feedback from a protein folding model.<n>Our results on the CATH 4.2 test set demonstrate that DPO fine-tuning leads to a significant improvement in average TM-Score.
arXiv Detail & Related papers (2025-06-03T16:02:12Z) - AffinityFlow: Guided Flows for Antibody Affinity Maturation [6.690846683150576]
Antibodies are widely used as therapeutics, but their development requires affinity maturation to enhance binding affinity.<n>Recently AlphaFlow wraps AlphaFold within flow matching to generate diverse protein structures, enabling a sequence-conditioned generative model of structure.<n>We propose an alternating optimization framework that (1) fixes the sequence to guide structure generation toward high binding affinity using a structure-based affinity predictor, then (2) applies inverse folding to create sequence mutations, refined by a sequence-based affinity predictor for post selection.
arXiv Detail & Related papers (2025-02-14T18:43:22Z) - AlignAb: Pareto-Optimal Energy Alignment for Designing Nature-Like Antibodies [4.679586996508103]
We present a three-stage framework for training deep learning models specializing in antibody sequence-structure co-design.<n>We employ the learned representations to guide the training of a diffusion model for joint optimization over both sequence and structure of antibodies.
arXiv Detail & Related papers (2024-12-30T14:50:32Z) - Retrieval Augmented Diffusion Model for Structure-informed Antibody Design and Optimization [8.546688995090491]
Antibodies are essential proteins responsible for immune responses in organisms.
Recent advances in generative models have significantly enhanced rational antibody design.
We propose a retrieval-augmented diffusion framework, termed RADAb, for efficient antibody design.
arXiv Detail & Related papers (2024-10-19T08:53:01Z) - DecompOpt: Controllable and Decomposed Diffusion Models for Structure-based Molecular Optimization [49.85944390503957]
DecompOpt is a structure-based molecular optimization method based on a controllable and diffusion model.
We show that DecompOpt can efficiently generate molecules with improved properties than strong de novo baselines.
arXiv Detail & Related papers (2024-03-07T02:53:40Z) - Protein Design with Guided Discrete Diffusion [67.06148688398677]
A popular approach to protein design is to combine a generative model with a discriminative model for conditional sampling.
We propose diffusioN Optimized Sampling (NOS), a guidance method for discrete diffusion models.
NOS makes it possible to perform design directly in sequence space, circumventing significant limitations of structure-based methods.
arXiv Detail & Related papers (2023-05-31T16:31:24Z) - Cross-Gate MLP with Protein Complex Invariant Embedding is A One-Shot
Antibody Designer [58.97153056120193]
The specificity of an antibody is determined by its complementarity-determining regions (CDRs)
Previous studies have utilized complex techniques to generate CDRs, but they suffer from inadequate geometric modeling.
We propose a textitsimple yet effective model that can co-design 1D sequences and 3D structures of CDRs in a one-shot manner.
arXiv Detail & Related papers (2023-04-21T13:24:26Z) - A Pareto-optimal compositional energy-based model for sampling and
optimization of protein sequences [55.25331349436895]
Deep generative models have emerged as a popular machine learning-based approach for inverse problems in the life sciences.
These problems often require sampling new designs that satisfy multiple properties of interest in addition to learning the data distribution.
arXiv Detail & Related papers (2022-10-19T19:04:45Z) - ODBO: Bayesian Optimization with Search Space Prescreening for Directed Protein Evolution [18.726398852721204]
We propose an efficient, experimental design-oriented closed-loop optimization framework for protein directed evolution.
ODBO employs a combination of novel low-dimensional protein encoding strategy and Bayesian optimization enhanced with search space prescreening via outlier detection.
We conduct and report four protein directed evolution experiments that substantiate the capability of the proposed framework for finding variants with properties of interest.
arXiv Detail & Related papers (2022-05-19T13:21:31Z) - EBM-Fold: Fully-Differentiable Protein Folding Powered by Energy-based
Models [53.17320541056843]
We propose a fully-differentiable approach for protein structure optimization, guided by a data-driven generative network.
Our EBM-Fold approach can efficiently produce high-quality decoys, compared against traditional Rosetta-based structure optimization routines.
arXiv Detail & Related papers (2021-05-11T03:40:29Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.