Active learning for energy-based antibody optimization and enhanced screening
- URL: http://arxiv.org/abs/2409.10964v2
- Date: Wed, 18 Sep 2024 07:37:31 GMT
- Title: Active learning for energy-based antibody optimization and enhanced screening
- Authors: Kairi Furui, Masahito Ohue,
- Abstract summary: We propose an active learning workflow that efficiently trains a deep learning model to learn energy functions for specific targets.
In a case study targeting HER2-binding Trastuzumab mutants, our approach significantly improved the screening performance over random selection.
- Score: 0.0
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Accurate prediction and optimization of protein-protein binding affinity is crucial for therapeutic antibody development. Although machine learning-based prediction methods $\Delta\Delta G$ are suitable for large-scale mutant screening, they struggle to predict the effects of multiple mutations for targets without existing binders. Energy function-based methods, though more accurate, are time consuming and not ideal for large-scale screening. To address this, we propose an active learning workflow that efficiently trains a deep learning model to learn energy functions for specific targets, combining the advantages of both approaches. Our method integrates the RDE-Network deep learning model with Rosetta's energy function-based Flex ddG to efficiently explore mutants. In a case study targeting HER2-binding Trastuzumab mutants, our approach significantly improved the screening performance over random selection and demonstrated the ability to identify mutants with better binding properties without experimental $\Delta\Delta G$ data. This workflow advances computational antibody design by combining machine learning, physics-based computations, and active learning to achieve more efficient antibody development.
Related papers
- Active learning for affinity prediction of antibodies [45.58662352490961]
For large molecules such as antibodies, identifying mutations that enhance antibody affinity is challenging.
FERB methods can offer valuable insights into how different mutations will impact the potency and selectivity of a drug candidate.
We present an active learning framework that iteratively proposes sequences for simulators to evaluate, thereby accelerating the search for improved binders.
arXiv Detail & Related papers (2024-06-11T13:42:49Z) - Gradual Optimization Learning for Conformational Energy Minimization [69.36925478047682]
Gradual Optimization Learning Framework (GOLF) for energy minimization with neural networks significantly reduces the required additional data.
Our results demonstrate that the neural network trained with GOLF performs on par with the oracle on a benchmark of diverse drug-like molecules.
arXiv Detail & Related papers (2023-11-05T11:48:08Z) - Pareto Optimization to Accelerate Multi-Objective Virtual Screening [11.356174411578515]
We develop a tool to search a virtual library of over 4M molecules for those predicted to be selective dual inhibitors of EGFR and IGF1R.
This workflow and associated open source software can reduce the screening burden of molecular design projects.
arXiv Detail & Related papers (2023-10-16T17:19:46Z) - PIGNet2: A Versatile Deep Learning-based Protein-Ligand Interaction
Prediction Model for Binding Affinity Scoring and Virtual Screening [0.0]
Prediction of protein-ligand interactions (PLI) plays a crucial role in drug discovery.
The development of a versatile model capable of accurately scoring binding affinity and conducting efficient virtual screening remains a challenge.
Here, we propose a viable solution by introducing a novel data augmentation strategy combined with a physics-informed graph neural network.
arXiv Detail & Related papers (2023-07-03T14:46:49Z) - Robust Learning with Progressive Data Expansion Against Spurious
Correlation [65.83104529677234]
We study the learning process of a two-layer nonlinear convolutional neural network in the presence of spurious features.
Our analysis suggests that imbalanced data groups and easily learnable spurious features can lead to the dominance of spurious features during the learning process.
We propose a new training algorithm called PDE that efficiently enhances the model's robustness for a better worst-group performance.
arXiv Detail & Related papers (2023-06-08T05:44:06Z) - Consistent Training via Energy-Based GFlowNets for Modeling Discrete
Joint Distributions [79.47120571256026]
Generative Flow Networks (GFlowNets) have demonstrated significant performance improvements for generating diverse objects $x$ given a reward function $R(x)$.
We build upon recent work on jointly learning energy-based models with GFlowNets and extend it to learn the joint over multiple variables.
We find that this joint training or joint energy-based formulation leads to significant improvements in generating anti-microbial peptides.
arXiv Detail & Related papers (2022-11-01T16:25:14Z) - Towards Learning Universal Hyperparameter Optimizers with Transformers [57.35920571605559]
We introduce the OptFormer, the first text-based Transformer HPO framework that provides a universal end-to-end interface for jointly learning policy and function prediction.
Our experiments demonstrate that the OptFormer can imitate at least 7 different HPO algorithms, which can be further improved via its function uncertainty estimates.
arXiv Detail & Related papers (2022-05-26T12:51:32Z) - De novo design of protein target specific scaffold-based Inhibitors via
Reinforcement Learning [8.210294479991118]
Current approaches to develop molecules for a target protein are intuition-driven, hampered by slow iterative design-test cycles.
We propose a novel framework, called 3D-MolGNN$_RL$, coupling reinforcement learning to a deep generative model based on 3D-Scaffold.
Our approach can serve as an interpretable artificial intelligence (AI) tool for lead optimization with optimized activity, potency, and biophysical properties.
arXiv Detail & Related papers (2022-05-21T00:47:35Z) - Using Genetic Programming to Predict and Optimize Protein Function [65.25258357832584]
We propose POET, a computational Genetic Programming tool based on evolutionary methods to enhance screening and mutagenesis in Directed Evolution.
As a proof-of-concept we use peptides that generate MRI contrast detected by the Chemical Exchange Saturation Transfer mechanism.
Our results indicate that a computational modelling tool like POET can help to find peptides with 400% better functionality than used before.
arXiv Detail & Related papers (2022-02-08T18:08:08Z) - Maximum Mutation Reinforcement Learning for Scalable Control [25.935468948833073]
Reinforcement Learning (RL) has demonstrated data efficiency and optimal control over large state spaces at the cost of scalable performance.
We present the Evolution-based Soft Actor-Critic (ESAC), a scalable RL algorithm.
arXiv Detail & Related papers (2020-07-24T16:29:19Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.