Pareto Optimization to Accelerate Multi-Objective Virtual Screening
- URL: http://arxiv.org/abs/2310.10598v1
- Date: Mon, 16 Oct 2023 17:19:46 GMT
- Title: Pareto Optimization to Accelerate Multi-Objective Virtual Screening
- Authors: Jenna C. Fromer, David E. Graff, Connor W. Coley
- Abstract summary: We develop a tool to search a virtual library of over 4M molecules for those predicted to be selective dual inhibitors of EGFR and IGF1R.
This workflow and associated open source software can reduce the screening burden of molecular design projects.
- Score: 11.356174411578515
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: The discovery of therapeutic molecules is fundamentally a multi-objective
optimization problem. One formulation of the problem is to identify molecules
that simultaneously exhibit strong binding affinity for a target protein,
minimal off-target interactions, and suitable pharmacokinetic properties.
Inspired by prior work that uses active learning to accelerate the
identification of strong binders, we implement multi-objective Bayesian
optimization to reduce the computational cost of multi-property virtual
screening and apply it to the identification of ligands predicted to be
selective based on docking scores to on- and off-targets. We demonstrate the
superiority of Pareto optimization over scalarization across three case
studies. Further, we use the developed optimization tool to search a virtual
library of over 4M molecules for those predicted to be selective dual
inhibitors of EGFR and IGF1R, acquiring 100% of the molecules that form the
library's Pareto front after exploring only 8% of the library. This workflow
and associated open source software can reduce the screening burden of
molecular design projects and is complementary to research aiming to improve
the accuracy of binding predictions and other molecular properties.
Related papers
- Text-Guided Multi-Property Molecular Optimization with a Diffusion Language Model [77.50732023411811]
We propose a text-guided multi-property molecular optimization method utilizing transformer-based diffusion language model (TransDLM)
TransDLM leverages standardized chemical nomenclature as semantic representations of molecules and implicitly embeds property requirements into textual descriptions.
Our approach surpasses state-of-the-art methods in optimizing molecular structural similarity and enhancing chemical properties on the benchmark dataset.
arXiv Detail & Related papers (2024-10-17T14:30:27Z) - XMOL: Explainable Multi-property Optimization of Molecules [2.320539066224081]
We propose Explainable Multi-property Optimization of Molecules (XMOL) to optimize multiple molecular properties simultaneously.
Our approach builds on state-of-the-art geometric diffusion models, extending them to multi-property optimization.
We integrate interpretive and explainable techniques throughout the optimization process.
arXiv Detail & Related papers (2024-09-12T06:35:04Z) - Active learning for affinity prediction of antibodies [45.58662352490961]
For large molecules such as antibodies, identifying mutations that enhance antibody affinity is challenging.
FERB methods can offer valuable insights into how different mutations will impact the potency and selectivity of a drug candidate.
We present an active learning framework that iteratively proposes sequences for simulators to evaluate, thereby accelerating the search for improved binders.
arXiv Detail & Related papers (2024-06-11T13:42:49Z) - Computer-Aided Multi-Objective Optimization in Small Molecule Discovery [3.032184156362992]
We describe pool-based and de novo generative approaches to multi-objective molecular discovery.
We show how pool-based molecular discovery is a relatively direct extension of multi-objective Bayesian optimization.
We discuss some remaining challenges and opportunities in the field.
arXiv Detail & Related papers (2022-10-13T17:33:07Z) - De novo design of protein target specific scaffold-based Inhibitors via
Reinforcement Learning [8.210294479991118]
Current approaches to develop molecules for a target protein are intuition-driven, hampered by slow iterative design-test cycles.
We propose a novel framework, called 3D-MolGNN$_RL$, coupling reinforcement learning to a deep generative model based on 3D-Scaffold.
Our approach can serve as an interpretable artificial intelligence (AI) tool for lead optimization with optimized activity, potency, and biophysical properties.
arXiv Detail & Related papers (2022-05-21T00:47:35Z) - Molecular Attributes Transfer from Non-Parallel Data [57.010952598634944]
We formulate molecular optimization as a style transfer problem and present a novel generative model that could automatically learn internal differences between two groups of non-parallel data.
Experiments on two molecular optimization tasks, toxicity modification and synthesizability improvement, demonstrate that our model significantly outperforms several state-of-the-art methods.
arXiv Detail & Related papers (2021-11-30T06:10:22Z) - Improved Drug-target Interaction Prediction with Intermolecular Graph
Transformer [98.8319016075089]
We propose a novel approach to model intermolecular information with a three-way Transformer-based architecture.
Intermolecular Graph Transformer (IGT) outperforms state-of-the-art approaches by 9.1% and 20.5% over the second best for binding activity and binding pose prediction respectively.
IGT exhibits promising drug screening ability against SARS-CoV-2 by identifying 83.1% active drugs that have been validated by wet-lab experiments with near-native predicted binding poses.
arXiv Detail & Related papers (2021-10-14T13:28:02Z) - Optimizing Molecules using Efficient Queries from Property Evaluations [66.66290256377376]
We propose QMO, a generic query-based molecule optimization framework.
QMO improves the desired properties of an input molecule based on efficient queries.
We show that QMO outperforms existing methods in the benchmark tasks of optimizing small organic molecules.
arXiv Detail & Related papers (2020-11-03T18:51:18Z) - MIMOSA: Multi-constraint Molecule Sampling for Molecule Optimization [51.00815310242277]
generative models and reinforcement learning approaches made initial success, but still face difficulties in simultaneously optimizing multiple drug properties.
We propose the MultI-constraint MOlecule SAmpling (MIMOSA) approach, a sampling framework to use input molecule as an initial guess and sample molecules from the target distribution.
arXiv Detail & Related papers (2020-10-05T20:18:42Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.