Related papers: Pareto Optimization to Accelerate Multi-Objective Virtual Screening

URL: http://arxiv.org/abs/2310.10598v1
Date: Mon, 16 Oct 2023 17:19:46 GMT
Title: Pareto Optimization to Accelerate Multi-Objective Virtual Screening
Authors: Jenna C. Fromer, David E. Graff, Connor W. Coley
Abstract summary: We develop a tool to search a virtual library of over 4M molecules for those predicted to be selective dual inhibitors of EGFR and IGF1R. This workflow and associated open source software can reduce the screening burden of molecular design projects.
Score: 11.356174411578515
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: The discovery of therapeutic molecules is fundamentally a multi-objective optimization problem. One formulation of the problem is to identify molecules that simultaneously exhibit strong binding affinity for a target protein, minimal off-target interactions, and suitable pharmacokinetic properties. Inspired by prior work that uses active learning to accelerate the identification of strong binders, we implement multi-objective Bayesian optimization to reduce the computational cost of multi-property virtual screening and apply it to the identification of ligands predicted to be selective based on docking scores to on- and off-targets. We demonstrate the superiority of Pareto optimization over scalarization across three case studies. Further, we use the developed optimization tool to search a virtual library of over 4M molecules for those predicted to be selective dual inhibitors of EGFR and IGF1R, acquiring 100% of the molecules that form the library's Pareto front after exploring only 8% of the library. This workflow and associated open source software can reduce the screening burden of molecular design projects and is complementary to research aiming to improve the accuracy of binding predictions and other molecular properties.
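
The distinction the abstract draws between Pareto optimization and scalarization can be illustrated with a small sketch. The code below is not the authors' software. It assumes every objective is a docking-style score to be minimized, and the score values and the helper names (`dominates`, `pareto_front`, `best_by_weighted_sum`) are hypothetical, chosen only for illustration.

```python
from typing import List, Sequence


def dominates(a: Sequence[float], b: Sequence[float]) -> bool:
    """True if a is no worse than b in every objective and strictly
    better in at least one (all objectives are minimized)."""
    return all(x <= y for x, y in zip(a, b)) and any(x < y for x, y in zip(a, b))


def pareto_front(points: List[Sequence[float]]) -> List[int]:
    """Indices of the non-dominated points (simple O(n^2) scan)."""
    return [
        i
        for i, p in enumerate(points)
        if not any(dominates(q, p) for j, q in enumerate(points) if j != i)
    ]


def best_by_weighted_sum(points: List[Sequence[float]], weights: Sequence[float]) -> int:
    """Scalarization: index of the point with the lowest weighted sum."""
    return min(
        range(len(points)),
        key=lambda i: sum(w * x for w, x in zip(weights, points[i])),
    )


# Hypothetical scores for two targets (lower = stronger predicted binding).
scores = [(-9.0, -5.0), (-5.0, -9.0), (-7.5, -7.5), (-6.0, -6.0), (-8.0, -4.0)]

front = pareto_front(scores)                         # all non-dominated molecules
pick = best_by_weighted_sum(scores, (0.5, 0.5))      # one molecule per weight choice
print(front, pick)
```

A single weight vector returns one compromise molecule, whereas the Pareto front keeps every trade-off that no other molecule beats on all objectives at once. In a model-guided screen the same dominance test would be applied to predicted scores rather than known ones.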

Related papers

Collaborative Expert LLMs Guided Multi-Objective Molecular Optimization [51.104444856052204]
We present MultiMol, a collaborative large language model (LLM) system designed to guide multi-objective molecular optimization. In evaluations across six multi-objective optimization tasks, MultiMol significantly outperforms existing methods, achieving an 82.30% success rate.
arXiv Detail & Related papers (2025-03-05T13:47:55Z)
DrugImproverGPT: A Large Language Model for Drug Optimization with Fine-Tuning via Structured Policy Optimization [53.27954325490941]
Finetuning a Large Language Model (LLM) is crucial for generating results towards specific objectives. This research introduces a novel reinforcement learning algorithm to finetune a drug optimization LLM-based generative model.
arXiv Detail & Related papers (2025-02-11T04:00:21Z)
Text-Guided Multi-Property Molecular Optimization with a Diffusion Language Model [77.50732023411811]
We propose a text-guided multi-property molecular optimization method utilizing a transformer-based diffusion language model (TransDLM). TransDLM leverages standardized chemical nomenclature as semantic representations of molecules and implicitly embeds property requirements into textual descriptions. Our approach surpasses state-of-the-art methods in optimizing molecular structural similarity and enhancing chemical properties on the benchmark dataset.
arXiv Detail & Related papers (2024-10-17T14:30:27Z)
XMOL: Explainable Multi-property Optimization of Molecules [2.320539066224081]
We propose Explainable Multi-property Optimization of Molecules (XMOL) to optimize multiple molecular properties simultaneously. Our approach builds on state-of-the-art geometric diffusion models, extending them to multi-property optimization. We integrate interpretive and explainable techniques throughout the optimization process.
arXiv Detail & Related papers (2024-09-12T06:35:04Z)
Active learning for affinity prediction of antibodies [45.58662352490961]
For large molecules such as antibodies, identifying mutations that enhance antibody affinity is challenging. FERB methods can offer valuable insights into how different mutations will impact the potency and selectivity of a drug candidate. We present an active learning framework that iteratively proposes sequences for simulators to evaluate, thereby accelerating the search for improved binders.
arXiv Detail & Related papers (2024-06-11T13:42:49Z)
Computer-Aided Multi-Objective Optimization in Small Molecule Discovery [3.032184156362992]
We describe pool-based and de novo generative approaches to multi-objective molecular discovery. We show how pool-based molecular discovery is a relatively direct extension of multi-objective Bayesian optimization. We discuss some remaining challenges and opportunities in the field.
arXiv Detail & Related papers (2022-10-13T17:33:07Z)
De novo design of protein target specific scaffold-based Inhibitors via Reinforcement Learning [8.210294479991118]
Current approaches to develop molecules for a target protein are intuition-driven, hampered by slow iterative design-test cycles. We propose a novel framework, called 3D-MolGNN$_{RL}$, coupling reinforcement learning to a deep generative model based on 3D-Scaffold. Our approach can serve as an interpretable artificial intelligence (AI) tool for lead optimization with optimized activity, potency, and biophysical properties.
arXiv Detail & Related papers (2022-05-21T00:47:35Z)
Molecular Attributes Transfer from Non-Parallel Data [57.010952598634944]
We formulate molecular optimization as a style transfer problem and present a novel generative model that could automatically learn internal differences between two groups of non-parallel data. Experiments on two molecular optimization tasks, toxicity modification and synthesizability improvement, demonstrate that our model significantly outperforms several state-of-the-art methods.
arXiv Detail & Related papers (2021-11-30T06:10:22Z)
Improved Drug-target Interaction Prediction with Intermolecular Graph Transformer [98.8319016075089]
We propose a novel approach to model intermolecular information with a three-way Transformer-based architecture. Intermolecular Graph Transformer (IGT) outperforms state-of-the-art approaches by 9.1% and 20.5% over the second best for binding activity and binding pose prediction, respectively. IGT exhibits promising drug screening ability against SARS-CoV-2 by identifying 83.1% of the active drugs that have been validated by wet-lab experiments with near-native predicted binding poses.
arXiv Detail & Related papers (2021-10-14T13:28:02Z)
Optimizing Molecules using Efficient Queries from Property Evaluations [66.66290256377376]
We propose QMO, a generic query-based molecule optimization framework. QMO improves the desired properties of an input molecule based on efficient queries. We show that QMO outperforms existing methods in the benchmark tasks of optimizing small organic molecules.
arXiv Detail & Related papers (2020-11-03T18:51:18Z)
MIMOSA: Multi-constraint Molecule Sampling for Molecule Optimization [51.00815310242277]
Generative models and reinforcement learning approaches have had initial success, but still face difficulties in simultaneously optimizing multiple drug properties. We propose the MultI-constraint MOlecule SAmpling (MIMOSA) approach, a sampling framework that uses the input molecule as an initial guess and samples molecules from the target distribution.
arXiv Detail & Related papers (2020-10-05T20:18:42Z)

This list is automatically generated from the titles and abstracts of the papers on this site.