FREED++: Improving RL Agents for Fragment-Based Molecule Generation by
Thorough Reproduction
- URL: http://arxiv.org/abs/2401.09840v1
- Date: Thu, 18 Jan 2024 09:54:19 GMT
- Title: FREED++: Improving RL Agents for Fragment-Based Molecule Generation by
Thorough Reproduction
- Authors: Alexander Telepov, Artem Tsypin, Kuzma Khrabrov, Sergey Yakukhnov,
Pavel Strashnov, Petr Zhilyaev, Egor Rumiantsev, Daniel Ezhov, Manvel
Avetisian, Olga Popova, Artur Kadurin
- Abstract summary: Reinforcement Learning (RL) has emerged as a promising approach to generating molecules with the docking score (DS) as a reward.
We reproduce, scrutinize and improve the recent model for molecule generation called FREED (arXiv:2110.01219)
- Score: 33.57089414199478
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: A rational design of new therapeutic drugs aims to find a molecular structure
with desired biological functionality, e.g., an ability to activate or suppress
a specific protein via binding to it. Molecular docking is a common technique
for evaluating protein-molecule interactions. Recently, Reinforcement Learning
(RL) has emerged as a promising approach to generating molecules with the
docking score (DS) as a reward. In this work, we reproduce, scrutinize and
improve the recent RL model for molecule generation called FREED
(arXiv:2110.01219). Extensive evaluation of the proposed method reveals several
limitations and challenges despite the outstanding results reported for three
target proteins. Our contributions include fixing numerous implementation bugs
and simplifying the model while increasing its quality, significantly extending
experiments, and conducting an accurate comparison with current
state-of-the-art methods for protein-conditioned molecule generation. We show
that the resulting fixed model is capable of producing molecules with superior
docking scores compared to alternative approaches.
Related papers
- Aligning Target-Aware Molecule Diffusion Models with Exact Energy Optimization [147.7899503829411]
AliDiff is a novel framework to align pretrained target diffusion models with preferred functional properties.
It can generate molecules with state-of-the-art binding energies with up to -7.07 Avg. Vina Score.
arXiv Detail & Related papers (2024-07-01T06:10:29Z) - UniIF: Unified Molecule Inverse Folding [67.60267592514381]
We propose a unified model UniIF for inverse folding of all molecules.
Our proposed method surpasses state-of-the-art methods on all tasks.
arXiv Detail & Related papers (2024-05-29T10:26:16Z) - AUTODIFF: Autoregressive Diffusion Modeling for Structure-based Drug Design [16.946648071157618]
We propose a diffusion-based fragment-wise autoregressive generation model for structure-based drug design (SBDD)
We design a novel molecule assembly strategy named conformal motif that preserves the conformation of local structures of molecules first.
We then encode the interaction of the protein-ligand complex with an SE(3)-equivariant convolutional network and generate molecules motif-by-motif with diffusion modeling.
arXiv Detail & Related papers (2024-04-02T14:44:02Z) - ChemSpaceAL: An Efficient Active Learning Methodology Applied to
Protein-Specific Molecular Generation [0.0]
We present a computationally efficient active learning methodology that requires evaluation of only a subset of the generated data.
We demonstrate the applicability of this methodology to targeted molecular generation by fine-tuning a GPT-based molecular generator toward a protein with FDA-approved small-molecule inhibitors, c-Abl kinase.
Remarkably, the model learns to generate molecules similar to the inhibitors without prior knowledge of their existence, and even reproduces two of them exactly.
arXiv Detail & Related papers (2023-09-11T22:28:36Z) - Molecule Design by Latent Space Energy-Based Modeling and Gradual
Distribution Shifting [53.44684898432997]
Generation of molecules with desired chemical and biological properties is critical for drug discovery.
We propose a probabilistic generative model to capture the joint distribution of molecules and their properties.
Our method achieves very strong performances on various molecule design tasks.
arXiv Detail & Related papers (2023-06-09T03:04:21Z) - SILVR: Guided Diffusion for Molecule Generation [0.0]
We introduce a machine-learning method for conditioning an existing generative model without retraining.
The model allows the generation of new molecules that fit into a binding site of a protein based on fragment hits.
We show that moderate SILVR rates make it possible to generate new molecules of similar shape to the original fragments.
arXiv Detail & Related papers (2023-04-21T11:47:38Z) - Retrieval-based Controllable Molecule Generation [63.44583084888342]
We propose a new retrieval-based framework for controllable molecule generation.
We use a small set of molecules to steer the pre-trained generative model towards synthesizing molecules that satisfy the given design criteria.
Our approach is agnostic to the choice of generative models and requires no task-specific fine-tuning.
arXiv Detail & Related papers (2022-08-23T17:01:16Z) - Widely Used and Fast De Novo Drug Design by a Protein Sequence-Based
Reinforcement Learning Model [4.815696666006742]
Structure-based de novo method can overcome the data scarcity of active by incorporating drug-target interaction into deep generative architectures.
Here, we demonstrate a widely used and fast protein sequence-based reinforcement learning model for drug discovery.
As a proof of concept, the RL model was utilized to design molecules for four targets.
arXiv Detail & Related papers (2022-08-14T10:41:52Z) - Exploring Chemical Space with Score-based Out-of-distribution Generation [57.15855198512551]
We propose a score-based diffusion scheme that incorporates out-of-distribution control in the generative differential equation (SDE)
Since some novel molecules may not meet the basic requirements of real-world drugs, MOOD performs conditional generation by utilizing the gradients from a property predictor.
We experimentally validate that MOOD is able to explore the chemical space beyond the training distribution, generating molecules that outscore ones found with existing methods, and even the top 0.01% of the original training pool.
arXiv Detail & Related papers (2022-06-06T06:17:11Z) - Hit and Lead Discovery with Explorative RL and Fragment-based Molecule
Generation [34.26748101294543]
We propose a novel framework that generates pharmacochemically acceptable molecules with large docking scores.
Our method constrains the generated molecules to a realistic and qualified chemical space and effectively explores the space to find drugs.
Our model produces molecules of higher quality compared to existing methods while achieving state-of-the-art performance on two of three targets.
arXiv Detail & Related papers (2021-10-04T07:21:00Z) - CogMol: Target-Specific and Selective Drug Design for COVID-19 Using
Deep Generative Models [74.58583689523999]
We propose an end-to-end framework, named CogMol, for designing new drug-like small molecules targeting novel viral proteins.
CogMol combines adaptive pre-training of a molecular SMILES Variational Autoencoder (VAE) and an efficient multi-attribute controlled sampling scheme.
CogMol handles multi-constraint design of synthesizable, low-toxic, drug-like molecules with high target specificity and selectivity.
arXiv Detail & Related papers (2020-04-02T18:17:20Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.