Curiosity as a Self-Supervised Method to Improve Exploration in De novo
Drug Design
- URL: http://arxiv.org/abs/2401.06771v1
- Date: Sun, 24 Sep 2023 06:44:51 GMT
- Title: Curiosity as a Self-Supervised Method to Improve Exploration in De novo
Drug Design
- Authors: Mohamed-Amine Chadi, Hajar Mousannif, Ahmed Aamouche
- Abstract summary: We introduce a curiosity-driven method to force the model to navigate many parts of the chemical space.
At first, we train a recurrent neural network-based general molecular generator (G), then we fine-tune G to maximize curiosity and desirability.
We benchmarked our approach against two desirable chemical properties related to drug-likeness and showed that the discovered chemical space can be significantly expanded.
- Score: 0.276240219662896
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: In recent years, deep learning has demonstrated promising results in de novo
drug design. However, the proposed techniques still lack an efficient
exploration of the large chemical space. Most of these methods explore a small
fragment of the chemical space of known drugs, if the desired molecules were
not found, the process ends. In this work, we introduce a curiosity-driven
method to force the model to navigate many parts of the chemical space,
therefore, achieving higher desirability and diversity as well. At first, we
train a recurrent neural network-based general molecular generator (G), then we
fine-tune G to maximize curiosity and desirability. We define curiosity as the
Tanimoto similarity between two generated molecules, a first molecule generated
by G, and a second one generated by a copy of G (Gcopy). We only backpropagate
the loss through G while keeping Gcopy unchanged. We benchmarked our approach
against two desirable chemical properties related to drug-likeness and showed
that the discovered chemical space can be significantly expanded, thus,
discovering a higher number of desirable molecules with more diversity and
potentially easier to synthesize. All Code and data used in this paper are
available at https://github.com/amine179/Curiosity-RL-for-Drug-Design.
Related papers
- De novo Drug Design using Reinforcement Learning with Multiple GPT
Agents [16.508471997999496]
MolRL-MGPT is a reinforcement learning algorithm with multiple GPT agents for drug molecular generation.
Our algorithm has shown promising results on the GuacaMol benchmark and exhibits efficacy in designing inhibitors against SARS-CoV-2 protein targets.
arXiv Detail & Related papers (2023-12-21T13:24:03Z) - Genetic algorithms are strong baselines for molecule generation [3.0832873002777785]
Genetic algorithms (GAs) generate molecules by randomly modifying known molecules.
In this paper we show that GAs are very strong algorithms for such tasks, outperforming many complicated machine learning methods.
arXiv Detail & Related papers (2023-10-13T17:25:11Z) - Retrieval-based Controllable Molecule Generation [63.44583084888342]
We propose a new retrieval-based framework for controllable molecule generation.
We use a small set of molecules to steer the pre-trained generative model towards synthesizing molecules that satisfy the given design criteria.
Our approach is agnostic to the choice of generative models and requires no task-specific fine-tuning.
arXiv Detail & Related papers (2022-08-23T17:01:16Z) - Graph-based Molecular Representation Learning [59.06193431883431]
Molecular representation learning (MRL) is a key step to build the connection between machine learning and chemical science.
Recently, MRL has achieved considerable progress, especially in methods based on deep molecular graph learning.
arXiv Detail & Related papers (2022-07-08T17:43:20Z) - Exploring Chemical Space with Score-based Out-of-distribution Generation [57.15855198512551]
We propose a score-based diffusion scheme that incorporates out-of-distribution control in the generative differential equation (SDE)
Since some novel molecules may not meet the basic requirements of real-world drugs, MOOD performs conditional generation by utilizing the gradients from a property predictor.
We experimentally validate that MOOD is able to explore the chemical space beyond the training distribution, generating molecules that outscore ones found with existing methods, and even the top 0.01% of the original training pool.
arXiv Detail & Related papers (2022-06-06T06:17:11Z) - Scalable Fragment-Based 3D Molecular Design with Reinforcement Learning [68.8204255655161]
We introduce a novel framework for scalable 3D design that uses a hierarchical agent to build molecules.
In a variety of experiments, we show that our agent, guided only by energy considerations, can efficiently learn to produce molecules with over 100 atoms.
arXiv Detail & Related papers (2022-02-01T18:54:24Z) - Hit and Lead Discovery with Explorative RL and Fragment-based Molecule
Generation [34.26748101294543]
We propose a novel framework that generates pharmacochemically acceptable molecules with large docking scores.
Our method constrains the generated molecules to a realistic and qualified chemical space and effectively explores the space to find drugs.
Our model produces molecules of higher quality compared to existing methods while achieving state-of-the-art performance on two of three targets.
arXiv Detail & Related papers (2021-10-04T07:21:00Z) - Chemical-Reaction-Aware Molecule Representation Learning [88.79052749877334]
We propose using chemical reactions to assist learning molecule representation.
Our approach is proven effective to 1) keep the embedding space well-organized and 2) improve the generalization ability of molecule embeddings.
Experimental results demonstrate that our method achieves state-of-the-art performance in a variety of downstream tasks.
arXiv Detail & Related papers (2021-09-21T00:08:43Z) - MIMOSA: Multi-constraint Molecule Sampling for Molecule Optimization [51.00815310242277]
generative models and reinforcement learning approaches made initial success, but still face difficulties in simultaneously optimizing multiple drug properties.
We propose the MultI-constraint MOlecule SAmpling (MIMOSA) approach, a sampling framework to use input molecule as an initial guess and sample molecules from the target distribution.
arXiv Detail & Related papers (2020-10-05T20:18:42Z) - ChemoVerse: Manifold traversal of latent spaces for novel molecule
discovery [0.7742297876120561]
It is essential to identify molecular structures with the desired chemical properties.
Recent advances in generative models using neural networks and machine learning are being widely used to design virtual libraries of drug-like compounds.
arXiv Detail & Related papers (2020-09-29T12:11:40Z) - Reinforcement Learning for Molecular Design Guided by Quantum Mechanics [10.112779201155005]
We present a novel RL formulation for molecular design in coordinates, thereby extending the class of molecules that can be built.
Our reward function is directly based on fundamental physical properties such as the energy, which we approximate via fast quantum-chemical methods.
In our experiments, we show that our agent can efficiently learn to solve these tasks from scratch by working in a translation and rotation invariant state-action space.
arXiv Detail & Related papers (2020-02-18T16:43:58Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.