Related papers: Genetic algorithms are strong baselines for molecule generation

URL: http://arxiv.org/abs/2310.09267v1
Date: Fri, 13 Oct 2023 17:25:11 GMT
Title: Genetic algorithms are strong baselines for molecule generation
Authors: Austin Tripp, José Miguel Hernández-Lobato
Abstract summary: Genetic algorithms (GAs) generate molecules by randomly modifying known molecules. In this paper we show that GAs are very strong algorithms for such tasks, outperforming many complicated machine learning methods.
Score: 3.0832873002777785
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Generating molecules, both in a directed and undirected fashion, is a huge part of the drug discovery pipeline. Genetic algorithms (GAs) generate molecules by randomly modifying known molecules. In this paper we show that GAs are very strong algorithms for such tasks, outperforming many complicated machine learning methods: a result which many researchers may find surprising. We therefore propose insisting during peer review that new algorithms must have some clear advantage over GAs, which we call the GA criterion. Ultimately our work suggests that a lot of research in molecule generation should be re-assessed.

Related papers

Gradient GA: Gradient Genetic Algorithm for Drug Molecular Design [17.597915824192953]
Experimental results demonstrate that our method significantly improves both convergence speed and solution quality, outperforming cutting-edge techniques. For example, it achieves up to a 25% improvement in the top-10 score over the vanilla genetic algorithm.
arXiv Detail & Related papers (2025-02-14T02:03:39Z)
Curiosity as a Self-Supervised Method to Improve Exploration in De novo Drug Design [0.276240219662896]
We introduce a curiosity-driven method to force the model to navigate many parts of the chemical space. At first, we train a recurrent neural network-based general molecular generator (G), then we fine-tune G to maximize curiosity and desirability. We benchmarked our approach against two desirable chemical properties related to drug-likeness and showed that the discovered chemical space can be significantly expanded.
arXiv Detail & Related papers (2023-09-24T06:44:51Z)
Towards Predicting Equilibrium Distributions for Molecular Systems with Deep Learning [60.02391969049972]
We introduce a novel deep learning framework, called Distributional Graphormer (DiG), in an attempt to predict the equilibrium distribution of molecular systems. DiG employs deep neural networks to transform a simple distribution towards the equilibrium distribution, conditioned on a descriptor of a molecular system.
arXiv Detail & Related papers (2023-06-08T17:12:08Z)
MolCPT: Molecule Continuous Prompt Tuning to Generalize Molecular Representation Learning [77.31492888819935]
We propose a novel paradigm of "pre-train, prompt, fine-tune" for molecular representation learning, named molecule continuous prompt tuning (MolCPT). MolCPT defines a motif prompting function that uses the pre-trained model to project the standalone input into an expressive prompt. Experiments on several benchmark datasets show that MolCPT efficiently generalizes pre-trained GNNs for molecular property prediction.
arXiv Detail & Related papers (2022-12-20T19:32:30Z)
Retrieval-based Controllable Molecule Generation [63.44583084888342]
We propose a new retrieval-based framework for controllable molecule generation. We use a small set of molecules to steer the pre-trained generative model towards synthesizing molecules that satisfy the given design criteria. Our approach is agnostic to the choice of generative models and requires no task-specific fine-tuning.
arXiv Detail & Related papers (2022-08-23T17:01:16Z)
Genetic Algorithm for Constrained Molecular Inverse Design [0.1086166673827221]
We introduce a genetic algorithm featuring a constrained molecular inverse design. The proposed algorithm successfully produces valid molecules for crossover and mutation. Experiments prove that our algorithm effectively finds molecules that satisfy specific properties while maintaining structural constraints.
arXiv Detail & Related papers (2021-12-07T05:58:44Z)
Improving RNA Secondary Structure Design using Deep Reinforcement Learning [69.63971634605797]
We propose a new benchmark of applying reinforcement learning to RNA sequence design, in which the objective function is defined to be the free energy in the sequence's secondary structure. We show results of the ablation analysis that we do for these algorithms, as well as graphs indicating the algorithm's performance across batches.
arXiv Detail & Related papers (2021-11-05T02:54:06Z)
Goal directed molecule generation using Monte Carlo Tree Search [15.462930062711237]
We propose a novel method, which we call unitMCTS, to perform molecule generation by making a unit change to the molecule at every step using Monte Carlo Tree Search. We show that this method outperforms the recently published techniques on benchmark molecular optimization tasks such as QED and penalized logP.
arXiv Detail & Related papers (2020-10-30T17:49:59Z)
A summary of the prevalence of Genetic Algorithms in Bioinformatics from 2015 onwards [0.0]
Genetic algorithms rarely form a full application; instead, they rely on other vital algorithms such as support vector machines. Population-based searches, like GA, are often combined with other machine learning algorithms. The future of genetic algorithms could be open-ended evolutionary algorithms, which attempt to increase complexity and find diverse solutions.
arXiv Detail & Related papers (2020-08-20T15:15:43Z)
Guiding Deep Molecular Optimization with Genetic Exploration [79.50698140997726]
We propose genetic expert-guided learning (GEGL), a framework for training a deep neural network (DNN) to generate highly-rewarding molecules. Extensive experiments show that GEGL significantly improves over state-of-the-art methods.
arXiv Detail & Related papers (2020-07-04T05:01:26Z)
Self-Supervised Graph Transformer on Large-Scale Molecular Data [73.3448373618865]
We propose a novel framework, GROVER, for molecular representation learning. GROVER can learn rich structural and semantic information of molecules from enormous unlabelled molecular data. We pre-train GROVER with 100 million parameters on 10 million unlabelled molecules -- the biggest GNN and the largest training dataset in molecular representation learning.
arXiv Detail & Related papers (2020-06-18T08:37:04Z)
Using Genetic Algorithm To Evolve Cellular Automata In Performing Edge Detection [0.0]
We have made an effort to perform edge detection on an image using a genetic algorithm. We have tried to evolve the cellular automata and shown how, over time, they converge to the desired results.
arXiv Detail & Related papers (2020-05-13T04:07:43Z)

This list is automatically generated from the titles and abstracts of the papers on this site.