FastFlows: Flow-Based Models for Molecular Graph Generation
- URL: http://arxiv.org/abs/2201.12419v1
- Date: Fri, 28 Jan 2022 21:08:31 GMT
- Title: FastFlows: Flow-Based Models for Molecular Graph Generation
- Authors: Nathan C. Frey, Vijay Gadepally, Bharath Ramsundar
- Abstract summary: FastFlows generates thousands of chemically valid molecules in seconds.
Our model is significantly simpler and easier to train than autoregressive molecular generative models.
- Score: 4.9252608053969675
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: We propose a framework using normalizing-flow based models, SELF-Referencing
Embedded Strings, and multi-objective optimization that efficiently generates
small molecules. With an initial training set of only 100 small molecules,
FastFlows generates thousands of chemically valid molecules in seconds. Because
of the efficient sampling, substructure filters can be applied as desired to
eliminate compounds with unreasonable moieties. Using easily computable and
learned metrics for druglikeness, synthetic accessibility, and synthetic
complexity, we perform a multi-objective optimization to demonstrate how
FastFlows functions in a high-throughput virtual screening context. Our model
is significantly simpler and easier to train than autoregressive molecular
generative models, and enables fast generation and identification of druglike,
synthesizable molecules.
Related papers
- Efficient 3D Molecular Generation with Flow Matching and Scale Optimal Transport [43.56824843205882]
Semla is a scalable E(3)-equivariant message passing architecture.
SemlaFlow is trained using flow matching along with scale optimal transport.
Our model produces state-of-the-art results on benchmark datasets with just 100 sampling steps.
arXiv Detail & Related papers (2024-06-11T13:51:51Z) - RGFN: Synthesizable Molecular Generation Using GFlowNets [51.33672611338754]
We propose Reaction-GFlowNet, an extension of the GFlowNet framework that operates directly in the space of chemical reactions.
RGFN allows out-of-the-box synthesizability while maintaining comparable quality of generated candidates.
We demonstrate the effectiveness of the proposed approach across a range of oracle models, including pretrained proxy models and GPU-accelerated docking.
arXiv Detail & Related papers (2024-06-01T13:11:11Z) - SynFlowNet: Towards Molecule Design with Guaranteed Synthesis Pathways [17.704264588418035]
We introduce SynFlowNet, a GFlowNet model whose action space uses chemically validated reactions and reactants to sequentially build new molecules.
We evaluate our approach using synthetic accessibility scores and an independent retrosynthesis tool.
We compare molecules designed with SynFlowNet to experimentally validated actives, and find that they show comparable properties of interest, such as molecular weight, SA score and predicted protein binding affinity.
arXiv Detail & Related papers (2024-05-02T10:15:59Z) - Mixed Continuous and Categorical Flow Matching for 3D De Novo Molecule Generation [0.0]
Flow matching is a recently proposed generative modeling framework that generalizes diffusion models.
We extend the flow matching framework to categorical data by constructing flows that are constrained to exist on a continuous representation of categorical data known as the probability simplex.
We find that, in practice, a simpler approach that makes no accommodations for the categorical nature of the data yields equivalent or superior performance.
arXiv Detail & Related papers (2024-04-30T17:37:21Z) - Retrieval-based Controllable Molecule Generation [63.44583084888342]
We propose a new retrieval-based framework for controllable molecule generation.
We use a small set of molecules to steer the pre-trained generative model towards synthesizing molecules that satisfy the given design criteria.
Our approach is agnostic to the choice of generative models and requires no task-specific fine-tuning.
arXiv Detail & Related papers (2022-08-23T17:01:16Z) - Accurate Machine Learned Quantum-Mechanical Force Fields for
Biomolecular Simulations [51.68332623405432]
Molecular dynamics (MD) simulations allow atomistic insights into chemical and biological processes.
Recently, machine learned force fields (MLFFs) emerged as an alternative means to execute MD simulations.
This work proposes a general approach to constructing accurate MLFFs for large-scale molecular simulations.
arXiv Detail & Related papers (2022-05-17T13:08:28Z) - Molecular Attributes Transfer from Non-Parallel Data [57.010952598634944]
We formulate molecular optimization as a style transfer problem and present a novel generative model that could automatically learn internal differences between two groups of non-parallel data.
Experiments on two molecular optimization tasks, toxicity modification and synthesizability improvement, demonstrate that our model significantly outperforms several state-of-the-art methods.
arXiv Detail & Related papers (2021-11-30T06:10:22Z) - RetroGNN: Approximating Retrosynthesis by Graph Neural Networks for De
Novo Drug Design [75.14290780116002]
We train deep graph neural networks to approximate the outputs of a retrosynthesis planning software.
Our approach finds molecules predicted to be more likely to be antibiotics while maintaining good drug-like properties and being easily synthesizable.
arXiv Detail & Related papers (2020-11-25T22:04:16Z) - MIMOSA: Multi-constraint Molecule Sampling for Molecule Optimization [51.00815310242277]
generative models and reinforcement learning approaches made initial success, but still face difficulties in simultaneously optimizing multiple drug properties.
We propose the MultI-constraint MOlecule SAmpling (MIMOSA) approach, a sampling framework to use input molecule as an initial guess and sample molecules from the target distribution.
arXiv Detail & Related papers (2020-10-05T20:18:42Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.