FastFlows: Flow-Based Models for Molecular Graph Generation
- URL: http://arxiv.org/abs/2201.12419v1
- Date: Fri, 28 Jan 2022 21:08:31 GMT
- Title: FastFlows: Flow-Based Models for Molecular Graph Generation
- Authors: Nathan C. Frey, Vijay Gadepally, Bharath Ramsundar
- Abstract summary: FastFlows generates thousands of chemically valid molecules in seconds.
Our model is significantly simpler and easier to train than autoregressive molecular generative models.
- Score: 4.9252608053969675
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: We propose a framework using normalizing-flow based models, SELF-Referencing
Embedded Strings, and multi-objective optimization that efficiently generates
small molecules. With an initial training set of only 100 small molecules,
FastFlows generates thousands of chemically valid molecules in seconds. Because
of the efficient sampling, substructure filters can be applied as desired to
eliminate compounds with unreasonable moieties. Using easily computable and
learned metrics for druglikeness, synthetic accessibility, and synthetic
complexity, we perform a multi-objective optimization to demonstrate how
FastFlows functions in a high-throughput virtual screening context. Our model
is significantly simpler and easier to train than autoregressive molecular
generative models, and enables fast generation and identification of druglike,
synthesizable molecules.
Related papers
- Text-Guided Multi-Property Molecular Optimization with a Diffusion Language Model [77.50732023411811]
We propose a text-guided multi-property molecular optimization method utilizing transformer-based diffusion language model (TransDLM)
TransDLM leverages standardized chemical nomenclature as semantic representations of molecules and implicitly embeds property requirements into textual descriptions.
Our approach surpasses state-of-the-art methods in optimizing molecular structural similarity and enhancing chemical properties on the benchmark dataset.
arXiv Detail & Related papers (2024-10-17T14:30:27Z) - Generative Flows on Synthetic Pathway for Drug Design [39.69010664056235]
We propose RxnFlow, which sequentially assembles molecules using predefined molecular building blocks and chemical reaction templates.
RxnFlow achieves state-of-the-art performance on CrossDocked 2020 for pocket-conditional generation, with an average Vina score of -8.85kcal/mol and 34.8% synthesizability.
arXiv Detail & Related papers (2024-10-06T16:34:01Z) - Efficient 3D Molecular Generation with Flow Matching and Scale Optimal Transport [43.56824843205882]
Semla is a scalable E(3)-equivariant message passing architecture.
SemlaFlow is trained using flow matching along with scale optimal transport.
Our model produces state-of-the-art results on benchmark datasets with just 100 sampling steps.
arXiv Detail & Related papers (2024-06-11T13:51:51Z) - RGFN: Synthesizable Molecular Generation Using GFlowNets [51.33672611338754]
We propose Reaction-GFlowNet, an extension of the GFlowNet framework that operates directly in the space of chemical reactions.
RGFN allows out-of-the-box synthesizability while maintaining comparable quality of generated candidates.
We demonstrate the effectiveness of the proposed approach across a range of oracle models, including pretrained proxy models and GPU-accelerated docking.
arXiv Detail & Related papers (2024-06-01T13:11:11Z) - Mixed Continuous and Categorical Flow Matching for 3D De Novo Molecule Generation [0.0]
Flow matching is a recently proposed generative modeling framework that generalizes diffusion models.
We extend the flow matching framework to categorical data by constructing flows that are constrained to exist on a continuous representation of categorical data known as the probability simplex.
We find that, in practice, a simpler approach that makes no accommodations for the categorical nature of the data yields equivalent or superior performance.
arXiv Detail & Related papers (2024-04-30T17:37:21Z) - Retrieval-based Controllable Molecule Generation [63.44583084888342]
We propose a new retrieval-based framework for controllable molecule generation.
We use a small set of molecules to steer the pre-trained generative model towards synthesizing molecules that satisfy the given design criteria.
Our approach is agnostic to the choice of generative models and requires no task-specific fine-tuning.
arXiv Detail & Related papers (2022-08-23T17:01:16Z) - Accurate Machine Learned Quantum-Mechanical Force Fields for
Biomolecular Simulations [51.68332623405432]
Molecular dynamics (MD) simulations allow atomistic insights into chemical and biological processes.
Recently, machine learned force fields (MLFFs) emerged as an alternative means to execute MD simulations.
This work proposes a general approach to constructing accurate MLFFs for large-scale molecular simulations.
arXiv Detail & Related papers (2022-05-17T13:08:28Z) - Molecular Attributes Transfer from Non-Parallel Data [57.010952598634944]
We formulate molecular optimization as a style transfer problem and present a novel generative model that could automatically learn internal differences between two groups of non-parallel data.
Experiments on two molecular optimization tasks, toxicity modification and synthesizability improvement, demonstrate that our model significantly outperforms several state-of-the-art methods.
arXiv Detail & Related papers (2021-11-30T06:10:22Z) - MIMOSA: Multi-constraint Molecule Sampling for Molecule Optimization [51.00815310242277]
generative models and reinforcement learning approaches made initial success, but still face difficulties in simultaneously optimizing multiple drug properties.
We propose the MultI-constraint MOlecule SAmpling (MIMOSA) approach, a sampling framework to use input molecule as an initial guess and sample molecules from the target distribution.
arXiv Detail & Related papers (2020-10-05T20:18:42Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.