Barking up the right tree: an approach to search over molecule synthesis
DAGs
- URL: http://arxiv.org/abs/2012.11522v1
- Date: Mon, 21 Dec 2020 17:35:06 GMT
- Title: Barking up the right tree: an approach to search over molecule synthesis
DAGs
- Authors: John Bradshaw, Brooks Paige, Matt J. Kusner, Marwin H. S. Segler,
Jos\'e Miguel Hern\'andez-Lobato
- Abstract summary: Current deep generative models for molecules ignore synthesizability.
We propose a deep generative model that better represents the real world process.
We show that our approach is able to model chemical space well, producing a wide range of diverse molecules.
- Score: 28.13323960125482
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: When designing new molecules with particular properties, it is not only
important what to make but crucially how to make it. These instructions form a
synthesis directed acyclic graph (DAG), describing how a large vocabulary of
simple building blocks can be recursively combined through chemical reactions
to create more complicated molecules of interest. In contrast, many current
deep generative models for molecules ignore synthesizability. We therefore
propose a deep generative model that better represents the real world process,
by directly outputting molecule synthesis DAGs. We argue that this provides
sensible inductive biases, ensuring that our model searches over the same
chemical space that chemists would also have access to, as well as
interpretability. We show that our approach is able to model chemical space
well, producing a wide range of diverse molecules, and allows for unconstrained
optimization of an inherently constrained problem: maximize certain chemical
properties such that discovered molecules are synthesizable.
Related papers
- STRIDE: Structure-guided Generation for Inverse Design of Molecules [0.24578723416255752]
$textbfSTRIDE$ is a generative molecule workflow that generates novel molecules with an unconditional generative model guided by known molecules without any retraining.
Our generated molecules have on average 21.7% lower synthetic accessibility scores and also reduce ionization potential by 5.9% of generated molecules via guiding.
arXiv Detail & Related papers (2023-11-06T08:22:35Z) - MUDiff: Unified Diffusion for Complete Molecule Generation [104.7021929437504]
We present a new model for generating a comprehensive representation of molecules, including atom features, 2D discrete molecule structures, and 3D continuous molecule coordinates.
We propose a novel graph transformer architecture to denoise the diffusion process.
Our model is a promising approach for designing stable and diverse molecules and can be applied to a wide range of tasks in molecular modeling.
arXiv Detail & Related papers (2023-04-28T04:25:57Z) - Domain-Agnostic Molecular Generation with Chemical Feedback [44.063584808910896]
MolGen is a pre-trained molecular language model tailored specifically for molecule generation.
It internalizes structural and grammatical insights through the reconstruction of over 100 million molecular SELFIES.
Our chemical feedback paradigm steers the model away from molecular hallucinations, ensuring alignment between the model's estimated probabilities and real-world chemical preferences.
arXiv Detail & Related papers (2023-01-26T17:52:56Z) - A Molecular Multimodal Foundation Model Associating Molecule Graphs with
Natural Language [63.60376252491507]
We propose a molecular multimodal foundation model which is pretrained from molecular graphs and their semantically related textual data.
We believe that our model would have a broad impact on AI-empowered fields across disciplines such as biology, chemistry, materials, environment, and medicine.
arXiv Detail & Related papers (2022-09-12T00:56:57Z) - Scalable Fragment-Based 3D Molecular Design with Reinforcement Learning [68.8204255655161]
We introduce a novel framework for scalable 3D design that uses a hierarchical agent to build molecules.
In a variety of experiments, we show that our agent, guided only by energy considerations, can efficiently learn to produce molecules with over 100 atoms.
arXiv Detail & Related papers (2022-02-01T18:54:24Z) - RetroGNN: Approximating Retrosynthesis by Graph Neural Networks for De
Novo Drug Design [75.14290780116002]
We train deep graph neural networks to approximate the outputs of a retrosynthesis planning software.
Our approach finds molecules predicted to be more likely to be antibiotics while maintaining good drug-like properties and being easily synthesizable.
arXiv Detail & Related papers (2020-11-25T22:04:16Z) - Learning Latent Space Energy-Based Prior Model for Molecule Generation [59.875533935578375]
We learn latent space energy-based prior model with SMILES representation for molecule modeling.
Our method is able to generate molecules with validity and uniqueness competitive with state-of-the-art models.
arXiv Detail & Related papers (2020-10-19T09:34:20Z) - Learning Graph Models for Retrosynthesis Prediction [90.15523831087269]
Retrosynthesis prediction is a fundamental problem in organic synthesis.
This paper introduces a graph-based approach that capitalizes on the idea that the graph topology of precursor molecules is largely unaltered during a chemical reaction.
Our model achieves a top-1 accuracy of $53.7%$, outperforming previous template-free and semi-template-based methods.
arXiv Detail & Related papers (2020-06-12T09:40:42Z) - Multi-Objective Molecule Generation using Interpretable Substructures [38.637412590671865]
Drug discovery aims to find novel compounds with specified chemical property profiles.
The goal is to learn to sample molecules in the intersection of multiple property constraints.
We propose to offset this complexity by composing molecules from a vocabulary of substructures that we call molecular rationales.
arXiv Detail & Related papers (2020-02-08T22:55:37Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.