Fragment-based molecular generative model with high generalization
ability and synthetic accessibility
- URL: http://arxiv.org/abs/2111.12907v1
- Date: Thu, 25 Nov 2021 04:44:37 GMT
- Title: Fragment-based molecular generative model with high generalization
ability and synthetic accessibility
- Authors: Seonghwan Seo, Jaechang Lim, and Woo Youn Kim
- Abstract summary: We propose a fragment-based molecular generative model which designs new molecules with target properties.
A key feature of our model is a high generalization ability in terms of property control and fragment types.
We show that the model can generate molecules with the simultaneous control of multiple target properties at a high success rate.
- Score: 0.0
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Deep generative models are attracting great attention for molecular design
with desired properties. Most existing models generate molecules by
sequentially adding atoms. This often renders generated molecules with less
correlation with target properties and low synthetic accessibility. Molecular
fragments such as functional groups are more closely related to molecular
properties and synthetic accessibility than atoms. Here, we propose a
fragment-based molecular generative model which designs new molecules with
target properties by sequentially adding molecular fragments to any given
starting molecule. A key feature of our model is a high generalization ability
in terms of property control and fragment types. The former becomes possible by
learning the contribution of individual fragments to the target properties in
an auto-regressive manner. For the latter, we used a deep neural network that
predicts the bonding probability of two molecules from the embedding vectors of
the two molecules as input. The high synthetic accessibility of the generated
molecules is implicitly considered while preparing the fragment library with
the BRICS decomposition method. We show that the model can generate molecules
with the simultaneous control of multiple target properties at a high success
rate. It also works equally well with unseen fragments even in the property
range where the training data is rare, verifying the high generalization
ability. As a practical application, we demonstrated that the model can
generate potential inhibitors with high binding affinities against the 3CL
protease of SARS-COV-2 in terms of docking score.
Related papers
- Molecule Design by Latent Prompt Transformer [76.2112075557233]
This work explores the challenging problem of molecule design by framing it as a conditional generative modeling task.
We propose a novel generative model comprising three components: (1) a latent vector with a learnable prior distribution; (2) a molecule generation model based on a causal Transformer, which uses the latent vector as a prompt; and (3) a property prediction model that predicts a molecule's target properties and/or constraint values using the latent prompt.
arXiv Detail & Related papers (2024-02-27T03:33:23Z) - A novel molecule generative model of VAE combined with Transformer for unseen structure generation [0.0]
Transformer and VAE are widely used as powerful models, but they are rarely used in combination due to structural and performance mismatch.
This study proposes a model that combines these two models through structural and parameter optimization in handling diverse molecules.
The proposed model shows comparable performance to existing models in generating molecules, and showed by far superior performance in generating molecules with unseen structures.
arXiv Detail & Related papers (2024-02-19T08:46:04Z) - STRIDE: Structure-guided Generation for Inverse Design of Molecules [0.24578723416255752]
$textbfSTRIDE$ is a generative molecule workflow that generates novel molecules with an unconditional generative model guided by known molecules without any retraining.
Our generated molecules have on average 21.7% lower synthetic accessibility scores and also reduce ionization potential by 5.9% of generated molecules via guiding.
arXiv Detail & Related papers (2023-11-06T08:22:35Z) - Diffusing on Two Levels and Optimizing for Multiple Properties: A Novel
Approach to Generating Molecules with Desirable Properties [33.2976176283611]
We present a novel approach to generating molecules with desirable properties, which expands the diffusion model framework with multiple innovative designs.
To get desirable molecular fragments, we develop a novel electronic effect based fragmentation method.
We show that the molecules generated by our proposed method have better validity, uniqueness, novelty, Fr'echet ChemNet Distance (FCD), QED, and PlogP than those generated by current SOTA models.
arXiv Detail & Related papers (2023-10-05T11:43:21Z) - Molecule Design by Latent Space Energy-Based Modeling and Gradual
Distribution Shifting [53.44684898432997]
Generation of molecules with desired chemical and biological properties is critical for drug discovery.
We propose a probabilistic generative model to capture the joint distribution of molecules and their properties.
Our method achieves very strong performances on various molecule design tasks.
arXiv Detail & Related papers (2023-06-09T03:04:21Z) - Functional-Group-Based Diffusion for Pocket-Specific Molecule Generation and Elaboration [63.23362798102195]
We propose D3FG, a functional-group-based diffusion model for pocket-specific molecule generation and elaboration.
D3FG decomposes molecules into two categories of components: functional groups defined as rigid bodies and linkers as mass points.
In the experiments, our method can generate molecules with more realistic 3D structures, competitive affinities toward the protein targets, and better drug properties.
arXiv Detail & Related papers (2023-05-30T06:41:20Z) - DiffBP: Generative Diffusion of 3D Molecules for Target Protein Binding [51.970607704953096]
Previous works usually generate atoms in an auto-regressive way, where element types and 3D coordinates of atoms are generated one by one.
In real-world molecular systems, the interactions among atoms in an entire molecule are global, leading to the energy function pair-coupled among atoms.
In this work, a generative diffusion model for molecular 3D structures based on target proteins is established, at a full-atom level in a non-autoregressive way.
arXiv Detail & Related papers (2022-11-21T07:02:15Z) - Equivariant 3D-Conditional Diffusion Models for Molecular Linker Design [82.23006955069229]
We propose DiffLinker, an E(3)-equivariant 3D-conditional diffusion model for molecular linker design.
Our model places missing atoms in between and designs a molecule incorporating all the initial fragments.
We demonstrate that DiffLinker outperforms other methods on the standard datasets generating more diverse and synthetically-accessible molecules.
arXiv Detail & Related papers (2022-10-11T09:13:37Z) - MIMOSA: Multi-constraint Molecule Sampling for Molecule Optimization [51.00815310242277]
generative models and reinforcement learning approaches made initial success, but still face difficulties in simultaneously optimizing multiple drug properties.
We propose the MultI-constraint MOlecule SAmpling (MIMOSA) approach, a sampling framework to use input molecule as an initial guess and sample molecules from the target distribution.
arXiv Detail & Related papers (2020-10-05T20:18:42Z) - Multi-Objective Molecule Generation using Interpretable Substructures [38.637412590671865]
Drug discovery aims to find novel compounds with specified chemical property profiles.
The goal is to learn to sample molecules in the intersection of multiple property constraints.
We propose to offset this complexity by composing molecules from a vocabulary of substructures that we call molecular rationales.
arXiv Detail & Related papers (2020-02-08T22:55:37Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.