An Equivariant Generative Framework for Molecular Graph-Structure
Co-Design
- URL: http://arxiv.org/abs/2304.12436v1
- Date: Wed, 12 Apr 2023 13:34:22 GMT
- Title: An Equivariant Generative Framework for Molecular Graph-Structure
Co-Design
- Authors: Zaixi Zhang, Qi Liu, Chee-Kong Lee, Chang-Yu Hsieh, Enhong Chen
- Abstract summary: We present MolCode, a machine learning-based generative framework for underlineMolecular graph-structure underlineCo-design.
In MolCode, 3D geometric information empowers the molecular 2D graph generation, which in turn helps guide the prediction of molecular 3D structure.
Our investigation reveals that the 2D topology and 3D geometry contain intrinsically complementary information in molecule design.
- Score: 54.92529253182004
- License: http://creativecommons.org/licenses/by-nc-sa/4.0/
- Abstract: Designing molecules with desirable physiochemical properties and
functionalities is a long-standing challenge in chemistry, material science,
and drug discovery. Recently, machine learning-based generative models have
emerged as promising approaches for \emph{de novo} molecule design. However,
further refinement of methodology is highly desired as most existing methods
lack unified modeling of 2D topology and 3D geometry information and fail to
effectively learn the structure-property relationship for molecule design. Here
we present MolCode, a roto-translation equivariant generative framework for
\underline{Mol}ecular graph-structure \underline{Co-de}sign. In MolCode, 3D
geometric information empowers the molecular 2D graph generation, which in turn
helps guide the prediction of molecular 3D structure. Extensive experimental
results show that MolCode outperforms previous methods on a series of
challenging tasks including \emph{de novo} molecule design, targeted molecule
discovery, and structure-based drug design. Particularly, MolCode not only
consistently generates valid (99.95$\%$ Validity) and diverse (98.75$\%$
Uniqueness) molecular graphs/structures with desirable properties, but also
generate drug-like molecules with high affinity to target proteins (61.8$\%$
high-affinity ratio), which demonstrates MolCode's potential applications in
material design and drug discovery. Our extensive investigation reveals that
the 2D topology and 3D geometry contain intrinsically complementary information
in molecule design, and provide new insights into machine learning-based
molecule representation and generation.
Related papers
- GraphXForm: Graph transformer for computer-aided molecular design with application to extraction [73.1842164721868]
We present GraphXForm, a decoder-only graph transformer architecture, which is pretrained on existing compounds and then fine-tuned.
We evaluate it on two solvent design tasks for liquid-liquid extraction, showing that it outperforms four state-of-the-art molecular design techniques.
arXiv Detail & Related papers (2024-11-03T19:45:15Z) - Diffusion Models in $\textit{De Novo}$ Drug Design [0.0]
Diffusion models have emerged as powerful tools for molecular generation, particularly in the context of 3D molecular structures.
This review focuses on the technical implementation of diffusion models tailored for 3D molecular generation.
arXiv Detail & Related papers (2024-06-07T06:34:13Z) - UniIF: Unified Molecule Inverse Folding [67.60267592514381]
We propose a unified model UniIF for inverse folding of all molecules.
Our proposed method surpasses state-of-the-art methods on all tasks.
arXiv Detail & Related papers (2024-05-29T10:26:16Z) - Molecule Joint Auto-Encoding: Trajectory Pretraining with 2D and 3D
Diffusion [19.151643496588022]
We propose a pretraining method for molecule joint auto-encoding (MoleculeJAE)
MoleculeJAE can learn both the 2D bond (topology) and 3D conformation (geometry) information.
Empirically, MoleculeJAE proves its effectiveness by reaching state-of-the-art performance on 15 out of 20 tasks.
arXiv Detail & Related papers (2023-12-06T12:58:37Z) - MUDiff: Unified Diffusion for Complete Molecule Generation [104.7021929437504]
We present a new model for generating a comprehensive representation of molecules, including atom features, 2D discrete molecule structures, and 3D continuous molecule coordinates.
We propose a novel graph transformer architecture to denoise the diffusion process.
Our model is a promising approach for designing stable and diverse molecules and can be applied to a wide range of tasks in molecular modeling.
arXiv Detail & Related papers (2023-04-28T04:25:57Z) - Implicit Geometry and Interaction Embeddings Improve Few-Shot Molecular
Property Prediction [53.06671763877109]
We develop molecular embeddings that encode complex molecular characteristics to improve the performance of few-shot molecular property prediction.
Our approach leverages large amounts of synthetic data, namely the results of molecular docking calculations.
On multiple molecular property prediction benchmarks, training from the embedding space substantially improves Multi-Task, MAML, and Prototypical Network few-shot learning performance.
arXiv Detail & Related papers (2023-02-04T01:32:40Z) - Domain-Agnostic Molecular Generation with Chemical Feedback [44.063584808910896]
MolGen is a pre-trained molecular language model tailored specifically for molecule generation.
It internalizes structural and grammatical insights through the reconstruction of over 100 million molecular SELFIES.
Our chemical feedback paradigm steers the model away from molecular hallucinations, ensuring alignment between the model's estimated probabilities and real-world chemical preferences.
arXiv Detail & Related papers (2023-01-26T17:52:56Z) - Scalable Fragment-Based 3D Molecular Design with Reinforcement Learning [68.8204255655161]
We introduce a novel framework for scalable 3D design that uses a hierarchical agent to build molecules.
In a variety of experiments, we show that our agent, guided only by energy considerations, can efficiently learn to produce molecules with over 100 atoms.
arXiv Detail & Related papers (2022-02-01T18:54:24Z) - Augmenting Molecular Deep Generative Models with Topological Data
Analysis Representations [21.237758981760784]
We present a SMILES Variational Auto-Encoder (VAE) augmented with topological data analysis (TDA) representations of molecules.
Our experiments show that this TDA augmentation enables a SMILES VAE to capture the complex relation between 3D geometry and electronic properties.
arXiv Detail & Related papers (2021-06-08T15:49:21Z) - Learning to design drug-like molecules in three-dimensional space using
deep generative models [0.0]
Ligand Neural Network (L-Net) is a novel graph generative model for designing drug-like molecules with high-quality 3D structures.
L-Net is capable of generating chemically correct, conformationally valid, and highly druglike molecules.
arXiv Detail & Related papers (2021-04-17T07:30:23Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.