Accelerating Inference in Molecular Diffusion Models with Latent Representations of Protein Structure
- URL: http://arxiv.org/abs/2311.13466v2
- Date: Wed, 8 May 2024 21:04:32 GMT
- Title: Accelerating Inference in Molecular Diffusion Models with Latent Representations of Protein Structure
- Authors: Ian Dunn, David Ryan Koes,
- Abstract summary: Diffusion generative models operate directly on 3D molecular structures.
We present a novel GNN-based architecture for learning latent representations of molecular structure.
Our model achieves comparable performance to one with an all-atom protein representation while exhibiting a 3-fold reduction in inference time.
- Score: 0.0
- License: http://creativecommons.org/licenses/by-sa/4.0/
- Abstract: Diffusion generative models have emerged as a powerful framework for addressing problems in structural biology and structure-based drug design. These models operate directly on 3D molecular structures. Due to the unfavorable scaling of graph neural networks (GNNs) with graph size as well as the relatively slow inference speeds inherent to diffusion models, many existing molecular diffusion models rely on coarse-grained representations of protein structure to make training and inference feasible. However, such coarse-grained representations discard essential information for modeling molecular interactions and impair the quality of generated structures. In this work, we present a novel GNN-based architecture for learning latent representations of molecular structure. When trained end-to-end with a diffusion model for de novo ligand design, our model achieves comparable performance to one with an all-atom protein representation while exhibiting a 3-fold reduction in inference time.
Related papers
- Diffusion Models in $\textit{De Novo}$ Drug Design [0.0]
Diffusion models have emerged as powerful tools for molecular generation, particularly in the context of 3D molecular structures.
This review focuses on the technical implementation of diffusion models tailored for 3D molecular generation.
arXiv Detail & Related papers (2024-06-07T06:34:13Z) - SubGDiff: A Subgraph Diffusion Model to Improve Molecular Representation Learning [14.338345772161102]
We propose a novel diffusion model termed SubGDiff for involving the molecular subgraph information in diffusion.
SubGDiff adopts three vital techniques: subgraph prediction, expectation state, and k-step same subgraph diffusion.
Experimentally, extensive downstream tasks demonstrate the superior performance of our approach.
arXiv Detail & Related papers (2024-05-09T10:37:33Z) - AUTODIFF: Autoregressive Diffusion Modeling for Structure-based Drug Design [16.946648071157618]
We propose a diffusion-based fragment-wise autoregressive generation model for structure-based drug design (SBDD)
We design a novel molecule assembly strategy named conformal motif that preserves the conformation of local structures of molecules first.
We then encode the interaction of the protein-ligand complex with an SE(3)-equivariant convolutional network and generate molecules motif-by-motif with diffusion modeling.
arXiv Detail & Related papers (2024-04-02T14:44:02Z) - Molecule Design by Latent Space Energy-Based Modeling and Gradual
Distribution Shifting [53.44684898432997]
Generation of molecules with desired chemical and biological properties is critical for drug discovery.
We propose a probabilistic generative model to capture the joint distribution of molecules and their properties.
Our method achieves very strong performances on various molecule design tasks.
arXiv Detail & Related papers (2023-06-09T03:04:21Z) - Towards Predicting Equilibrium Distributions for Molecular Systems with
Deep Learning [60.02391969049972]
We introduce a novel deep learning framework, called Distributional Graphormer (DiG), in an attempt to predict the equilibrium distribution of molecular systems.
DiG employs deep neural networks to transform a simple distribution towards the equilibrium distribution, conditioned on a descriptor of a molecular system.
arXiv Detail & Related papers (2023-06-08T17:12:08Z) - MUDiff: Unified Diffusion for Complete Molecule Generation [104.7021929437504]
We present a new model for generating a comprehensive representation of molecules, including atom features, 2D discrete molecule structures, and 3D continuous molecule coordinates.
We propose a novel graph transformer architecture to denoise the diffusion process.
Our model is a promising approach for designing stable and diverse molecules and can be applied to a wide range of tasks in molecular modeling.
arXiv Detail & Related papers (2023-04-28T04:25:57Z) - An Equivariant Generative Framework for Molecular Graph-Structure
Co-Design [54.92529253182004]
We present MolCode, a machine learning-based generative framework for underlineMolecular graph-structure underlineCo-design.
In MolCode, 3D geometric information empowers the molecular 2D graph generation, which in turn helps guide the prediction of molecular 3D structure.
Our investigation reveals that the 2D topology and 3D geometry contain intrinsically complementary information in molecule design.
arXiv Detail & Related papers (2023-04-12T13:34:22Z) - Modeling Molecular Structures with Intrinsic Diffusion Models [2.487445341407889]
This thesis proposes Intrinsic Diffusion Modeling.
It combines diffusion generative models with scientific knowledge about the flexibility of biological complexes.
We demonstrate the effectiveness of this approach on two fundamental tasks at the basis of computational chemistry and biology.
arXiv Detail & Related papers (2023-02-23T03:26:48Z) - MolCPT: Molecule Continuous Prompt Tuning to Generalize Molecular
Representation Learning [77.31492888819935]
We propose a novel paradigm of "pre-train, prompt, fine-tune" for molecular representation learning, named molecule continuous prompt tuning (MolCPT)
MolCPT defines a motif prompting function that uses the pre-trained model to project the standalone input into an expressive prompt.
Experiments on several benchmark datasets show that MolCPT efficiently generalizes pre-trained GNNs for molecular property prediction.
arXiv Detail & Related papers (2022-12-20T19:32:30Z) - Predicting Aqueous Solubility of Organic Molecules Using Deep Learning
Models with Varied Molecular Representations [3.10678679607547]
The goal of this study is to develop a general model capable of predicting the solubility of a broad range of organic molecules.
Using the largest currently available solubility dataset, we implement deep learning-based models to predict solubility from molecular structure.
We find that models using molecular descriptors achieve the best performance, with GNN models also achieving good performance.
arXiv Detail & Related papers (2021-05-26T15:54:54Z) - Learning Neural Generative Dynamics for Molecular Conformation
Generation [89.03173504444415]
We study how to generate molecule conformations (textiti.e., 3D structures) from a molecular graph.
We propose a novel probabilistic framework to generate valid and diverse conformations given a molecular graph.
arXiv Detail & Related papers (2021-02-20T03:17:58Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.