Leveraging Side Information for Ligand Conformation Generation using
Diffusion-Based Approaches
- URL: http://arxiv.org/abs/2309.16684v1
- Date: Wed, 2 Aug 2023 09:56:47 GMT
- Title: Leveraging Side Information for Ligand Conformation Generation using
Diffusion-Based Approaches
- Authors: Jiamin Wu, He Cao, Yuan Yao
- Abstract summary: Ligand molecule conformation generation is a critical challenge in drug discovery.
Deep learning models have been developed to tackle this problem.
These models often generate conformations that lack meaningful structure and randomness due to the absence of essential side information.
- Score: 12.71967232020327
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Ligand molecule conformation generation is a critical challenge in drug
discovery. Deep learning models have been developed to tackle this problem,
particularly through the use of generative models in recent years. However,
these models often generate conformations that lack meaningful structure and
randomness due to the absence of essential side information. Examples of such
side information include the chemical and geometric features of the target
protein, ligand-target compound interactions, and ligand chemical properties.
Without these constraints, the generated conformations may not be suitable for
further selection and design of new drugs. To address this limitation, we
propose a novel method for generating ligand conformations that leverage side
information and incorporate flexible constraints into standard diffusion
models. Drawing inspiration from the concept of message passing, we introduce
ligand-target massage passing block, a mechanism that facilitates the exchange
of information between target nodes and ligand nodes, thereby incorporating
target node features. To capture non-covalent interactions, we introduce
ligand-target compound inter and intra edges. To further improve the biological
relevance of the generated conformations, we train energy models using scalar
chemical features. These models guide the progress of the standard Denoising
Diffusion Probabilistic Models, resulting in more biologically meaningful
conformations. We evaluate the performance of SIDEGEN using the PDBBind-2020
dataset, comparing it against other methods. The results demonstrate
improvements in both Aligned RMSD and Ligand RMSD evaluations. Specifically,
our model outperforms GeoDiff (trained on PDBBind-2020) by 20% in terms of the
median aligned RMSD metric.
Related papers
- Aligning Target-Aware Molecule Diffusion Models with Exact Energy Optimization [147.7899503829411]
We propose a novel and general alignment framework to align pretrained target diffusion models with preferred functional properties, named AliDiff.
AliDiff shifts the target-conditioned chemical distribution towards regions with higher binding affinity and structural rationality, specified by user-defined reward functions.
We show that AliDiff can generate molecules with state-of-the-art binding energies with up to -7.07 Avg. Vina Score, while maintaining strong molecular properties.
arXiv Detail & Related papers (2024-07-01T06:10:29Z) - TAGMol: Target-Aware Gradient-guided Molecule Generation [19.977071499171903]
3D generative models have shown significant promise in structure-based drug design (SBDD)
We decouple the problem into molecular generation and property prediction.
The latter synergistically guides the diffusion sampling process, facilitating guided diffusion and resulting in the creation of meaningful molecules with the desired properties.
We call this guided molecular generation process as TAGMol.
arXiv Detail & Related papers (2024-06-03T14:43:54Z) - AUTODIFF: Autoregressive Diffusion Modeling for Structure-based Drug Design [16.946648071157618]
We propose a diffusion-based fragment-wise autoregressive generation model for structure-based drug design (SBDD)
We design a novel molecule assembly strategy named conformal motif that preserves the conformation of local structures of molecules first.
We then encode the interaction of the protein-ligand complex with an SE(3)-equivariant convolutional network and generate molecules motif-by-motif with diffusion modeling.
arXiv Detail & Related papers (2024-04-02T14:44:02Z) - DecompOpt: Controllable and Decomposed Diffusion Models for Structure-based Molecular Optimization [49.85944390503957]
DecompOpt is a structure-based molecular optimization method based on a controllable and diffusion model.
We show that DecompOpt can efficiently generate molecules with improved properties than strong de novo baselines.
arXiv Detail & Related papers (2024-03-07T02:53:40Z) - DecompDiff: Diffusion Models with Decomposed Priors for Structure-Based Drug Design [62.68420322996345]
Existing structured-based drug design methods treat all ligand atoms equally.
We propose a new diffusion model, DecompDiff, with decomposed priors over arms and scaffold.
Our approach achieves state-of-the-art performance in generating high-affinity molecules.
arXiv Detail & Related papers (2024-02-26T05:21:21Z) - Delta Score: Improving the Binding Assessment of Structure-Based Drug
Design Methods [14.272327734087598]
We introduce the delta score, a novel evaluation metric grounded in tangible pharmaceutical requisites.
Our experiments reveal that molecules produced by current deep generative models significantly lag behind ground reference truth when assessed with the delta score.
arXiv Detail & Related papers (2023-11-01T08:37:39Z) - Modular Flows: Differential Molecular Generation [18.41106104201439]
Flows can generate molecules effectively by inverting the encoding process.
Existing flow models require artifactual dequantization or specific node/edge orderings.
We develop continuous normalizing E(3)-equivariant flows, based on a system of node ODEs and a graph PDE.
Our models can be cast as message-passing temporal networks, and result in superlative performance on the tasks of density estimation and molecular generation.
arXiv Detail & Related papers (2022-10-12T09:08:35Z) - Retrieval-based Controllable Molecule Generation [63.44583084888342]
We propose a new retrieval-based framework for controllable molecule generation.
We use a small set of molecules to steer the pre-trained generative model towards synthesizing molecules that satisfy the given design criteria.
Our approach is agnostic to the choice of generative models and requires no task-specific fine-tuning.
arXiv Detail & Related papers (2022-08-23T17:01:16Z) - From Static to Dynamic Structures: Improving Binding Affinity Prediction
with a Graph-Based Deep Learning Model [33.92165575735532]
Accurate prediction of the protein-ligand binding affinities is an essential challenge in the structure-based drug design.
Here, we curated an MD dataset containing 3,218 different protein-ligand complexes, and developed Dynaformer, a graph-based deep learning model.
Dynaformer was able to accurately predict the binding affinities by learning the geometric characteristics of the protein-ligand interactions from the MD trajectories.
arXiv Detail & Related papers (2022-08-19T14:55:12Z) - Score-based Generative Modeling of Graphs via the System of Stochastic
Differential Equations [57.15855198512551]
We propose a novel score-based generative model for graphs with a continuous-time framework.
We show that our method is able to generate molecules that lie close to the training distribution yet do not violate the chemical valency rule.
arXiv Detail & Related papers (2022-02-05T08:21:04Z) - Distance-aware Molecule Graph Attention Network for Drug-Target Binding
Affinity Prediction [54.93890176891602]
We propose a diStance-aware Molecule graph Attention Network (S-MAN) tailored to drug-target binding affinity prediction.
As a dedicated solution, we first propose a position encoding mechanism to integrate the topological structure and spatial position information into the constructed pocket-ligand graph.
We also propose a novel edge-node hierarchical attentive aggregation structure which has edge-level aggregation and node-level aggregation.
arXiv Detail & Related papers (2020-12-17T17:44:01Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.