Apo2Mol: 3D Molecule Generation via Dynamic Pocket-Aware Diffusion Models
- URL: http://arxiv.org/abs/2511.14559v1
- Date: Tue, 18 Nov 2025 15:01:27 GMT
- Title: Apo2Mol: 3D Molecule Generation via Dynamic Pocket-Aware Diffusion Models
- Authors: Xinzhe Zheng, Shiyu Jiang, Gustavo Seabra, Chenglong Li, Yanjun Li,
- Abstract summary: We propose Apo2Mol, a diffusion-based generative framework for 3D molecule design.<n>Apo2Mol explicitly accounts for conformational flexibility in protein binding pockets.<n>It can achieve state-of-the-art performance in generating high-affinity and accurately capture realistic protein pocket conformational changes.
- Score: 10.08056210949295
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Deep generative models are rapidly advancing structure-based drug design, offering substantial promise for generating small molecule ligands that bind to specific protein targets. However, most current approaches assume a rigid protein binding pocket, neglecting the intrinsic flexibility of proteins and the conformational rearrangements induced by ligand binding, limiting their applicability in practical drug discovery. Here, we propose Apo2Mol, a diffusion-based generative framework for 3D molecule design that explicitly accounts for conformational flexibility in protein binding pockets. To support this, we curate a dataset of over 24,000 experimentally resolved apo-holo structure pairs from the Protein Data Bank, enabling the characterization of protein structure changes associated with ligand binding. Apo2Mol employs a full-atom hierarchical graph-based diffusion model that simultaneously generates 3D ligand molecules and their corresponding holo pocket conformations from input apo states. Empirical studies demonstrate that Apo2Mol can achieve state-of-the-art performance in generating high-affinity ligands and accurately capture realistic protein pocket conformational changes.
Related papers
- Dynamics-inspired Structure Hallucination for Protein-protein Interaction Modeling [60.57197355431804]
Protein-protein interaction (PPI) represents a central challenge within the biology field.<n>Deep learning has shown promise in forecasting the effects of such mutations, but is hindered by two primary constraints.<n>We present a novel framework named Refine-PPI with two key enhancements.
arXiv Detail & Related papers (2026-01-08T19:29:04Z) - A 3D pocket-aware and evolutionary conserved interaction guided diffusion model for molecular optimization [7.254508464118023]
We develop a new 3D target-aware diffusion model DiffDecip for molecule optimization through scaffold decoration.<n>The model performance revealed that DiffDecip outperforms baseline model DiffDec on molecule optimization towards higher affinity through forming more non-covalent interactions with highly conserved residues in the protein pocket.
arXiv Detail & Related papers (2025-05-09T08:33:45Z) - Integrating Protein Dynamics into Structure-Based Drug Design via Full-Atom Stochastic Flows [29.49146207945794]
Traditional structure-based drug design (SBDD) approaches typically target binding sites with rigid structures.<n>We propose to use generative modeling for SBDD considering conformational changes of protein pockets.<n>We show that DynamicFlow learns to transform apo pockets and noisy pockets into holo pockets and corresponding 3D molecules.
arXiv Detail & Related papers (2025-03-06T00:34:44Z) - Molecule Generation for Target Protein Binding with Hierarchical Consistency Diffusion Model [17.885767456439215]
Atom-Motif Consistency Diffusion Model (AMDiff) is a hierarchical diffusion architecture that integrates both atom- and motif-level views of molecules.<n>Compared to existing approaches, AMDiff exhibits superior validity and novelty in generating molecules tailored to fit various protein pockets.
arXiv Detail & Related papers (2025-03-02T17:54:30Z) - DPLM-2: A Multimodal Diffusion Protein Language Model [75.98083311705182]
We introduce DPLM-2, a multimodal protein foundation model that extends discrete diffusion protein language model (DPLM) to accommodate both sequences and structures.
DPLM-2 learns the joint distribution of sequence and structure, as well as their marginals and conditionals.
Empirical evaluation shows that DPLM-2 can simultaneously generate highly compatible amino acid sequences and their corresponding 3D structures.
arXiv Detail & Related papers (2024-10-17T17:20:24Z) - DecompDiff: Diffusion Models with Decomposed Priors for Structure-Based Drug Design [62.68420322996345]
Existing structured-based drug design methods treat all ligand atoms equally.
We propose a new diffusion model, DecompDiff, with decomposed priors over arms and scaffold.
Our approach achieves state-of-the-art performance in generating high-affinity molecules.
arXiv Detail & Related papers (2024-02-26T05:21:21Z) - Functional-Group-Based Diffusion for Pocket-Specific Molecule Generation and Elaboration [63.23362798102195]
We propose D3FG, a functional-group-based diffusion model for pocket-specific molecule generation and elaboration.
D3FG decomposes molecules into two categories of components: functional groups defined as rigid bodies and linkers as mass points.
In the experiments, our method can generate molecules with more realistic 3D structures, competitive affinities toward the protein targets, and better drug properties.
arXiv Detail & Related papers (2023-05-30T06:41:20Z) - A Latent Diffusion Model for Protein Structure Generation [50.74232632854264]
We propose a latent diffusion model that can reduce the complexity of protein modeling.
We show that our method can effectively generate novel protein backbone structures with high designability and efficiency.
arXiv Detail & Related papers (2023-05-06T19:10:19Z) - State-specific protein-ligand complex structure prediction with a
multi-scale deep generative model [68.28309982199902]
We present NeuralPLexer, a computational approach that can directly predict protein-ligand complex structures.
Our study suggests that a data-driven approach can capture the structural cooperativity between proteins and small molecules, showing promise in accelerating the design of enzymes, drug molecules, and beyond.
arXiv Detail & Related papers (2022-09-30T01:46:38Z) - Widely Used and Fast De Novo Drug Design by a Protein Sequence-Based
Reinforcement Learning Model [4.815696666006742]
Structure-based de novo method can overcome the data scarcity of active by incorporating drug-target interaction into deep generative architectures.
Here, we demonstrate a widely used and fast protein sequence-based reinforcement learning model for drug discovery.
As a proof of concept, the RL model was utilized to design molecules for four targets.
arXiv Detail & Related papers (2022-08-14T10:41:52Z) - In-Pocket 3D Graphs Enhance Ligand-Target Compatibility in Generative
Small-Molecule Creation [0.0]
We present a graph-based generative modeling technology that encodes explicit 3D protein-ligand contacts within a relational graph architecture.
The models combine a conditional variational autoencoder that allows for activity-specific molecule generation with putative contact generation that provides predictions of molecular interactions within the target binding pocket.
arXiv Detail & Related papers (2022-04-05T22:53:51Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.