State-specific protein-ligand complex structure prediction with a
multi-scale deep generative model
- URL: http://arxiv.org/abs/2209.15171v2
- Date: Wed, 19 Apr 2023 19:40:00 GMT
- Title: State-specific protein-ligand complex structure prediction with a
multi-scale deep generative model
- Authors: Zhuoran Qiao, Weili Nie, Arash Vahdat, Thomas F. Miller III, Anima
Anandkumar
- Abstract summary: We present NeuralPLexer, a computational approach that can directly predict protein-ligand complex structures.
Our study suggests that a data-driven approach can capture the structural cooperativity between proteins and small molecules, showing promise in accelerating the design of enzymes, drug molecules, and beyond.
- Score: 68.28309982199902
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: The binding complexes formed by proteins and small molecule ligands are
ubiquitous and critical to life. Despite recent advancements in protein
structure prediction, existing algorithms are so far unable to systematically
predict the binding ligand structures along with their regulatory effects on
protein folding. To address this discrepancy, we present NeuralPLexer, a
computational approach that can directly predict protein-ligand complex
structures solely using protein sequence and ligand molecular graph inputs.
NeuralPLexer adopts a deep generative model to sample the 3D structures of the
binding complex and their conformational changes at an atomistic resolution.
The model is based on a diffusion process that incorporates essential
biophysical constraints and a multi-scale geometric deep learning system to
iteratively sample residue-level contact maps and all heavy-atom coordinates in
a hierarchical manner. NeuralPLexer achieves state-of-the-art performance
compared to all existing methods on benchmarks for both protein-ligand blind
docking and flexible binding site structure recovery. Moreover, owing to its
specificity in sampling both ligand-free-state and ligand-bound-state
ensembles, NeuralPLexer consistently outperforms AlphaFold2 in terms of global
protein structure accuracy on both representative structure pairs with large
conformational changes (average TM-score=0.93) and recently determined
ligand-binding proteins (average TM-score=0.89). Case studies reveal that the
predicted conformational variations are consistent with structure determination
experiments for important targets, including human KRAS$^\textrm{G12C}$,
ketol-acid reductoisomerase, and purine GPCRs. Our study suggests that a
data-driven approach can capture the structural cooperativity between proteins
and small molecules, showing promise in accelerating the design of enzymes,
drug molecules, and beyond.
Related papers
- SFM-Protein: Integrative Co-evolutionary Pre-training for Advanced Protein Sequence Representation [97.99658944212675]
We introduce a novel pre-training strategy for protein foundation models.
It emphasizes the interactions among amino acid residues to enhance the extraction of both short-range and long-range co-evolutionary features.
Trained on a large-scale protein sequence dataset, our model demonstrates superior generalization ability.
arXiv Detail & Related papers (2024-10-31T15:22:03Z) - 4D Diffusion for Dynamic Protein Structure Prediction with Reference Guided Motion Alignment [18.90451943620277]
This study introduces an innovative 4D diffusion model incorporating molecular dynamics (MD) simulation data to learn dynamic protein structures.
To our knowledge, this is the first diffusion-based model aimed at predicting protein trajectories across multiple time steps simultaneously.
arXiv Detail & Related papers (2024-08-22T14:12:50Z) - PSC-CPI: Multi-Scale Protein Sequence-Structure Contrasting for
Efficient and Generalizable Compound-Protein Interaction Prediction [63.50967073653953]
Compound-Protein Interaction prediction aims to predict the pattern and strength of compound-protein interactions for rational drug discovery.
Existing deep learning-based methods utilize only the single modality of protein sequences or structures.
We propose a novel multi-scale Protein Sequence-structure Contrasting framework for CPI prediction.
arXiv Detail & Related papers (2024-02-13T03:51:10Z) - FFF: Fragments-Guided Flexible Fitting for Building Complete Protein
Structures [10.682516227941592]
We propose a new method named FFF that bridges protein structure prediction and protein structure recognition with flexible fitting.
First, a multi-level recognition network is used to capture various structural features from the input 3D cryo-EM map.
Next, protein structural fragments are generated using pseudo peptide vectors and a protein sequence alignment method based on these extracted features.
arXiv Detail & Related papers (2023-08-07T15:10:21Z) - Structure-informed Language Models Are Protein Designers [69.70134899296912]
We present LM-Design, a generic approach to reprogramming sequence-based protein language models (pLMs)
We conduct a structural surgery on pLMs, where a lightweight structural adapter is implanted into pLMs and endows it with structural awareness.
Experiments show that our approach outperforms the state-of-the-art methods by a large margin.
arXiv Detail & Related papers (2023-02-03T10:49:52Z) - Protein Sequence and Structure Co-Design with Equivariant Translation [19.816174223173494]
Existing approaches generate both protein sequence and structure using either autoregressive models or diffusion models.
We propose a new approach capable of protein sequence and structure co-design, which iteratively translates both protein sequence and structure into the desired state.
Our model consists of a trigonometry-aware encoder that reasons geometrical constraints and interactions from context features.
All protein amino acids are updated in one shot in each translation step, which significantly accelerates the inference process.
arXiv Detail & Related papers (2022-10-17T06:00:12Z) - Learning Geometrically Disentangled Representations of Protein Folding
Simulations [72.03095377508856]
This work focuses on learning a generative neural network on a structural ensemble of a drug-target protein.
Model tasks involve characterizing the distinct structural fluctuations of the protein bound to various drug molecules.
Results show that our geometric learning-based method enjoys both accuracy and efficiency for generating complex structural variations.
arXiv Detail & Related papers (2022-05-20T19:38:00Z) - Transfer Learning for Protein Structure Classification at Low Resolution [124.5573289131546]
We show that it is possible to make accurate ($geq$80%) predictions of protein class and architecture from structures determined at low ($leq$3A) resolution.
We provide proof of concept for high-speed, low-cost protein structure classification at low resolution, and a basis for extension to prediction of function.
arXiv Detail & Related papers (2020-08-11T15:01:32Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.