EGR: Equivariant Graph Refinement and Assessment of 3D Protein Complex
Structures
- URL: http://arxiv.org/abs/2205.10390v1
- Date: Fri, 20 May 2022 18:11:41 GMT
- Title: EGR: Equivariant Graph Refinement and Assessment of 3D Protein Complex
Structures
- Authors: Alex Morehead, Xiao Chen, Tianqi Wu, Jian Liu, Jianlin Cheng
- Abstract summary: We introduce the Equivariant Graph Refiner (EGR), a novel E(3)-equivariant graph neural network (GNN) for multi-task structure refinement and assessment of protein complexes.
Our experiments on new, diverse protein complex datasets, all of which we make publicly available in this work, demonstrate the state-of-the-art effectiveness of EGR.
- Score: 8.494211223965703
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Protein complexes are macromolecules essential to the functioning and
well-being of all living organisms. As the structure of a protein complex, in
particular its region of interaction between multiple protein subunits (i.e.,
chains), has a notable influence on the biological function of the complex,
computational methods that can quickly and effectively be used to refine and
assess the quality of a protein complex's 3D structure can directly be used
within a drug discovery pipeline to accelerate the development of new
therapeutics and improve the efficacy of future vaccines. In this work, we
introduce the Equivariant Graph Refiner (EGR), a novel E(3)-equivariant graph
neural network (GNN) for multi-task structure refinement and assessment of
protein complexes. Our experiments on new, diverse protein complex datasets,
all of which we make publicly available in this work, demonstrate the
state-of-the-art effectiveness of EGR for atomistic refinement and assessment
of protein complexes and outline directions for future work in the field. In
doing so, we establish a baseline for future studies in macromolecular
refinement and structure analysis.
Related papers
- SFM-Protein: Integrative Co-evolutionary Pre-training for Advanced Protein Sequence Representation [97.99658944212675]
We introduce a novel pre-training strategy for protein foundation models.
It emphasizes the interactions among amino acid residues to enhance the extraction of both short-range and long-range co-evolutionary features.
Trained on a large-scale protein sequence dataset, our model demonstrates superior generalization ability.
arXiv Detail & Related papers (2024-10-31T15:22:03Z) - Long-context Protein Language Model [76.95505296417866]
Self-supervised training of language models (LMs) has seen great success for protein sequences in learning meaningful representations and for generative drug design.
Most protein LMs are based on the Transformer architecture trained on individual proteins with short context lengths.
We propose LC-PLM based on an alternative protein LM architecture, BiMamba-S, built off selective structured state-space models.
We also introduce its graph-contextual variant, LC-PLM-G, which contextualizes protein-protein interaction graphs for a second stage of training.
arXiv Detail & Related papers (2024-10-29T16:43:28Z) - Functional Geometry Guided Protein Sequence and Backbone Structure
Co-Design [12.585697288315846]
We propose a model to jointly design Protein sequence and structure based on automatically detected functional sites.
NAEPro is powered by an interleaving network of attention and equivariant layers, which can capture global correlation in a whole sequence.
Experimental results show that our model consistently achieves the highest amino acid recovery rate, TM-score, and the lowest RMSD among all competitors.
arXiv Detail & Related papers (2023-10-06T16:08:41Z) - A Latent Diffusion Model for Protein Structure Generation [50.74232632854264]
We propose a latent diffusion model that can reduce the complexity of protein modeling.
We show that our method can effectively generate novel protein backbone structures with high designability and efficiency.
arXiv Detail & Related papers (2023-05-06T19:10:19Z) - State-specific protein-ligand complex structure prediction with a
multi-scale deep generative model [68.28309982199902]
We present NeuralPLexer, a computational approach that can directly predict protein-ligand complex structures.
Our study suggests that a data-driven approach can capture the structural cooperativity between proteins and small molecules, showing promise in accelerating the design of enzymes, drug molecules, and beyond.
arXiv Detail & Related papers (2022-09-30T01:46:38Z) - DProQ: A Gated-Graph Transformer for Protein Complex Structure
Assessment [7.988932562855392]
DProQ is a gated neighborhood-modulating Graph Transformer (GGT) designed to predict the quality of 3D protein complex structures.
We incorporate node and edge gates within a novel Graph Transformer framework to control information flow during graph message passing.
Our rigorous experiments demonstrate that DProQ achieves state-of-the-art performance in ranking protein complex structures.
arXiv Detail & Related papers (2022-05-21T15:41:46Z) - Learning Geometrically Disentangled Representations of Protein Folding
Simulations [72.03095377508856]
This work focuses on learning a generative neural network on a structural ensemble of a drug-target protein.
Model tasks involve characterizing the distinct structural fluctuations of the protein bound to various drug molecules.
Results show that our geometric learning-based method enjoys both accuracy and efficiency for generating complex structural variations.
arXiv Detail & Related papers (2022-05-20T19:38:00Z) - PersGNN: Applying Topological Data Analysis and Geometric Deep Learning
to Structure-Based Protein Function Prediction [0.07340017786387766]
In this work, we isolate protein structure to make functional annotations for proteins in the Protein Data Bank.
We present PersGNN - an end-to-end trainable deep learning model that combines graph representation learning with topological data analysis.
arXiv Detail & Related papers (2020-10-30T02:24:35Z) - BERTology Meets Biology: Interpreting Attention in Protein Language
Models [124.8966298974842]
We demonstrate methods for analyzing protein Transformer models through the lens of attention.
We show that attention captures the folding structure of proteins, connecting amino acids that are far apart in the underlying sequence, but spatially close in the three-dimensional structure.
We also present a three-dimensional visualization of the interaction between attention and protein structure.
arXiv Detail & Related papers (2020-06-26T21:50:17Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.