Geometric Transformers for Protein Interface Contact Prediction
- URL: http://arxiv.org/abs/2110.02423v1
- Date: Wed, 6 Oct 2021 00:12:15 GMT
- Title: Geometric Transformers for Protein Interface Contact Prediction
- Authors: Alex Morehead, Chen Chen, Jianlin Cheng
- Abstract summary: We present the Geometric Transformer, a novel geometry-evolving graph transformer for rotation and translation-invariant protein interface contact prediction.
DeepInteract predicts partner-specific protein interface contacts given the 3D tertiary structures of two proteins as input.
- Score: 3.031630445636656
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Computational methods for predicting the interface contacts between proteins
come highly sought after for drug discovery as they can significantly advance
the accuracy of alternative approaches, such as protein-protein docking,
protein function analysis tools, and other computational methods for protein
bioinformatics. In this work, we present the Geometric Transformer, a novel
geometry-evolving graph transformer for rotation and translation-invariant
protein interface contact prediction, packaged within DeepInteract, an
end-to-end prediction pipeline. DeepInteract predicts partner-specific protein
interface contacts (i.e., inter-protein residue-residue contacts) given the 3D
tertiary structures of two proteins as input. In rigorous benchmarks,
DeepInteract, on challenging protein complex targets from the new Enhanced
Database of Interacting Protein Structures (DIPS-Plus) and the 13th and 14th
CASP-CAPRI experiments, achieves 17% and 13% top L/5 precision (L: length of a
protein unit in a complex), respectively. In doing so, DeepInteract, with the
Geometric Transformer as its graph-based backbone, outperforms existing methods
for interface contact prediction in addition to other graph-based neural
network backbones compatible with DeepInteract, thereby validating the
effectiveness of the Geometric Transformer for learning rich
relational-geometric features for downstream tasks on 3D protein structures.
Related papers
- EquiPocket: an E(3)-Equivariant Geometric Graph Neural Network for Ligand Binding Site Prediction [49.674494450107005]
Predicting the binding sites of target proteins plays a fundamental role in drug discovery.
Most existing deep-learning methods consider a protein as a 3D image by spatially clustering its atoms into voxels.
This work proposes EquiPocket, an E(3)-equivariant Graph Neural Network (GNN) for binding site prediction.
arXiv Detail & Related papers (2023-02-23T17:18:26Z) - Protein-Ligand Complex Generator & Drug Screening via Tiered Tensor
Transform [18.509174420141832]
We develop an algorithm to generate diverse protein-ligand complex conformations for both pose and affinity estimation in drug screening.
The 3T structure transformation is decoupled from the system physics, making future usage in other computational scientific domains possible.
arXiv Detail & Related papers (2023-01-03T07:33:20Z) - Integration of Pre-trained Protein Language Models into Geometric Deep
Learning Networks [68.90692290665648]
We integrate knowledge learned by protein language models into several state-of-the-art geometric networks.
Our findings show an overall improvement of 20% over baselines.
Strong evidence indicates that the incorporation of protein language models' knowledge enhances geometric networks' capacity by a significant margin.
arXiv Detail & Related papers (2022-12-07T04:04:04Z) - DProQ: A Gated-Graph Transformer for Protein Complex Structure
Assessment [7.988932562855392]
DProQ is a gated neighborhood-modulating Graph Transformer (GGT) designed to predict the quality of 3D protein complex structures.
We incorporate node and edge gates within a novel Graph Transformer framework to control information flow during graph message passing.
Our rigorous experiments demonstrate that DProQ achieves state-of-the-art performance in ranking protein complex structures.
arXiv Detail & Related papers (2022-05-21T15:41:46Z) - Learning Geometrically Disentangled Representations of Protein Folding
Simulations [72.03095377508856]
This work focuses on learning a generative neural network on a structural ensemble of a drug-target protein.
Model tasks involve characterizing the distinct structural fluctuations of the protein bound to various drug molecules.
Results show that our geometric learning-based method enjoys both accuracy and efficiency for generating complex structural variations.
arXiv Detail & Related papers (2022-05-20T19:38:00Z) - Protein 3D structure-based neural networks highly improve the accuracy
in compound-protein binding affinity prediction [7.059949221160259]
We develop Fast Evolutional Attention and Thoroughgoing-graph Neural Networks (FeatNN) to facilitate the application of protein 3D structure information for predicting compound-protein binding affinities (CPAs)
FeatNN considerably outperforms various state-of-the-art baselines in CPA prediction with the Pearson value elevated by about 35.7%.
arXiv Detail & Related papers (2022-03-30T00:44:15Z) - Independent SE(3)-Equivariant Models for End-to-End Rigid Protein
Docking [57.2037357017652]
We tackle rigid body protein-protein docking, i.e., computationally predicting the 3D structure of a protein-protein complex from the individual unbound structures.
We design a novel pairwise-independent SE(3)-equivariant graph matching network to predict the rotation and translation to place one of the proteins at the right docked position.
Our model, named EquiDock, approximates the binding pockets and predicts the docking poses using keypoint matching and alignment.
arXiv Detail & Related papers (2021-11-15T18:46:37Z) - DIPS-Plus: The Enhanced Database of Interacting Protein Structures for
Interface Prediction [2.697420611471228]
We present DIPS-Plus, an enhanced, feature-rich dataset of 42,112 complexes for geometric deep learning of protein interfaces.
The previous version of DIPS contains only the Cartesian coordinates and types of the atoms comprising a given protein complex.
DIPS-Plus now includes a plethora of new residue-level features including protrusion indices, half-sphere amino acid compositions, and new profile hidden Markov model (HMM)-based sequence features for each amino acid.
arXiv Detail & Related papers (2021-06-06T23:56:27Z) - Deep Learning of High-Order Interactions for Protein Interface
Prediction [58.164371994210406]
We propose to formulate the protein interface prediction as a 2D dense prediction problem.
We represent proteins as graphs and employ graph neural networks to learn node features.
We incorporate high-order pairwise interactions to generate a 3D tensor containing different pairwise interactions.
arXiv Detail & Related papers (2020-07-18T05:39:35Z) - Intrinsic-Extrinsic Convolution and Pooling for Learning on 3D Protein
Structures [18.961218808251076]
We propose two new learning operations enabling deep 3D analysis of large-scale protein data.
First, we introduce a novel convolution operator which considers both, the intrinsic (invariant under protein folding) as well as extrinsic (invariant under bonding) structure.
Second, we enable a multi-scale protein analysis by introducing hierarchical pooling operators, exploiting the fact that proteins are a recombination of a finite set of amino acids.
arXiv Detail & Related papers (2020-07-13T09:02:40Z) - BERTology Meets Biology: Interpreting Attention in Protein Language
Models [124.8966298974842]
We demonstrate methods for analyzing protein Transformer models through the lens of attention.
We show that attention captures the folding structure of proteins, connecting amino acids that are far apart in the underlying sequence, but spatially close in the three-dimensional structure.
We also present a three-dimensional visualization of the interaction between attention and protein structure.
arXiv Detail & Related papers (2020-06-26T21:50:17Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.