AI-Bind: Improving Binding Predictions for Novel Protein Targets and
Ligands
- URL: http://arxiv.org/abs/2112.13168v2
- Date: Tue, 28 Dec 2021 05:29:29 GMT
- Title: AI-Bind: Improving Binding Predictions for Novel Protein Targets and
Ligands
- Authors: Ayan Chatterjee, Omair Shafi Ahmed, Robin Walters, Zohair Shafi, Deisy
Gysi, Rose Yu, Tina Eliassi-Rad, Albert-L\'aszl\'o Barab\'asi and Giulia
Menichetti
- Abstract summary: We show that state-of-the-art models fail to generalize to novel structures.
We introduce AI-Bind, a pipeline that combines network-based sampling strategies with unsupervised pre-training.
We illustrate the value of AI-Bind by predicting drugs and natural compounds with binding affinity to SARS-CoV-2 viral proteins.
- Score: 9.135203550164833
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Identifying novel drug-target interactions (DTI) is a critical and rate
limiting step in drug discovery. While deep learning models have been proposed
to accelerate the identification process, we show that state-of-the-art models
fail to generalize to novel (i.e., never-before-seen) structures. We first
unveil the mechanisms responsible for this shortcoming, demonstrating how
models rely on shortcuts that leverage the topology of the protein-ligand
bipartite network, rather than learning the node features. Then, we introduce
AI-Bind, a pipeline that combines network-based sampling strategies with
unsupervised pre-training, allowing us to limit the annotation imbalance and
improve binding predictions for novel proteins and ligands. We illustrate the
value of AI-Bind by predicting drugs and natural compounds with binding
affinity to SARS-CoV-2 viral proteins and the associated human proteins. We
also validate these predictions via auto-docking simulations and comparison
with recent experimental evidence. Overall, AI-Bind offers a powerful
high-throughput approach to identify drug-target combinations, with the
potential of becoming a powerful tool in drug discovery.
Related papers
- PGraphDTA: Improving Drug Target Interaction Prediction using Protein
Language Models and Contact Maps [4.590060921188914]
Key aspect of drug discovery involves identifying novel drug-target (DT) interactions.
Protein-ligand interactions exhibit a continuum of binding strengths, known as binding affinity.
We propose novel enhancements to enhance their performance.
arXiv Detail & Related papers (2023-10-06T05:00:25Z) - DrugOOD: Out-of-Distribution (OOD) Dataset Curator and Benchmark for
AI-aided Drug Discovery -- A Focus on Affinity Prediction Problems with Noise
Annotations [90.27736364704108]
We present DrugOOD, a systematic OOD dataset curator and benchmark for AI-aided drug discovery.
DrugOOD comes with an open-source Python package that fully automates benchmarking processes.
We focus on one of the most crucial problems in AIDD: drug target binding affinity prediction.
arXiv Detail & Related papers (2022-01-24T12:32:48Z) - Pre-training Co-evolutionary Protein Representation via A Pairwise
Masked Language Model [93.9943278892735]
Key problem in protein sequence representation learning is to capture the co-evolutionary information reflected by the inter-residue co-variation in the sequences.
We propose a novel method to capture this information directly by pre-training via a dedicated language model, i.e., Pairwise Masked Language Model (PMLM)
Our result shows that the proposed method can effectively capture the interresidue correlations and improves the performance of contact prediction by up to 9% compared to the baseline.
arXiv Detail & Related papers (2021-10-29T04:01:32Z) - Improved Drug-target Interaction Prediction with Intermolecular Graph
Transformer [98.8319016075089]
We propose a novel approach to model intermolecular information with a three-way Transformer-based architecture.
Intermolecular Graph Transformer (IGT) outperforms state-of-the-art approaches by 9.1% and 20.5% over the second best for binding activity and binding pose prediction respectively.
IGT exhibits promising drug screening ability against SARS-CoV-2 by identifying 83.1% active drugs that have been validated by wet-lab experiments with near-native predicted binding poses.
arXiv Detail & Related papers (2021-10-14T13:28:02Z) - Structure-aware Interactive Graph Neural Networks for the Prediction of
Protein-Ligand Binding Affinity [52.67037774136973]
Drug discovery often relies on the successful prediction of protein-ligand binding affinity.
Recent advances have shown great promise in applying graph neural networks (GNNs) for better affinity prediction by learning the representations of protein-ligand complexes.
We propose a structure-aware interactive graph neural network (SIGN) which consists of two components: polar-inspired graph attention layers (PGAL) and pairwise interactive pooling (PiPool)
arXiv Detail & Related papers (2021-07-21T03:34:09Z) - Bayesian neural network with pretrained protein embedding enhances
prediction accuracy of drug-protein interaction [3.499870393443268]
Deep learning approaches can predict drug-protein interactions without trial-and-error by humans.
We propose two methods to construct a deep learning framework that exhibits superior performance with a small labeled dataset.
arXiv Detail & Related papers (2020-12-15T10:24:34Z) - PIGNet: A physics-informed deep learning model toward generalized
drug-target interaction predictions [0.0]
We propose two key strategies to enhance generalization in the DTI model.
The first is to predict the atom-atom pairwise interactions via physics-informed equations parameterized with neural networks.
We further improved the model generalization by augmenting a range of binding poses and to broader training data.
arXiv Detail & Related papers (2020-08-22T14:29:58Z) - Deep Learning of High-Order Interactions for Protein Interface
Prediction [58.164371994210406]
We propose to formulate the protein interface prediction as a 2D dense prediction problem.
We represent proteins as graphs and employ graph neural networks to learn node features.
We incorporate high-order pairwise interactions to generate a 3D tensor containing different pairwise interactions.
arXiv Detail & Related papers (2020-07-18T05:39:35Z) - Explainable Deep Relational Networks for Predicting Compound-Protein
Affinities and Contacts [80.69440684790925]
DeepRelations is a physics-inspired deep relational network with intrinsically explainable architecture.
It shows superior interpretability to the state-of-the-art.
It boosts the AUPRC of contact prediction 9.5, 16.9, 19.3 and 5.7-fold for the test, compound-unique, protein-unique, and both-unique sets.
arXiv Detail & Related papers (2019-12-29T00:14:07Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.