Deep Learning for Protein-Ligand Docking: Are We There Yet?
- URL: http://arxiv.org/abs/2405.14108v3
- Date: Sun, 7 Jul 2024 19:12:04 GMT
- Title: Deep Learning for Protein-Ligand Docking: Are We There Yet?
- Authors: Alex Morehead, Nabin Giri, Jian Liu, Jianlin Cheng,
- Abstract summary: We introduce PoseBench, the first comprehensive benchmark for practical protein-ligand docking.
PoseBench enables researchers to rigorously and systematically evaluate DL docking methods for apo-to-holo protein-ligand docking and protein-ligand structure generation.
- Score: 6.138222365802935
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: The effects of ligand binding on protein structures and their in vivo functions carry numerous implications for modern biomedical research and biotechnology development efforts such as drug discovery. Although several deep learning (DL) methods and benchmarks designed for protein-ligand docking have recently been introduced, to date no prior works have systematically studied the behavior of docking methods within the practical context of (1) using predicted (apo) protein structures for docking (e.g., for broad applicability); (2) docking multiple ligands concurrently to a given target protein (e.g., for enzyme design); and (3) having no prior knowledge of binding pockets (e.g., for pocket generalization). To enable a deeper understanding of docking methods' real-world utility, we introduce PoseBench, the first comprehensive benchmark for practical protein-ligand docking. PoseBench enables researchers to rigorously and systematically evaluate DL docking methods for apo-to-holo protein-ligand docking and protein-ligand structure generation using both single and multi-ligand benchmark datasets, the latter of which we introduce for the first time to the DL community. Empirically, using PoseBench, we find that all recent DL docking methods but one fail to generalize to multi-ligand protein targets and also that template-based docking algorithms perform equally well or better for multi-ligand docking as recent single-ligand DL docking methods, suggesting areas of improvement for future work. Code, data, tutorials, and benchmark results are available at https://github.com/BioinfoMachineLearning/PoseBench.
Related papers
- Smiles2Dock: an open large-scale multi-task dataset for ML-based molecular docking [0.0]
We introduce Smiles2Dock, an open large-scale multi-task dataset for molecular docking.
We dock 1.7 million from the ChEMBL database against 15 AlphaFold proteins, giving us more than 25 million protein-ligand binding scores.
Our dataset and code are publicly available to support the development of novel ML-based methods for molecular docking.
arXiv Detail & Related papers (2024-06-09T11:13:03Z) - Re-Dock: Towards Flexible and Realistic Molecular Docking with Diffusion
Bridge [69.80471117520719]
Re-Dock is a novel diffusion bridge generative model extended to geometric manifold.
We propose energy-to-geometry mapping inspired by the Newton-Euler equation to co-model the binding energy and conformations.
Experiments on designed benchmark datasets including apo-dock and cross-dock demonstrate our model's superior effectiveness and efficiency over current methods.
arXiv Detail & Related papers (2024-02-18T05:04:50Z) - Rigid Protein-Protein Docking via Equivariant Elliptic-Paraboloid
Interface Prediction [19.73508673791042]
The study of rigid protein-protein docking plays an essential role in a variety of tasks such as drug design and protein engineering.
We propose a novel learning-based method called ElliDock, which predicts an elliptic paraboloid to represent the protein-protein docking interface.
By its design, ElliDock is independently equivariant with respect to arbitrary rotations/translations of the proteins.
arXiv Detail & Related papers (2024-01-17T05:39:03Z) - Multi-scale Iterative Refinement towards Robust and Versatile Molecular
Docking [17.28573902701018]
Molecular docking is a key computational tool utilized to predict the binding conformations of small molecules to protein targets.
We introduce DeltaDock, a robust and versatile framework designed for efficient molecular docking.
arXiv Detail & Related papers (2023-11-30T14:09:20Z) - FABind: Fast and Accurate Protein-Ligand Binding [127.7790493202716]
$mathbfFABind$ is an end-to-end model that combines pocket prediction and docking to achieve accurate and fast protein-ligand binding.
Our proposed model demonstrates strong advantages in terms of effectiveness and efficiency compared to existing methods.
arXiv Detail & Related papers (2023-10-10T16:39:47Z) - DockGame: Cooperative Games for Multimeric Rigid Protein Docking [45.970633276976045]
We introduce DockGame, a novel game-theoretic framework for docking.
We view protein docking as a cooperative game between proteins, where the final assembly structure(s) constitute stable equilibria.
On the Docking Benchmark 5.5 dataset, DockGame has much faster runtimes than traditional docking methods.
arXiv Detail & Related papers (2023-10-09T22:02:05Z) - DiffDock-PP: Rigid Protein-Protein Docking with Diffusion Models [47.73386438748902]
DiffDock-PP is a diffusion generative model that learns to translate and rotate unbound protein structures into their bound conformations.
We achieve state-of-the-art performance on DIPS with a median C-RMSD of 4.85, outperforming all considered baselines.
arXiv Detail & Related papers (2023-04-08T02:10:44Z) - HelixFold-Single: MSA-free Protein Structure Prediction by Using Protein
Language Model as an Alternative [61.984700682903096]
HelixFold-Single is proposed to combine a large-scale protein language model with the superior geometric learning capability of AlphaFold2.
Our proposed method pre-trains a large-scale protein language model with thousands of millions of primary sequences.
We obtain an end-to-end differentiable model to predict the 3D coordinates of atoms from only the primary sequence.
arXiv Detail & Related papers (2022-07-28T07:30:33Z) - Independent SE(3)-Equivariant Models for End-to-End Rigid Protein
Docking [57.2037357017652]
We tackle rigid body protein-protein docking, i.e., computationally predicting the 3D structure of a protein-protein complex from the individual unbound structures.
We design a novel pairwise-independent SE(3)-equivariant graph matching network to predict the rotation and translation to place one of the proteins at the right docked position.
Our model, named EquiDock, approximates the binding pockets and predicts the docking poses using keypoint matching and alignment.
arXiv Detail & Related papers (2021-11-15T18:46:37Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.