RapidDock: Unlocking Proteome-scale Molecular Docking
- URL: http://arxiv.org/abs/2411.00004v1
- Date: Wed, 16 Oct 2024 09:28:59 GMT
- Title: RapidDock: Unlocking Proteome-scale Molecular Docking
- Authors: Rafał Powalski, Bazyli Klockiewicz, Maciej Jaśkowski, Bartosz Topolski, Paweł Dąbrowski-Tumański, Maciej Wiśniewski, Łukasz Kuciński, Piotr Miłoś, Dariusz Plewczynski,
- Abstract summary: Current molecular docking tools are too slow to screen potential drugs against all relevant proteins.
We introduce RapidDock, an efficient transformer-based model for blind molecular docking.
RapidDock achieves at least a $100 times$ speed advantage over existing methods without compromising accuracy.
- Score: 0.0
- License:
- Abstract: Accelerating molecular docking -- the process of predicting how molecules bind to protein targets -- could boost small-molecule drug discovery and revolutionize medicine. Unfortunately, current molecular docking tools are too slow to screen potential drugs against all relevant proteins, which often results in missed drug candidates or unexpected side effects occurring in clinical trials. To address this gap, we introduce RapidDock, an efficient transformer-based model for blind molecular docking. RapidDock achieves at least a $100 \times$ speed advantage over existing methods without compromising accuracy. On the Posebusters and DockGen benchmarks, our method achieves $52.1\%$ and $44.0\%$ success rates ($\text{RMSD}<2$\r{A}), respectively. The average inference time is $0.04$ seconds on a single GPU, highlighting RapidDock's potential for large-scale docking studies. We examine the key features of RapidDock that enable leveraging the transformer architecture for molecular docking, including the use of relative distance embeddings of $3$D structures in attention matrices, pre-training on protein folding, and a custom loss function invariant to molecular symmetries.
Related papers
- Dockformer: A transformer-based molecular docking paradigm for large-scale virtual screening [29.947687129449278]
Deep learning algorithms can provide data-driven research and development models to increase the speed of the docking process.
A novel deep learning-based docking approach named Dockformer is introduced in this study.
The experimental results show that Dockformer achieves success rates of 90.53% and 82.71% on the PDBbind core set and PoseBusters benchmarks, respectively.
arXiv Detail & Related papers (2024-11-11T06:25:13Z) - Re-Dock: Towards Flexible and Realistic Molecular Docking with Diffusion
Bridge [69.80471117520719]
Re-Dock is a novel diffusion bridge generative model extended to geometric manifold.
We propose energy-to-geometry mapping inspired by the Newton-Euler equation to co-model the binding energy and conformations.
Experiments on designed benchmark datasets including apo-dock and cross-dock demonstrate our model's superior effectiveness and efficiency over current methods.
arXiv Detail & Related papers (2024-02-18T05:04:50Z) - Rigid Protein-Protein Docking via Equivariant Elliptic-Paraboloid
Interface Prediction [19.73508673791042]
The study of rigid protein-protein docking plays an essential role in a variety of tasks such as drug design and protein engineering.
We propose a novel learning-based method called ElliDock, which predicts an elliptic paraboloid to represent the protein-protein docking interface.
By its design, ElliDock is independently equivariant with respect to arbitrary rotations/translations of the proteins.
arXiv Detail & Related papers (2024-01-17T05:39:03Z) - FABind: Fast and Accurate Protein-Ligand Binding [127.7790493202716]
$mathbfFABind$ is an end-to-end model that combines pocket prediction and docking to achieve accurate and fast protein-ligand binding.
Our proposed model demonstrates strong advantages in terms of effectiveness and efficiency compared to existing methods.
arXiv Detail & Related papers (2023-10-10T16:39:47Z) - DockGame: Cooperative Games for Multimeric Rigid Protein Docking [45.970633276976045]
We introduce DockGame, a novel game-theoretic framework for docking.
We view protein docking as a cooperative game between proteins, where the final assembly structure(s) constitute stable equilibria.
On the Docking Benchmark 5.5 dataset, DockGame has much faster runtimes than traditional docking methods.
arXiv Detail & Related papers (2023-10-09T22:02:05Z) - DiffDock-PP: Rigid Protein-Protein Docking with Diffusion Models [47.73386438748902]
DiffDock-PP is a diffusion generative model that learns to translate and rotate unbound protein structures into their bound conformations.
We achieve state-of-the-art performance on DIPS with a median C-RMSD of 4.85, outperforming all considered baselines.
arXiv Detail & Related papers (2023-04-08T02:10:44Z) - Equivariant 3D-Conditional Diffusion Models for Molecular Linker Design [82.23006955069229]
We propose DiffLinker, an E(3)-equivariant 3D-conditional diffusion model for molecular linker design.
Our model places missing atoms in between and designs a molecule incorporating all the initial fragments.
We demonstrate that DiffLinker outperforms other methods on the standard datasets generating more diverse and synthetically-accessible molecules.
arXiv Detail & Related papers (2022-10-11T09:13:37Z) - DiffDock: Diffusion Steps, Twists, and Turns for Molecular Docking [28.225704750892795]
Predicting the binding structure of a small molecule ligand to a protein is critical to drug design.
Recent deep learning methods that treat docking as a regression problem have decreased runtime compared to traditional search-based methods.
We frame molecular docking as a generative modeling problem and develop DiffDock, a diffusion generative model over the non-Euclidean manifold of ligand poses.
arXiv Detail & Related papers (2022-10-04T17:38:14Z) - Protein-Ligand Docking Surrogate Models: A SARS-CoV-2 Benchmark for Deep
Learning Accelerated Virtual Screening [0.2561613071031438]
We show surrogate docking models have six orders of magnitude more throughput than standard docking protocols.
We demonstrate the power of high-speed surrogate models by running each target against 1 billion molecules in under a day.
Our analysis of the speedup explains that to screen more molecules under a docking paradigm, another order of magnitude speedup must come from model accuracy rather than computing speed.
arXiv Detail & Related papers (2021-06-13T16:27:38Z) - MIMOSA: Multi-constraint Molecule Sampling for Molecule Optimization [51.00815310242277]
generative models and reinforcement learning approaches made initial success, but still face difficulties in simultaneously optimizing multiple drug properties.
We propose the MultI-constraint MOlecule SAmpling (MIMOSA) approach, a sampling framework to use input molecule as an initial guess and sample molecules from the target distribution.
arXiv Detail & Related papers (2020-10-05T20:18:42Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.