Pandemic Drugs at Pandemic Speed: Accelerating COVID-19 Drug Discovery
with Hybrid Machine Learning- and Physics-based Simulations on High
Performance Computers
- URL: http://arxiv.org/abs/2103.02843v1
- Date: Thu, 4 Mar 2021 05:43:18 GMT
- Title: Pandemic Drugs at Pandemic Speed: Accelerating COVID-19 Drug Discovery
with Hybrid Machine Learning- and Physics-based Simulations on High
Performance Computers
- Authors: Agastya P. Bhati, Shunzhou Wan, Dario Alf\`e, Austin R. Clyde, Mathis
Bode, Li Tan, Mikhail Titov, Andre Merzky, Matteo Turilli, Shantenu Jha,
Roger R. Highfield, Walter Rocchia, Nicola Scafuri, Sauro Succi, Dieter
Kranzlm\"uller, Gerald Mathias, David Wifling, Yann Donon, Alberto Di Meglio,
Sofia Vallecorsa, Heng Ma, Anda Trifan, Arvind Ramanathan, Tom Brettin,
Alexander Partin, Fangfang Xia, Xiaotan Duan, Rick Stevens, Peter V. Coveney
- Abstract summary: Existing drug discovery process is expensive, inefficient and slow.
New opportunities to accelerate drug discovery lie at the interface between machine learning methods and physics-based methods.
Here, we present an innovative method that combines both approaches to accelerate drug discovery.
- Score: 36.11665744919561
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: The race to meet the challenges of the global pandemic has served as a
reminder that the existing drug discovery process is expensive, inefficient and
slow. There is a major bottleneck screening the vast number of potential small
molecules to shortlist lead compounds for antiviral drug development. New
opportunities to accelerate drug discovery lie at the interface between machine
learning methods, in this case developed for linear accelerators, and
physics-based methods. The two in silico methods, each have their own
advantages and limitations which, interestingly, complement each other. Here,
we present an innovative method that combines both approaches to accelerate
drug discovery. The scale of the resulting workflow is such that it is
dependent on high performance computing. We have demonstrated the applicability
of this workflow on four COVID-19 target proteins and our ability to perform
the required large-scale calculations to identify lead compounds on a variety
of supercomputers.
Related papers
- Physical formula enhanced multi-task learning for pharmacokinetics prediction [54.13787789006417]
A major challenge for AI-driven drug discovery is the scarcity of high-quality data.
We develop a formula enhanced mul-ti-task learning (PEMAL) method that predicts four key parameters of pharmacokinetics simultaneously.
Our experiments reveal that PEMAL significantly lowers the data demand, compared to typical Graph Neural Networks.
arXiv Detail & Related papers (2024-04-16T07:42:55Z) - CardiGraphormer: Unveiling the Power of Self-Supervised Learning in
Revolutionizing Drug Discovery [0.32634122554914]
CardiGraphormer is a novel combination of Graphormer and Cardinality Preserving Attention.
SSL performs to learn potent molecular representations and employs GNNs to extract molecular fingerprints.
CardiGraphormer's potential applications in drug discovery and drug interactions are vast.
arXiv Detail & Related papers (2023-07-03T08:58:32Z) - HD-Bind: Encoding of Molecular Structure with Low Precision,
Hyperdimensional Binary Representations [3.3934198248179026]
Hyperdimensional Computing (HDC) is a proposed learning paradigm that is able to leverage low-precision binary vector arithmetic.
We show that HDC-based inference methods are as much as 90 times more efficient than more complex representative machine learning methods.
arXiv Detail & Related papers (2023-03-27T21:21:46Z) - Drug Synergistic Combinations Predictions via Large-Scale Pre-Training
and Graph Structure Learning [82.93806087715507]
Drug combination therapy is a well-established strategy for disease treatment with better effectiveness and less safety degradation.
Deep learning models have emerged as an efficient way to discover synergistic combinations.
Our framework achieves state-of-the-art results in comparison with other deep learning-based methods.
arXiv Detail & Related papers (2023-01-14T15:07:43Z) - ImDrug: A Benchmark for Deep Imbalanced Learning in AI-aided Drug
Discovery [79.08833067391093]
Real-world pharmaceutical datasets often exhibit highly imbalanced distribution.
We introduce ImDrug, a benchmark with an open-source Python library which consists of 4 imbalance settings, 11 AI-ready datasets, 54 learning tasks and 16 baseline algorithms tailored for imbalanced learning.
It provides an accessible and customizable testbed for problems and solutions spanning a broad spectrum of the drug discovery pipeline.
arXiv Detail & Related papers (2022-09-16T13:35:57Z) - SSM-DTA: Breaking the Barriers of Data Scarcity in Drug-Target Affinity
Prediction [127.43571146741984]
Drug-Target Affinity (DTA) is of vital importance in early-stage drug discovery.
wet experiments remain the most reliable method, but they are time-consuming and resource-intensive.
Existing methods have primarily focused on developing techniques based on the available DTA data, without adequately addressing the data scarcity issue.
We present the SSM-DTA framework, which incorporates three simple yet highly effective strategies.
arXiv Detail & Related papers (2022-06-20T14:53:25Z) - LIMO: Latent Inceptionism for Targeted Molecule Generation [14.391216237573369]
We present Latent Inceptionism on Molecules (LIMO), which significantly accelerates molecule generation with an inceptionism-like technique.
Comprehensive experiments show that LIMO performs competitively on benchmark tasks.
One of our generated drug-like compounds has a predicted $K_D$ of $6 cdot 10-14$ M against the human estrogen receptor.
arXiv Detail & Related papers (2022-06-17T21:05:58Z) - MoleHD: Ultra-Low-Cost Drug Discovery using Hyperdimensional Computing [2.7462881838152913]
We present MoleHD, a method based on brain-inspired hyperdimensional computing (HDC) for molecular property prediction.
MoleHD achieves highest ROC-AUC score on random and scaffold splits on average across 3 datasets.
To the best of our knowledge, this is the first HDC-based method for drug discovery.
arXiv Detail & Related papers (2021-06-05T13:33:21Z) - A Review of Biomedical Datasets Relating to Drug Discovery: A Knowledge
Graph Perspective [4.746544835197422]
We aim to help guide machine learning and knowledge graph practitioners who are interested in applying new techniques to the drug discovery field.
We detail publicly available primary data sources containing information suitable for use in constructing various drug discovery focused knowledge graphs.
arXiv Detail & Related papers (2021-02-19T17:49:38Z) - Neural networks for Anatomical Therapeutic Chemical (ATC) [83.73971067918333]
We propose combining multiple multi-label classifiers trained on distinct sets of features, including sets extracted from a Bidirectional Long Short-Term Memory Network (BiLSTM)
Experiments demonstrate the power of this approach, which is shown to outperform the best methods reported in the literature.
arXiv Detail & Related papers (2021-01-22T19:49:47Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.