DeepGS: Deep Representation Learning of Graphs and Sequences for
Drug-Target Binding Affinity Prediction
- URL: http://arxiv.org/abs/2003.13902v2
- Date: Fri, 3 Apr 2020 07:28:11 GMT
- Title: DeepGS: Deep Representation Learning of Graphs and Sequences for
Drug-Target Binding Affinity Prediction
- Authors: Xuan Lin
- Abstract summary: We propose a novel end-to-end learning framework, called DeepGS, which uses deep neural networks to extract the local chemical context from amino acids and SMILES sequences.
We have conducted extensive experiments to compare our proposed method with state-of-the-art models including KronRLS, Sim, DeepDTA and DeepCPI.
- Score: 8.292330541203647
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Accurately predicting drug-target binding affinity (DTA) in silico is a key
task in drug discovery. Most of the conventional DTA prediction methods are
simulation-based, which rely heavily on domain knowledge or the assumption of
having the 3D structure of the targets, which are often difficult to obtain.
Meanwhile, traditional machine learning-based methods apply various features
and descriptors, and simply depend on the similarities between drug-target
pairs. Recently, with the increasing amount of affinity data available and the
success of deep representation learning models on various domains, deep
learning techniques have been applied to DTA prediction. However, these methods
consider either label/one-hot encodings or the topological structure of
molecules, without considering the local chemical context of amino acids and
SMILES sequences. Motivated by this, we propose a novel end-to-end learning
framework, called DeepGS, which uses deep neural networks to extract the local
chemical context from amino acids and SMILES sequences, as well as the
molecular structure from the drugs. To assist the operations on the symbolic
data, we propose to use advanced embedding techniques (i.e., Smi2Vec and
Prot2Vec) to encode the amino acids and SMILES sequences to a distributed
representation. Meanwhile, we suggest a new molecular structure modeling
approach that works well under our framework. We have conducted extensive
experiments to compare our proposed method with state-of-the-art models
including KronRLS, SimBoost, DeepDTA and DeepCPI. Extensive experimental
results demonstrate the superiorities and competitiveness of DeepGS.
Related papers
- Pre-trained Molecular Language Models with Random Functional Group Masking [54.900360309677794]
We propose a SMILES-based underlineem Molecular underlineem Language underlineem Model, which randomly masking SMILES subsequences corresponding to specific molecular atoms.
This technique aims to compel the model to better infer molecular structures and properties, thus enhancing its predictive capabilities.
arXiv Detail & Related papers (2024-11-03T01:56:15Z) - Otter-Knowledge: benchmarks of multimodal knowledge graph representation
learning from different sources for drug discovery [5.913390277336069]
We release a set of multimodal knowledge graphs, integrating data from seven public data sources, and containing over 30 million triples.
Our intention is to foster additional research to explore how multimodal knowledge enhanced protein/molecule embeddings can improve prediction tasks.
arXiv Detail & Related papers (2023-06-22T11:01:41Z) - HD-Bind: Encoding of Molecular Structure with Low Precision,
Hyperdimensional Binary Representations [3.3934198248179026]
Hyperdimensional Computing (HDC) is a proposed learning paradigm that is able to leverage low-precision binary vector arithmetic.
We show that HDC-based inference methods are as much as 90 times more efficient than more complex representative machine learning methods.
arXiv Detail & Related papers (2023-03-27T21:21:46Z) - ResDTA: Predicting Drug-Target Binding Affinity Using Residual Skip
Connections [0.0]
We present a deep learning-based methodology for predicting DT binding affinities using just sequencing information from both targets and drugs.
The proposed model achieves the best Concordance Index (CI) performance in one of the largest benchmark datasets.
arXiv Detail & Related papers (2023-03-20T20:27:11Z) - Drug Synergistic Combinations Predictions via Large-Scale Pre-Training
and Graph Structure Learning [82.93806087715507]
Drug combination therapy is a well-established strategy for disease treatment with better effectiveness and less safety degradation.
Deep learning models have emerged as an efficient way to discover synergistic combinations.
Our framework achieves state-of-the-art results in comparison with other deep learning-based methods.
arXiv Detail & Related papers (2023-01-14T15:07:43Z) - Retrieval-based Controllable Molecule Generation [63.44583084888342]
We propose a new retrieval-based framework for controllable molecule generation.
We use a small set of molecules to steer the pre-trained generative model towards synthesizing molecules that satisfy the given design criteria.
Our approach is agnostic to the choice of generative models and requires no task-specific fine-tuning.
arXiv Detail & Related papers (2022-08-23T17:01:16Z) - SSM-DTA: Breaking the Barriers of Data Scarcity in Drug-Target Affinity
Prediction [127.43571146741984]
Drug-Target Affinity (DTA) is of vital importance in early-stage drug discovery.
wet experiments remain the most reliable method, but they are time-consuming and resource-intensive.
Existing methods have primarily focused on developing techniques based on the available DTA data, without adequately addressing the data scarcity issue.
We present the SSM-DTA framework, which incorporates three simple yet highly effective strategies.
arXiv Detail & Related papers (2022-06-20T14:53:25Z) - Hierarchical Graph Representation Learning for the Prediction of
Drug-Target Binding Affinity [7.023929372010717]
We propose a novel hierarchical graph representation learning model for the drug-target binding affinity prediction, namely HGRL-DTA.
In this paper, we adopt a message broadcasting mechanism to integrate the hierarchical representations learned from the global-level affinity graph and the local-level molecular graph. Besides, we design a similarity-based embedding map to solve the cold start problem of inferring representations for unseen drugs and targets.
arXiv Detail & Related papers (2022-03-22T04:50:16Z) - MoleHD: Ultra-Low-Cost Drug Discovery using Hyperdimensional Computing [2.7462881838152913]
We present MoleHD, a method based on brain-inspired hyperdimensional computing (HDC) for molecular property prediction.
MoleHD achieves highest ROC-AUC score on random and scaffold splits on average across 3 datasets.
To the best of our knowledge, this is the first HDC-based method for drug discovery.
arXiv Detail & Related papers (2021-06-05T13:33:21Z) - A Systematic Approach to Featurization for Cancer Drug Sensitivity
Predictions with Deep Learning [49.86828302591469]
We train >35,000 neural network models, sweeping over common featurization techniques.
We found the RNA-seq to be highly redundant and informative even with subsets larger than 128 features.
arXiv Detail & Related papers (2020-04-30T20:42:17Z) - Learning To Navigate The Synthetically Accessible Chemical Space Using
Reinforcement Learning [75.95376096628135]
We propose a novel forward synthesis framework powered by reinforcement learning (RL) for de novo drug design.
In this setup, the agent learns to navigate through the immense synthetically accessible chemical space.
We describe how the end-to-end training in this study represents an important paradigm in radically expanding the synthesizable chemical space.
arXiv Detail & Related papers (2020-04-26T21:40:03Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.