Semi-Supervised Hierarchical Drug Embedding in Hyperbolic Space
- URL: http://arxiv.org/abs/2006.00986v1
- Date: Mon, 1 Jun 2020 14:46:31 GMT
- Title: Semi-Supervised Hierarchical Drug Embedding in Hyperbolic Space
- Authors: Ke Yu, Shyam Visweswaran, Kayhan Batmanghelich
- Abstract summary: A drug hierarchy is a valuable source that encodes human knowledge of drug relations in a tree-like structure.
Here, we develop a semi-supervised drug embedding that incorporates two sources of information.
We show that the learned drug embedding can be used to find new uses for existing drugs and to discover side-effects.
- Score: 19.73556079390888
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Learning accurate drug representation is essential for tasks such as
computational drug repositioning and prediction of drug side-effects. A drug
hierarchy is a valuable source that encodes human knowledge of drug relations
in a tree-like structure where drugs that act on the same organs, treat the
same disease, or bind to the same biological target are grouped together.
However, its utility in learning drug representations has not yet been
explored, and currently described drug representations cannot place novel
molecules in a drug hierarchy. Here, we develop a semi-supervised drug
embedding that incorporates two sources of information: (1) underlying chemical
grammar that is inferred from molecular structures of drugs and drug-like
molecules (unsupervised), and (2) hierarchical relations that are encoded in an
expert-crafted hierarchy of approved drugs (supervised). We use the Variational
Auto-Encoder (VAE) framework to encode the chemical structures of molecules and
use the knowledge-based drug-drug similarity to induce the clustering of drugs
in hyperbolic space. The hyperbolic space is amenable for encoding hierarchical
concepts. Both quantitative and qualitative results support that the learned
drug embedding can accurately reproduce the chemical structure and induce the
hierarchical relations among drugs. Furthermore, our approach can infer the
pharmacological properties of novel molecules by retrieving similar drugs from
the embedding space. We demonstrate that the learned drug embedding can be used
to find new uses for existing drugs and to discover side-effects. We show that
it significantly outperforms baselines in both tasks.
Related papers
- DrugCLIP: Contrastive Drug-Disease Interaction For Drug Repurposing [4.969453745531116]
DrugCLIP is a contrastive learning method to learn drug and disease's interaction without negative labels.
We have curated a drug repurposing dataset based on real-world clinical trial records.
arXiv Detail & Related papers (2024-07-02T13:41:59Z) - When SMILES have Language: Drug Classification using Text Classification Methods on Drug SMILES Strings [5.648318448953635]
Complex chemical structures, like drugs, are usually defined by SMILES strings as a sequence of molecules and bonds.
Escaping from complex representation, in this work, we pose a single question: What if we treat drug SMILES as conventional sentences and engage in text classification for drug classification?
The study explores the notion of viewing each atom and bond as sentence components, employing basic NLP methods to categorize drug types.
arXiv Detail & Related papers (2024-03-03T11:09:32Z) - DecompDiff: Diffusion Models with Decomposed Priors for Structure-Based Drug Design [62.68420322996345]
Existing structured-based drug design methods treat all ligand atoms equally.
We propose a new diffusion model, DecompDiff, with decomposed priors over arms and scaffold.
Our approach achieves state-of-the-art performance in generating high-affinity molecules.
arXiv Detail & Related papers (2024-02-26T05:21:21Z) - Emerging Opportunities of Using Large Language Models for Translation
Between Drug Molecules and Indications [6.832024637226738]
We propose a new task, which is the translation between drug molecules and corresponding indications.
The creation of molecules from indications, or vice versa, will allow for more efficient targeting of diseases.
arXiv Detail & Related papers (2024-02-14T21:33:13Z) - A clustering and graph deep learning-based framework for COVID-19 drug
repurposing [0.3359875577705538]
This study presents a novel unsupervised machine learning framework that utilizes a graph-based autoencoder for multi-feature type clustering on heterogeneous drug data.
The dataset consists of 438 drugs, of which 224 are under clinical trials for COVID-19.
Our framework relies on reported drug data, including its pharmacological properties, chemical/physical properties, interaction with the host, and efficacy in different publicly available COVID-19 assays.
arXiv Detail & Related papers (2023-06-24T15:00:47Z) - Drug Synergistic Combinations Predictions via Large-Scale Pre-Training
and Graph Structure Learning [82.93806087715507]
Drug combination therapy is a well-established strategy for disease treatment with better effectiveness and less safety degradation.
Deep learning models have emerged as an efficient way to discover synergistic combinations.
Our framework achieves state-of-the-art results in comparison with other deep learning-based methods.
arXiv Detail & Related papers (2023-01-14T15:07:43Z) - Tailoring Molecules for Protein Pockets: a Transformer-based Generative
Solution for Structured-based Drug Design [133.1268990638971]
De novo drug design based on the structure of a target protein can provide novel drug candidates.
We present a generative solution named TamGent that can directly generate candidate drugs from scratch for a given target.
arXiv Detail & Related papers (2022-08-30T09:32:39Z) - Hyperbolic Molecular Representation Learning for Drug Repositioning [19.73556079390888]
A drug hierarchy is a valuable source that encodes knowledge of relations among drugs in a tree-like structure.
Here, we develop a semi-supervised drug embedding that incorporates two sources of information.
We show that the learned drug embedding can induce the hierarchical relations among drugs.
arXiv Detail & Related papers (2022-07-06T20:20:29Z) - MolDesigner: Interactive Design of Efficacious Drugs with Deep Learning [61.74958429818077]
MolDesigner is a human-in-the-loop web user-interface (UI) for drug developers.
A developer can draw a drug molecule in the interface.
In the backend, more than 17 state-of-the-art DL models generate predictions on important indices that are crucial for a drug's efficacy.
arXiv Detail & Related papers (2020-10-05T21:25:25Z) - Two Step Joint Model for Drug Drug Interaction Extraction [82.49278654043577]
Drug-Drug Interaction (DDI) Extraction from Drug Labels challenge of Text Analysis Conference (TAC) 2018.
We propose a two step joint model to detect DDI and it's related mentions jointly.
A sequence tagging system (CNN-GRU encoder-decoder) finds precipitants first and search its fine-grained Trigger and determine the DDI for each precipitant in the second step.
arXiv Detail & Related papers (2020-08-28T15:30:08Z) - Learning To Navigate The Synthetically Accessible Chemical Space Using
Reinforcement Learning [75.95376096628135]
We propose a novel forward synthesis framework powered by reinforcement learning (RL) for de novo drug design.
In this setup, the agent learns to navigate through the immense synthetically accessible chemical space.
We describe how the end-to-end training in this study represents an important paradigm in radically expanding the synthesizable chemical space.
arXiv Detail & Related papers (2020-04-26T21:40:03Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.