Self-supervised Learning for Label Sparsity in Computational Drug
Repositioning
- URL: http://arxiv.org/abs/2206.00262v1
- Date: Wed, 1 Jun 2022 06:43:23 GMT
- Title: Self-supervised Learning for Label Sparsity in Computational Drug
Repositioning
- Authors: Xinxing Yang, Genke Yang, Jian Chu
- Abstract summary: computational drug repositioning aims to discover new uses for marketed drugs.
The number of validated drug-disease associations is scarce compared to the number of drugs and diseases in the real world.
We propose a multi-task self-supervised learning framework for computational drug repositioning.
- Score: 2.523552067304274
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: The computational drug repositioning aims to discover new uses for marketed
drugs, which can accelerate the drug development process and play an important
role in the existing drug discovery system. However, the number of validated
drug-disease associations is scarce compared to the number of drugs and
diseases in the real world. Too few labeled samples will make the
classification model unable to learn effective latent factors of drugs,
resulting in poor generalization performance. In this work, we propose a
multi-task self-supervised learning framework for computational drug
repositioning. The framework tackles label sparsity by learning a better drug
representation. Specifically, we take the drug-disease association prediction
problem as the main task, and the auxiliary task is to use data augmentation
strategies and contrast learning to mine the internal relationships of the
original drug features, so as to automatically learn a better drug
representation without supervised labels. And through joint training, it is
ensured that the auxiliary task can improve the prediction accuracy of the main
task. More precisely, the auxiliary task improves drug representation and
serving as additional regularization to improve generalization. Furthermore, we
design a multi-input decoding network to improve the reconstruction ability of
the autoencoder model. We evaluate our model using three real-world datasets.
The experimental results demonstrate the effectiveness of the multi-task
self-supervised learning framework, and its predictive ability is superior to
the state-of-the-art model.
Related papers
- DrugAgent: Explainable Drug Repurposing Agent with Large Language Model-based Reasoning [10.528489471229946]
We propose a multi-agent framework to enhance the drug repurposing process using state-of-the-art machine learning techniques and knowledge integration.
Our framework comprises several specialized agents: an AI Agent trains robust drug-target interaction (DTI) models; a Knowledge Graph Agent utilizes the drug-gene interaction database (DGIdb) to systematically extract DTIs.
By integrating outputs from these agents, our system effectively harnesses diverse data sources, including external databases, to propose viable repurposing candidates.
arXiv Detail & Related papers (2024-08-23T21:24:59Z) - Physical formula enhanced multi-task learning for pharmacokinetics prediction [54.13787789006417]
A major challenge for AI-driven drug discovery is the scarcity of high-quality data.
We develop a formula enhanced mul-ti-task learning (PEMAL) method that predicts four key parameters of pharmacokinetics simultaneously.
Our experiments reveal that PEMAL significantly lowers the data demand, compared to typical Graph Neural Networks.
arXiv Detail & Related papers (2024-04-16T07:42:55Z) - Drug Synergistic Combinations Predictions via Large-Scale Pre-Training
and Graph Structure Learning [82.93806087715507]
Drug combination therapy is a well-established strategy for disease treatment with better effectiveness and less safety degradation.
Deep learning models have emerged as an efficient way to discover synergistic combinations.
Our framework achieves state-of-the-art results in comparison with other deep learning-based methods.
arXiv Detail & Related papers (2023-01-14T15:07:43Z) - SSM-DTA: Breaking the Barriers of Data Scarcity in Drug-Target Affinity
Prediction [127.43571146741984]
Drug-Target Affinity (DTA) is of vital importance in early-stage drug discovery.
wet experiments remain the most reliable method, but they are time-consuming and resource-intensive.
Existing methods have primarily focused on developing techniques based on the available DTA data, without adequately addressing the data scarcity issue.
We present the SSM-DTA framework, which incorporates three simple yet highly effective strategies.
arXiv Detail & Related papers (2022-06-20T14:53:25Z) - Deep learning for drug repurposing: methods, databases, and applications [54.08583498324774]
Repurposing existing drugs for new therapies is an attractive solution that accelerates drug development at reduced experimental costs.
In this review, we introduce guidelines on how to utilize deep learning methodologies and tools for drug repurposing.
arXiv Detail & Related papers (2022-02-08T09:42:08Z) - DrugOOD: Out-of-Distribution (OOD) Dataset Curator and Benchmark for
AI-aided Drug Discovery -- A Focus on Affinity Prediction Problems with Noise
Annotations [90.27736364704108]
We present DrugOOD, a systematic OOD dataset curator and benchmark for AI-aided drug discovery.
DrugOOD comes with an open-source Python package that fully automates benchmarking processes.
We focus on one of the most crucial problems in AIDD: drug target binding affinity prediction.
arXiv Detail & Related papers (2022-01-24T12:32:48Z) - A Review of Biomedical Datasets Relating to Drug Discovery: A Knowledge
Graph Perspective [4.746544835197422]
We aim to help guide machine learning and knowledge graph practitioners who are interested in applying new techniques to the drug discovery field.
We detail publicly available primary data sources containing information suitable for use in constructing various drug discovery focused knowledge graphs.
arXiv Detail & Related papers (2021-02-19T17:49:38Z) - Deep Learning for Virtual Screening: Five Reasons to Use ROC Cost
Functions [80.12620331438052]
deep learning has become an important tool for rapid screening of billions of molecules in silico for potential hits containing desired chemical features.
Despite its importance, substantial challenges persist in training these models, such as severe class imbalance, high decision thresholds, and lack of ground truth labels in some datasets.
We argue in favor of directly optimizing the receiver operating characteristic (ROC) in such cases, due to its robustness to class imbalance.
arXiv Detail & Related papers (2020-06-25T08:46:37Z) - Drug-Drug Interaction Prediction with Wasserstein Adversarial
Autoencoder-based Knowledge Graph Embeddings [22.562175708415392]
We propose a new knowledge graph embedding framework for drug-drug interactions.
In our framework, the autoencoder is employed to generate high-quality negative samples.
The discriminator learns the embeddings of drugs and interactions based on both positive and negative triplets.
arXiv Detail & Related papers (2020-04-15T21:03:29Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.