Subtype-Former: a deep learning approach for cancer subtype discovery
with multi-omics data
- URL: http://arxiv.org/abs/2207.14639v1
- Date: Thu, 28 Jul 2022 08:15:06 GMT
- Title: Subtype-Former: a deep learning approach for cancer subtype discovery
with multi-omics data
- Authors: Hai Yang, Yuhang Sheng, Yi Jiang, Xiaoyang Fang, Dongdong Li, Jing
Zhang, Zhe Wang
- Abstract summary: This study proposed Subtype-Former, a deep learning method based on Transformer and Block.
We found that Subtype-Former can perform better on the benchmark datasets of more than 5000 tumors based on the survival analysis.
We identified 50 essential biomarkers, which can be used to study targeted cancer drugs.
- Score: 17.36619699329539
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Motivation: Cancer is heterogeneous, affecting the precise approach to
personalized treatment. Accurate subtyping can lead to better survival rates
for cancer patients. High-throughput technologies provide multiple omics data
for cancer subtyping. However, precise cancer subtyping remains challenging due
to the large amount and high dimensionality of omics data. Results: This study
proposed Subtype-Former, a deep learning method based on MLP and Transformer
Block, to extract the low-dimensional representation of the multi-omics data.
K-means and Consensus Clustering are also used to achieve accurate subtyping
results. We compared Subtype-Former with the other state-of-the-art subtyping
methods across the TCGA 10 cancer types. We found that Subtype-Former can
perform better on the benchmark datasets of more than 5000 tumors based on the
survival analysis. In addition, Subtype-Former also achieved outstanding
results in pan-cancer subtyping, which can help analyze the commonalities and
differences across various cancer types at the molecular level. Finally, we
applied Subtype-Former to the TCGA 10 types of cancers. We identified 50
essential biomarkers, which can be used to study targeted cancer drugs and
promote the development of cancer treatments in the era of precision medicine.
Related papers
- Improving Breast Cancer Grade Prediction with Multiparametric MRI Created Using Optimized Synthetic Correlated Diffusion Imaging [71.91773485443125]
Grading plays a vital role in breast cancer treatment planning.
The current tumor grading method involves extracting tissue from patients, leading to stress, discomfort, and high medical costs.
This paper examines using optimized CDI$s$ to improve breast cancer grade prediction.
arXiv Detail & Related papers (2024-05-13T15:48:26Z) - PACS: Prediction and analysis of cancer subtypes from multi-omics data
based on a multi-head attention mechanism model [2.275409158519155]
We propose a supervised multi-head attention mechanism model (SMA) to classify cancer subtypes successfully.
The attention mechanism and feature sharing module of the SMA model can successfully learn the global and local feature information of multi-omics data.
The SMA model achieves the highest accuracy, F1 macroscopic, F1 weighted, and accurate classification of cancer subtypes in simulated, single-cell, and cancer multiomics datasets.
arXiv Detail & Related papers (2023-08-21T03:54:21Z) - MoCLIM: Towards Accurate Cancer Subtyping via Multi-Omics Contrastive
Learning with Omics-Inference Modeling [9.900594964709116]
We develop MoCLIM, a representation learning framework for cancer subtyping.
We show that our approach significantly improves data fit and subtyping performance in fewer high-dimensional cancer instances.
Our framework incorporates various medical evaluations as the final component, providing high interpretability in medical analysis.
arXiv Detail & Related papers (2023-08-17T10:49:48Z) - Cancer-Net BCa-S: Breast Cancer Grade Prediction using Volumetric Deep
Radiomic Features from Synthetic Correlated Diffusion Imaging [82.74877848011798]
The prevalence of breast cancer continues to grow, affecting about 300,000 females in the United States in 2023.
The gold-standard Scarff-Bloom-Richardson (SBR) grade has been shown to consistently indicate a patient's response to chemotherapy.
In this paper, we study the efficacy of deep learning for breast cancer grading based on synthetic correlated diffusion (CDI$s$) imaging.
arXiv Detail & Related papers (2023-04-12T15:08:34Z) - A Multi-Institutional Open-Source Benchmark Dataset for Breast Cancer
Clinical Decision Support using Synthetic Correlated Diffusion Imaging Data [82.74877848011798]
Cancer-Net BCa is a multi-institutional open-source benchmark dataset of volumetric CDI$s$ imaging data of breast cancer patients.
Cancer-Net BCa is publicly available as a part of a global open-source initiative dedicated to accelerating advancement in machine learning to aid clinicians in the fight against cancer.
arXiv Detail & Related papers (2023-04-12T05:41:44Z) - Machine Learning Methods for Cancer Classification Using Gene Expression
Data: A Review [77.34726150561087]
Cancer is the second major cause of death after cardiovascular diseases.
Gene expression can play a fundamental role in the early detection of cancer.
This study reviews recent progress in gene expression analysis for cancer classification using machine learning methods.
arXiv Detail & Related papers (2023-01-28T15:03:03Z) - Automated Cancer Subtyping via Vector Quantization Mutual Information
Maximization [10.191396978971168]
We propose a novel clustering method for exploiting genetic expression profiles and distinguishing subtypes in an unsupervised manner.
Our method can refine existing controversial labels, and, by further medical analysis, this refinement is proven to have a high correlation with cancer survival rates.
arXiv Detail & Related papers (2022-06-22T01:55:08Z) - Gene selection from microarray expression data: A Multi-objective PSO
with adaptive K-nearest neighborhood [0.0]
This paper deals with the classification problem of human cancer diseases by using gene expression data.
It is presented a new methodology to analyze microarray datasets and efficiently classify cancer diseases.
arXiv Detail & Related papers (2022-05-27T04:22:10Z) - Cancer Subtyping via Embedded Unsupervised Learning on Transcriptomics
Data [5.232428469965068]
We propose to investigate automatic subtyping from an unsupervised learning perspective.
Specifically, we bypass the strong Gaussianity assumption that typically exists but fails in the unsupervised learning subtyping literature.
Our proposed method better captures the latent space features and models the cancer subtype manifestation on a molecular basis.
arXiv Detail & Related papers (2022-04-02T11:44:58Z) - Topological Data Analysis of copy number alterations in cancer [70.85487611525896]
We explore the potential to capture information contained in cancer genomic information using a novel topology-based approach.
We find that this technique has the potential to extract meaningful low-dimensional representations in cancer somatic genetic data.
arXiv Detail & Related papers (2020-11-22T17:31:23Z) - A Systematic Approach to Featurization for Cancer Drug Sensitivity
Predictions with Deep Learning [49.86828302591469]
We train >35,000 neural network models, sweeping over common featurization techniques.
We found the RNA-seq to be highly redundant and informative even with subsets larger than 128 features.
arXiv Detail & Related papers (2020-04-30T20:42:17Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.