Cancer Subtyping via Embedded Unsupervised Learning on Transcriptomics
Data
- URL: http://arxiv.org/abs/2204.02278v1
- Date: Sat, 2 Apr 2022 11:44:58 GMT
- Title: Cancer Subtyping via Embedded Unsupervised Learning on Transcriptomics
Data
- Authors: Ziwei Yang, Lingwei Zhu, Zheng Chen, Ming Huang, Naoaki Ono, MD
Altaf-Ul-Amin, Shigehiko Kanaya
- Abstract summary: We propose to investigate automatic subtyping from an unsupervised learning perspective.
Specifically, we bypass the strong Gaussianity assumption that typically exists but fails in the unsupervised learning subtyping literature.
Our proposed method better captures the latent space features and models the cancer subtype manifestation on a molecular basis.
- Score: 5.232428469965068
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Cancer is one of the deadliest diseases worldwide. Accurate diagnosis and
classification of cancer subtypes are indispensable for effective clinical
treatment. Promising results on automatic cancer subtyping systems have been
published recently with the emergence of various deep learning methods.
However, such automatic systems often overfit the data due to the high
dimensionality and scarcity. In this paper, we propose to investigate automatic
subtyping from an unsupervised learning perspective by directly constructing
the underlying data distribution itself, hence sufficient data can be generated
to alleviate the issue of overfitting. Specifically, we bypass the strong
Gaussianity assumption that typically exists but fails in the unsupervised
learning subtyping literature due to small-sized samples by vector
quantization. Our proposed method better captures the latent space features and
models the cancer subtype manifestation on a molecular basis, as demonstrated
by the extensive experimental results.
Related papers
- PACS: Prediction and analysis of cancer subtypes from multi-omics data
based on a multi-head attention mechanism model [2.275409158519155]
We propose a supervised multi-head attention mechanism model (SMA) to classify cancer subtypes successfully.
The attention mechanism and feature sharing module of the SMA model can successfully learn the global and local feature information of multi-omics data.
The SMA model achieves the highest accuracy, F1 macroscopic, F1 weighted, and accurate classification of cancer subtypes in simulated, single-cell, and cancer multiomics datasets.
arXiv Detail & Related papers (2023-08-21T03:54:21Z) - Self-omics: A Self-supervised Learning Framework for Multi-omics Cancer
Data [4.843654097048771]
Self-Supervised Learning (SSL) methods are typically used to deal with limited labelled data.
We develop a novel pre-training paradigm that consists of various SSL components.
Our approach outperforms the state-of-the-art method in cancer type classification on the TCGA pan-cancer dataset.
arXiv Detail & Related papers (2022-10-03T11:20:12Z) - Subtype-Former: a deep learning approach for cancer subtype discovery
with multi-omics data [17.36619699329539]
This study proposed Subtype-Former, a deep learning method based on Transformer and Block.
We found that Subtype-Former can perform better on the benchmark datasets of more than 5000 tumors based on the survival analysis.
We identified 50 essential biomarkers, which can be used to study targeted cancer drugs.
arXiv Detail & Related papers (2022-07-28T08:15:06Z) - Cancer Subtyping by Improved Transcriptomic Features Using Vector
Quantized Variational Autoencoder [10.835673227875615]
We propose Vector Quantized Variational AutoEncoder (VQ-VAE) to tackle the data issues and extract informative latent features that are crucial to the quality of subsequent clustering.
VQ-VAE does not impose strict assumptions and hence its latent features are better representations of the input, capable of yielding superior clustering performance with any mainstream clustering method.
arXiv Detail & Related papers (2022-07-20T09:47:53Z) - Automated Cancer Subtyping via Vector Quantization Mutual Information
Maximization [10.191396978971168]
We propose a novel clustering method for exploiting genetic expression profiles and distinguishing subtypes in an unsupervised manner.
Our method can refine existing controversial labels, and, by further medical analysis, this refinement is proven to have a high correlation with cancer survival rates.
arXiv Detail & Related papers (2022-06-22T01:55:08Z) - Multi-class versus One-class classifier in spontaneous speech analysis
oriented to Alzheimer Disease diagnosis [58.720142291102135]
The aim of our project is to contribute to earlier diagnosis of AD and better estimates of its severity by using automatic analysis performed through new biomarkers extracted from speech signal.
The use of information about outlier and Fractal Dimension features improves the system performance.
arXiv Detail & Related papers (2022-03-21T09:57:20Z) - Deep Semi-supervised Metric Learning with Dual Alignment for Cervical
Cancer Cell Detection [49.78612417406883]
We propose a novel semi-supervised deep metric learning method for cervical cancer cell detection.
Our model learns an embedding metric space and conducts dual alignment of semantic features on both the proposal and prototype levels.
We construct a large-scale dataset for semi-supervised cervical cancer cell detection for the first time, consisting of 240,860 cervical cell images.
arXiv Detail & Related papers (2021-04-07T17:11:27Z) - Select-ProtoNet: Learning to Select for Few-Shot Disease Subtype
Prediction [55.94378672172967]
We focus on few-shot disease subtype prediction problem, identifying subgroups of similar patients.
We introduce meta learning techniques to develop a new model, which can extract the common experience or knowledge from interrelated clinical tasks.
Our new model is built upon a carefully designed meta-learner, called Prototypical Network, that is a simple yet effective meta learning machine for few-shot image classification.
arXiv Detail & Related papers (2020-09-02T02:50:30Z) - Manifolds for Unsupervised Visual Anomaly Detection [79.22051549519989]
Unsupervised learning methods that don't necessarily encounter anomalies in training would be immensely useful.
We develop a novel hyperspherical Variational Auto-Encoder (VAE) via stereographic projections with a gyroplane layer.
We present state-of-the-art results on visual anomaly benchmarks in precision manufacturing and inspection, demonstrating real-world utility in industrial AI scenarios.
arXiv Detail & Related papers (2020-06-19T20:41:58Z) - A Systematic Approach to Featurization for Cancer Drug Sensitivity
Predictions with Deep Learning [49.86828302591469]
We train >35,000 neural network models, sweeping over common featurization techniques.
We found the RNA-seq to be highly redundant and informative even with subsets larger than 128 features.
arXiv Detail & Related papers (2020-04-30T20:42:17Z) - Unsupervised Anomaly Detection with Adversarial Mirrored AutoEncoders [51.691585766702744]
We propose a variant of Adversarial Autoencoder which uses a mirrored Wasserstein loss in the discriminator to enforce better semantic-level reconstruction.
We put forward an alternative measure of anomaly score to replace the reconstruction-based metric.
Our method outperforms the current state-of-the-art methods for anomaly detection on several OOD detection benchmarks.
arXiv Detail & Related papers (2020-03-24T08:26:58Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.