Automated Cancer Subtyping via Vector Quantization Mutual Information
Maximization
- URL: http://arxiv.org/abs/2206.10801v1
- Date: Wed, 22 Jun 2022 01:55:08 GMT
- Title: Automated Cancer Subtyping via Vector Quantization Mutual Information
Maximization
- Authors: Zheng Chen, Lingwei Zhu, Ziwei Yang, Takashi Matsubara
- Abstract summary: We propose a novel clustering method for exploiting genetic expression profiles and distinguishing subtypes in an unsupervised manner.
Our method can refine existing controversial labels, and, by further medical analysis, this refinement is proven to have a high correlation with cancer survival rates.
- Score: 10.191396978971168
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Cancer subtyping is crucial for understanding the nature of tumors and
providing suitable therapy. However, existing labelling methods are medically
controversial, and have driven the process of subtyping away from teaching
signals. Moreover, cancer genetic expression profiles are high-dimensional,
scarce, and have complicated dependence, thereby posing a serious challenge to
existing subtyping models for outputting sensible clustering. In this study, we
propose a novel clustering method for exploiting genetic expression profiles
and distinguishing subtypes in an unsupervised manner. The proposed method
adaptively learns categorical correspondence from latent representations of
expression profiles to the subtypes output by the model. By maximizing the
problem -- agnostic mutual information between input expression profiles and
output subtypes, our method can automatically decide a suitable number of
subtypes. Through experiments, we demonstrate that our proposed method can
refine existing controversial labels, and, by further medical analysis, this
refinement is proven to have a high correlation with cancer survival rates.
Related papers
- Seeing Unseen: Discover Novel Biomedical Concepts via
Geometry-Constrained Probabilistic Modeling [53.7117640028211]
We present a geometry-constrained probabilistic modeling treatment to resolve the identified issues.
We incorporate a suite of critical geometric properties to impose proper constraints on the layout of constructed embedding space.
A spectral graph-theoretic method is devised to estimate the number of potential novel classes.
arXiv Detail & Related papers (2024-03-02T00:56:05Z) - Subtype-Former: a deep learning approach for cancer subtype discovery
with multi-omics data [17.36619699329539]
This study proposed Subtype-Former, a deep learning method based on Transformer and Block.
We found that Subtype-Former can perform better on the benchmark datasets of more than 5000 tumors based on the survival analysis.
We identified 50 essential biomarkers, which can be used to study targeted cancer drugs.
arXiv Detail & Related papers (2022-07-28T08:15:06Z) - Cancer Subtyping by Improved Transcriptomic Features Using Vector
Quantized Variational Autoencoder [10.835673227875615]
We propose Vector Quantized Variational AutoEncoder (VQ-VAE) to tackle the data issues and extract informative latent features that are crucial to the quality of subsequent clustering.
VQ-VAE does not impose strict assumptions and hence its latent features are better representations of the input, capable of yielding superior clustering performance with any mainstream clustering method.
arXiv Detail & Related papers (2022-07-20T09:47:53Z) - Cancer Subtyping via Embedded Unsupervised Learning on Transcriptomics
Data [5.232428469965068]
We propose to investigate automatic subtyping from an unsupervised learning perspective.
Specifically, we bypass the strong Gaussianity assumption that typically exists but fails in the unsupervised learning subtyping literature.
Our proposed method better captures the latent space features and models the cancer subtype manifestation on a molecular basis.
arXiv Detail & Related papers (2022-04-02T11:44:58Z) - Multi-class versus One-class classifier in spontaneous speech analysis
oriented to Alzheimer Disease diagnosis [58.720142291102135]
The aim of our project is to contribute to earlier diagnosis of AD and better estimates of its severity by using automatic analysis performed through new biomarkers extracted from speech signal.
The use of information about outlier and Fractal Dimension features improves the system performance.
arXiv Detail & Related papers (2022-03-21T09:57:20Z) - DeepGene Transformer: Transformer for the gene expression-based classification of cancer subtypes [5.179504118679301]
Cancer and its subtypes constitute approximately 30% of all causes of death globally.
DeepGene Transformer is proposed which addresses the complexity of high-dimensional gene expression with a multi-head self-attention module.
arXiv Detail & Related papers (2021-08-26T15:02:55Z) - G-MIND: An End-to-End Multimodal Imaging-Genetics Framework for
Biomarker Identification and Disease Classification [49.53651166356737]
We propose a novel deep neural network architecture to integrate imaging and genetics data, as guided by diagnosis, that provides interpretable biomarkers.
We have evaluated our model on a population study of schizophrenia that includes two functional MRI (fMRI) paradigms and Single Nucleotide Polymorphism (SNP) data.
arXiv Detail & Related papers (2021-01-27T19:28:04Z) - Topological Data Analysis of copy number alterations in cancer [70.85487611525896]
We explore the potential to capture information contained in cancer genomic information using a novel topology-based approach.
We find that this technique has the potential to extract meaningful low-dimensional representations in cancer somatic genetic data.
arXiv Detail & Related papers (2020-11-22T17:31:23Z) - Select-ProtoNet: Learning to Select for Few-Shot Disease Subtype
Prediction [55.94378672172967]
We focus on few-shot disease subtype prediction problem, identifying subgroups of similar patients.
We introduce meta learning techniques to develop a new model, which can extract the common experience or knowledge from interrelated clinical tasks.
Our new model is built upon a carefully designed meta-learner, called Prototypical Network, that is a simple yet effective meta learning machine for few-shot image classification.
arXiv Detail & Related papers (2020-09-02T02:50:30Z) - Unsupervised Feature Selection for Tumor Profiles using Autoencoders and
Kernel Methods [1.9078991171384014]
This work aims to learn meaningful and low dimensional representations of tumor samples and find tumor subtype clusters.
The proposed method named Latent Kernel Feature Selection (LKFS) is an unsupervised approach for gene selection in tumor gene expression profiles.
arXiv Detail & Related papers (2020-07-12T21:59:05Z) - Semi-supervised Medical Image Classification with Relation-driven
Self-ensembling Model [71.80319052891817]
We present a relation-driven semi-supervised framework for medical image classification.
It exploits the unlabeled data by encouraging the prediction consistency of given input under perturbations.
Our method outperforms many state-of-the-art semi-supervised learning methods on both single-label and multi-label image classification scenarios.
arXiv Detail & Related papers (2020-05-15T06:57:54Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.