Select-ProtoNet: Learning to Select for Few-Shot Disease Subtype
Prediction
- URL: http://arxiv.org/abs/2009.00792v2
- Date: Thu, 3 Sep 2020 15:50:22 GMT
- Title: Select-ProtoNet: Learning to Select for Few-Shot Disease Subtype
Prediction
- Authors: Ziyi Yang, Jun Shu, Yong Liang, Deyu Meng and Zongben Xu
- Abstract summary: We focus on few-shot disease subtype prediction problem, identifying subgroups of similar patients.
We introduce meta learning techniques to develop a new model, which can extract the common experience or knowledge from interrelated clinical tasks.
Our new model is built upon a carefully designed meta-learner, called Prototypical Network, that is a simple yet effective meta learning machine for few-shot image classification.
- Score: 55.94378672172967
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Current machine learning has made great progress on computer vision and many
other fields attributed to the large amount of high-quality training samples,
while it does not work very well on genomic data analysis, since they are
notoriously known as small data. In our work, we focus on few-shot disease
subtype prediction problem, identifying subgroups of similar patients that can
guide treatment decisions for a specific individual through training on small
data. In fact, doctors and clinicians always address this problem by studying
several interrelated clinical variables simultaneously. We attempt to simulate
such clinical perspective, and introduce meta learning techniques to develop a
new model, which can extract the common experience or knowledge from
interrelated clinical tasks and transfer it to help address new tasks. Our new
model is built upon a carefully designed meta-learner, called Prototypical
Network, that is a simple yet effective meta learning machine for few-shot
image classification. Observing that gene expression data have specifically
high dimensionality and high noise properties compared with image data, we
proposed a new extension of it by appending two modules to address these
issues. Concretely, we append a feature selection layer to automatically filter
out the disease-irrelated genes and incorporate a sample reweighting strategy
to adaptively remove noisy data, and meanwhile the extended model is capable of
learning from a limited number of training examples and generalize well.
Simulations and real gene expression data experiments substantiate the
superiority of the proposed method for predicting the subtypes of disease and
identifying potential disease-related genes.
Related papers
- An Autoencoder and Generative Adversarial Networks Approach for Multi-Omics Data Imbalanced Class Handling and Classification [2.2940141855172036]
In molecular biology, there has been an explosion of data generated from multi-omics sequencing.
Traditional statistical methods face challenging tasks when dealing with such high dimensional data.
This study, focused on tackling these challenges in a neural network that incorporates autoencoder to extract latent space of the features.
arXiv Detail & Related papers (2024-05-16T01:45:55Z) - Incomplete Multimodal Learning for Complex Brain Disorders Prediction [65.95783479249745]
We propose a new incomplete multimodal data integration approach that employs transformers and generative adversarial networks.
We apply our new method to predict cognitive degeneration and disease outcomes using the multimodal imaging genetic data from Alzheimer's Disease Neuroimaging Initiative cohort.
arXiv Detail & Related papers (2023-05-25T16:29:16Z) - Continual Learning with Bayesian Model based on a Fixed Pre-trained
Feature Extractor [55.9023096444383]
Current deep learning models are characterised by catastrophic forgetting of old knowledge when learning new classes.
Inspired by the process of learning new knowledge in human brains, we propose a Bayesian generative model for continual learning.
arXiv Detail & Related papers (2022-04-28T08:41:51Z) - LifeLonger: A Benchmark for Continual Disease Classification [59.13735398630546]
We introduce LifeLonger, a benchmark for continual disease classification on the MedMNIST collection.
Task and class incremental learning of diseases address the issue of classifying new samples without re-training the models from scratch.
Cross-domain incremental learning addresses the issue of dealing with datasets originating from different institutions while retaining the previously obtained knowledge.
arXiv Detail & Related papers (2022-04-12T12:25:05Z) - MG-NET: Leveraging Pseudo-Imaging for Multi-Modal Metagenome Analysis [5.04905391284093]
We propose MG-Net, a self-supervised representation learning framework.
We show that MG-Net can learn robust representations from unlabeled data.
Experiments show that the learned features outperform current baseline metagenome representations.
arXiv Detail & Related papers (2021-07-21T05:53:01Z) - Covid-19 Detection from Chest X-ray and Patient Metadata using Graph
Convolutional Neural Networks [6.420262246029286]
We propose a novel Graph Convolution Neural Network (GCN) that is capable of identifying bio-markers of Covid-19 pneumonia.
The proposed method exploits important relational knowledge between data instances and their features using graph representation and applies convolution to learn the graph data.
arXiv Detail & Related papers (2021-05-20T13:13:29Z) - Relational Subsets Knowledge Distillation for Long-tailed Retinal
Diseases Recognition [65.77962788209103]
We propose class subset learning by dividing the long-tailed data into multiple class subsets according to prior knowledge.
It enforces the model to focus on learning the subset-specific knowledge.
The proposed framework proved to be effective for the long-tailed retinal diseases recognition task.
arXiv Detail & Related papers (2021-04-22T13:39:33Z) - Active Selection of Classification Features [0.0]
Auxiliary data, such as demographics, might help in selecting a smaller sample that comprises the individuals with the most informative MRI scans.
We propose two utility-based approaches for this problem, and evaluate their performance on three public real-world benchmark datasets.
arXiv Detail & Related papers (2021-02-26T18:19:08Z) - Data-Driven Logistic Regression Ensembles With Applications in Genomics [0.0]
We propose a new approach for dealing with high-dimensional binary classification problems that combines ideas from regularization and ensembling.
We demonstrate the good performance of our method in terms of prediction accuracy and identification of key biomarkers using several medical datasets involving common diseases such as cancer, multiple sclerosis and psoriasis.
arXiv Detail & Related papers (2021-02-17T05:57:26Z) - Towards an Automatic Analysis of CHO-K1 Suspension Growth in
Microfluidic Single-cell Cultivation [63.94623495501023]
We propose a novel Machine Learning architecture, which allows us to infuse a neural deep network with human-powered abstraction on the level of data.
Specifically, we train a generative model simultaneously on natural and synthetic data, so that it learns a shared representation, from which a target variable, such as the cell count, can be reliably estimated.
arXiv Detail & Related papers (2020-10-20T08:36:51Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.