DeepMCAT: Large-Scale Deep Clustering for Medical Image Categorization
- URL: http://arxiv.org/abs/2110.00109v1
- Date: Thu, 30 Sep 2021 22:39:57 GMT
- Title: DeepMCAT: Large-Scale Deep Clustering for Medical Image Categorization
- Authors: Turkay Kart, Wenjia Bai, Ben Glocker and Daniel Rueckert
- Abstract summary: We propose an unsupervised approach for automatically clustering and categorizing large-scale medical image datasets.
We investigated the end-to-end training using both class-balanced and imbalanced large-scale datasets.
- Score: 24.100651548850895
- License: http://creativecommons.org/licenses/by-nc-sa/4.0/
- Abstract: In recent years, the research landscape of machine learning in medical
imaging has changed drastically from supervised to semi-, weakly- or
unsupervised methods. This is mainly due to the fact that ground-truth labels
are time-consuming and expensive to obtain manually. Generating labels from
patient metadata might be feasible but it suffers from user-originated errors
which introduce biases. In this work, we propose an unsupervised approach for
automatically clustering and categorizing large-scale medical image datasets,
with a focus on cardiac MR images, and without using any labels. We
investigated the end-to-end training using both class-balanced and imbalanced
large-scale datasets. Our method was able to create clusters with high purity
and achieved over 0.99 cluster purity on these datasets. The results
demonstrate the potential of the proposed method for categorizing unstructured
large medical databases, such as organizing clinical PACS systems in hospitals.
Related papers
- Multi-task Explainable Skin Lesion Classification [54.76511683427566]
We propose a few-shot-based approach for skin lesions that generalizes well with few labelled data.
The proposed approach comprises a fusion of a segmentation network that acts as an attention module and classification network.
arXiv Detail & Related papers (2023-10-11T05:49:47Z) - Dynamic Conceptional Contrastive Learning for Generalized Category
Discovery [76.82327473338734]
Generalized category discovery (GCD) aims to automatically cluster partially labeled data.
Unlabeled data contain instances that are not only from known categories of the labeled data but also from novel categories.
One effective way for GCD is applying self-supervised learning to learn discriminate representation for unlabeled data.
We propose a Dynamic Conceptional Contrastive Learning framework, which can effectively improve clustering accuracy.
arXiv Detail & Related papers (2023-03-30T14:04:39Z) - VAESim: A probabilistic approach for self-supervised prototype discovery [0.23624125155742057]
We propose an architecture for image stratification based on a conditional variational autoencoder.
We use a continuous latent space to represent the continuum of disorders and find clusters during training, which can then be used for image/patient stratification.
We demonstrate that our method outperforms baselines in terms of kNN accuracy measured on a classification task against a standard VAE.
arXiv Detail & Related papers (2022-09-25T17:55:31Z) - PCA: Semi-supervised Segmentation with Patch Confidence Adversarial
Training [52.895952593202054]
We propose a new semi-supervised adversarial method called Patch Confidence Adrial Training (PCA) for medical image segmentation.
PCA learns the pixel structure and context information in each patch to get enough gradient feedback, which aids the discriminator in convergent to an optimal state.
Our method outperforms the state-of-the-art semi-supervised methods, which demonstrates its effectiveness for medical image segmentation.
arXiv Detail & Related papers (2022-07-24T07:45:47Z) - Self-Supervised Learning as a Means To Reduce the Need for Labeled Data
in Medical Image Analysis [64.4093648042484]
We use a dataset of chest X-ray images with bounding box labels for 13 different classes of anomalies.
We show that it is possible to achieve similar performance to a fully supervised model in terms of mean average precision and accuracy with only 60% of the labeled data.
arXiv Detail & Related papers (2022-06-01T09:20:30Z) - 3N-GAN: Semi-Supervised Classification of X-Ray Images with a 3-Player
Adversarial Framework [0.0]
We propose 3N-GAN, or 3 Network Generative Adversarial Networks, to perform semi-supervised classification of medical images.
Preliminary results show improved classification performance and GAN generations over various algorithms.
arXiv Detail & Related papers (2021-09-22T23:18:59Z) - A Real Use Case of Semi-Supervised Learning for Mammogram Classification
in a Local Clinic of Costa Rica [0.5541644538483946]
Training a deep learning model requires a considerable amount of labeled images.
A number of publicly available datasets have been built with data from different hospitals and clinics.
The use of the semi-supervised deep learning approach known as MixMatch, to leverage the usage of unlabeled data is proposed and evaluated.
arXiv Detail & Related papers (2021-07-24T22:26:50Z) - Deep Semi-Supervised Embedded Clustering (DSEC) for Stratification of
Heart Failure Patients [50.48904066814385]
In this work we apply deep semi-supervised embedded clustering to determine data-driven patient subgroups of heart failure.
We find clinically relevant clusters from an embedded space derived from heterogeneous data.
The proposed algorithm can potentially find new undiagnosed subgroups of patients that have different outcomes.
arXiv Detail & Related papers (2020-12-24T12:56:46Z) - Towards Robust Partially Supervised Multi-Structure Medical Image
Segmentation on Small-Scale Data [123.03252888189546]
We propose Vicinal Labels Under Uncertainty (VLUU) to bridge the methodological gaps in partially supervised learning (PSL) under data scarcity.
Motivated by multi-task learning and vicinal risk minimization, VLUU transforms the partially supervised problem into a fully supervised problem by generating vicinal labels.
Our research suggests a new research direction in label-efficient deep learning with partial supervision.
arXiv Detail & Related papers (2020-11-28T16:31:00Z) - Semi-supervised and Unsupervised Methods for Heart Sounds Classification
in Restricted Data Environments [4.712158833534046]
This study uses various supervised, semi-supervised and unsupervised approaches on the PhysioNet/CinC 2016 Challenge dataset.
A GAN based semi-supervised method is proposed, which allows the usage of unlabelled data samples to boost the learning of data distribution.
In particular, the unsupervised feature extraction using 1D CNN Autoencoder coupled with one-class SVM obtains good performance without any data labelling.
arXiv Detail & Related papers (2020-06-04T02:07:35Z) - The GraphNet Zoo: An All-in-One Graph Based Deep Semi-Supervised
Framework for Medical Image Classification [0.0]
We consider the problem of classifying a medical image dataset when we have a limited amount of labels.
Using semi-supervised learning, one can produce accurate classifications using a significantly reduced amount of labelled data.
We propose an all-in-one framework for deep semi-supervised classification focusing on graph based approaches.
arXiv Detail & Related papers (2020-03-13T19:18:21Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.