Deep Cervix Model Development from Heterogeneous and Partially Labeled
Image Datasets
- URL: http://arxiv.org/abs/2201.07013v1
- Date: Tue, 18 Jan 2022 14:24:50 GMT
- Title: Deep Cervix Model Development from Heterogeneous and Partially Labeled
Image Datasets
- Authors: Anabik Pal, Zhiyun Xue and Sameer Antani
- Abstract summary: Cervical cancer is the fourth most common cancer in women worldwide.
There are a wide variety of cervical inspection objectives which impact the labeling criteria for criteria-specific prediction model development.
Motivated by these challenges, we propose a self-supervised learning (SSL) based approach to produce a pre-trained cervix model from unlabeled cervical images.
- Score: 0.0
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Cervical cancer is the fourth most common cancer in women worldwide. The
availability of a robust automated cervical image classification system can
augment the clinical care provider's limitation in traditional visual
inspection with acetic acid (VIA). However, there are a wide variety of
cervical inspection objectives which impact the labeling criteria for
criteria-specific prediction model development. Moreover, due to the lack of
confirmatory test results and inter-rater labeling variation, many images are
left unlabeled.
Motivated by these challenges, we propose a self-supervised learning (SSL)
based approach to produce a pre-trained cervix model from unlabeled cervical
images. The developed model is further fine-tuned to produce criteria-specific
classification models with the available labeled images. We demonstrate the
effectiveness of the proposed approach using two cervical image datasets. Both
datasets are partially labeled and labeling criteria are different. The
experimental results show that the SSL-based initialization improves
classification performance (Accuracy: 2.5% min) and the inclusion of images
from both datasets during SSL further improves the performance (Accuracy: 1.5%
min). Further, considering data-sharing restrictions, we experimented with the
effectiveness of Federated SSL and find that it can improve performance over
the SSL model developed with just its images. This justifies the importance of
SSL-based cervix model development. We believe that the present research shows
a novel direction in developing criteria-specific custom deep models for
cervical image classification by combining images from different sources
unlabeled and/or labeled with varying criteria, and addressing image access
restrictions.
Related papers
- SPLAL: Similarity-based pseudo-labeling with alignment loss for
semi-supervised medical image classification [11.435826510575879]
Semi-supervised learning (SSL) methods can mitigate challenges by leveraging both labeled and unlabeled data.
SSL methods for medical image classification need to address two key challenges: (1) estimating reliable pseudo-labels for the images in the unlabeled dataset and (2) reducing biases caused by class imbalance.
In this paper, we propose a novel SSL approach, SPLAL, that effectively addresses these challenges.
arXiv Detail & Related papers (2023-07-10T14:53:24Z) - Performance of GAN-based augmentation for deep learning COVID-19 image
classification [57.1795052451257]
The biggest challenge in the application of deep learning to the medical domain is the availability of training data.
Data augmentation is a typical methodology used in machine learning when confronted with a limited data set.
In this work, a StyleGAN2-ADA model of Generative Adversarial Networks is trained on the limited COVID-19 chest X-ray image set.
arXiv Detail & Related papers (2023-04-18T15:39:58Z) - Rethinking Semi-Supervised Medical Image Segmentation: A
Variance-Reduction Perspective [51.70661197256033]
We propose ARCO, a semi-supervised contrastive learning framework with stratified group theory for medical image segmentation.
We first propose building ARCO through the concept of variance-reduced estimation and show that certain variance-reduction techniques are particularly beneficial in pixel/voxel-level segmentation tasks.
We experimentally validate our approaches on eight benchmarks, i.e., five 2D/3D medical and three semantic segmentation datasets, with different label settings.
arXiv Detail & Related papers (2023-02-03T13:50:25Z) - Information Gain Sampling for Active Learning in Medical Image
Classification [3.1619162190378787]
This work presents an information-theoretic active learning framework that guides the optimal selection of images from the unlabelled pool to be labeled.
Experiments are performed on two different medical image classification datasets.
arXiv Detail & Related papers (2022-08-01T16:25:53Z) - Harmonizing Pathological and Normal Pixels for Pseudo-healthy Synthesis [68.5287824124996]
We present a new type of discriminator, the segmentor, to accurately locate the lesions and improve the visual quality of pseudo-healthy images.
We apply the generated images into medical image enhancement and utilize the enhanced results to cope with the low contrast problem.
Comprehensive experiments on the T2 modality of BraTS demonstrate that the proposed method substantially outperforms the state-of-the-art methods.
arXiv Detail & Related papers (2022-03-29T08:41:17Z) - A New Semi-supervised Learning Benchmark for Classifying View and
Diagnosing Aortic Stenosis from Echocardiograms [4.956777496509955]
We develop a benchmark dataset to assess semi-supervised approaches to two tasks relevant to cardiac ultrasound (echocardiogram) interpretation.
We find that a state-of-the-art method called MixMatch achieves promising gains in heldout accuracy on both tasks.
We pursue patient-level diagnosis prediction, which requires aggregating across hundreds of images of diverse view types.
arXiv Detail & Related papers (2021-07-30T21:08:12Z) - Malignancy Prediction and Lesion Identification from Clinical
Dermatological Images [65.1629311281062]
We consider machine-learning-based malignancy prediction and lesion identification from clinical dermatological images.
We first identify all lesions present in the image regardless of sub-type or likelihood of malignancy, then it estimates their likelihood of malignancy, and through aggregation, it also generates an image-level likelihood of malignancy.
arXiv Detail & Related papers (2021-04-02T20:52:05Z) - Many-to-One Distribution Learning and K-Nearest Neighbor Smoothing for
Thoracic Disease Identification [83.6017225363714]
deep learning has become the most powerful computer-aided diagnosis technology for improving disease identification performance.
For chest X-ray imaging, annotating large-scale data requires professional domain knowledge and is time-consuming.
In this paper, we propose many-to-one distribution learning (MODL) and K-nearest neighbor smoothing (KNNS) methods to improve a single model's disease identification performance.
arXiv Detail & Related papers (2021-02-26T02:29:30Z) - Deep Metric Learning-based Image Retrieval System for Chest Radiograph
and its Clinical Applications in COVID-19 [12.584626589565522]
Chest radiograph (CXR) has been playing a crucial role in COVID-19 patient triaging, diagnosing and monitoring.
Considering the mixed and unspecific signals in CXR, an image retrieval model of CXR that provides both similar images and associated clinical information can be more clinically meaningful.
In this work we develop a novel CXR image retrieval model based on deep metric learning.
arXiv Detail & Related papers (2020-11-26T03:16:48Z) - Semi-supervised Medical Image Classification with Relation-driven
Self-ensembling Model [71.80319052891817]
We present a relation-driven semi-supervised framework for medical image classification.
It exploits the unlabeled data by encouraging the prediction consistency of given input under perturbations.
Our method outperforms many state-of-the-art semi-supervised learning methods on both single-label and multi-label image classification scenarios.
arXiv Detail & Related papers (2020-05-15T06:57:54Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.