INSITE: labelling medical images using submodular functions and
semi-supervised data programming
- URL: http://arxiv.org/abs/2402.07173v1
- Date: Sun, 11 Feb 2024 12:02:00 GMT
- Title: INSITE: labelling medical images using submodular functions and
semi-supervised data programming
- Authors: Akshat Gautam, Anurag Shandilya, Akshit Srivastava, Venkatapathy
Subramanian, Ganesh Ramakrishnan, Kshitij Jadhav
- Abstract summary: Large amounts of labeled data to train deep models creates an implementation bottleneck in resource-constrained settings.
We apply informed subset selection to identify a small number of most representative or diverse images from a huge pool of unlabelled data.
The newly annotated images are then used as exemplars to develop several data programming-driven labeling functions.
- Score: 19.88996560236578
- License: http://creativecommons.org/licenses/by-nc-nd/4.0/
- Abstract: The necessity of large amounts of labeled data to train deep models,
especially in medical imaging creates an implementation bottleneck in
resource-constrained settings. In Insite (labelINg medical imageS usIng
submodular funcTions and sEmi-supervised data programming) we apply informed
subset selection to identify a small number of most representative or diverse
images from a huge pool of unlabelled data subsequently annotated by a domain
expert. The newly annotated images are then used as exemplars to develop
several data programming-driven labeling functions. These labelling functions
output a predicted-label and a similarity score when given an unlabelled image
as an input. A consensus is brought amongst the outputs of these labeling
functions by using a label aggregator function to assign the final predicted
label to each unlabelled data point. We demonstrate that informed subset
selection followed by semi-supervised data programming methods using these
images as exemplars perform better than other state-of-the-art semi-supervised
methods. Further, for the first time we demonstrate that this can be achieved
through a small set of images used as exemplars.
Related papers
- Leveraging Fixed and Dynamic Pseudo-labels for Semi-supervised Medical Image Segmentation [7.9449756510822915]
Semi-supervised medical image segmentation has gained growing interest due to its ability to utilize unannotated data.
The current state-of-the-art methods mostly rely on pseudo-labeling within a co-training framework.
We propose a novel approach where multiple pseudo-labels for the same unannotated image are used to learn from the unlabeled data.
arXiv Detail & Related papers (2024-05-12T11:30:01Z) - Pseudo Label-Guided Data Fusion and Output Consistency for
Semi-Supervised Medical Image Segmentation [9.93871075239635]
We propose the PLGDF framework, which builds upon the mean teacher network for segmenting medical images with less annotation.
We propose a novel pseudo-label utilization scheme, which combines labeled and unlabeled data to augment the dataset effectively.
Our framework yields superior performance compared to six state-of-the-art semi-supervised learning methods.
arXiv Detail & Related papers (2023-11-17T06:36:43Z) - Dual-Perspective Semantic-Aware Representation Blending for Multi-Label
Image Recognition with Partial Labels [70.36722026729859]
We propose a dual-perspective semantic-aware representation blending (DSRB) that blends multi-granularity category-specific semantic representation across different images.
The proposed DS consistently outperforms current state-of-the-art algorithms on all proportion label settings.
arXiv Detail & Related papers (2022-05-26T00:33:44Z) - Heterogeneous Semantic Transfer for Multi-label Recognition with Partial Labels [70.45813147115126]
Multi-label image recognition with partial labels (MLR-PL) may greatly reduce the cost of annotation and thus facilitate large-scale MLR.
We find that strong semantic correlations exist within each image and across different images.
These correlations can help transfer the knowledge possessed by the known labels to retrieve the unknown labels.
arXiv Detail & Related papers (2022-05-23T08:37:38Z) - OSSGAN: Open-Set Semi-Supervised Image Generation [26.67298827670573]
We introduce a challenging training scheme of conditional GANs, called open-set semi-supervised image generation.
OSSGAN provides decision clues to the discriminator on the basis of whether an unlabeled image belongs to one or none of the classes of interest.
The results of experiments on Tiny ImageNet and ImageNet show notable improvements over supervised BigGAN and semi-supervised methods.
arXiv Detail & Related papers (2022-04-29T17:26:09Z) - Semantic-Aware Representation Blending for Multi-Label Image Recognition
with Partial Labels [86.17081952197788]
We propose to blend category-specific representation across different images to transfer information of known labels to complement unknown labels.
Experiments on the MS-COCO, Visual Genome, Pascal VOC 2007 datasets show that the proposed SARB framework obtains superior performance over current leading competitors.
arXiv Detail & Related papers (2022-03-04T07:56:16Z) - SCIDA: Self-Correction Integrated Domain Adaptation from Single- to
Multi-label Aerial Images [30.12949142271464]
Most publicly available datasets for image classification are with single labels, while images are inherently multi-labeled in our daily life.
We propose a novel integrated domain adaptation (SCIDA) method for automatic multi-label learning.
SCIDA is weakly supervised, i.e., automatically learning the multi-label image classification model from using massive, publicly available single-label images.
arXiv Detail & Related papers (2021-08-15T20:38:02Z) - GuidedMix-Net: Learning to Improve Pseudo Masks Using Labeled Images as
Reference [153.354332374204]
We propose a novel method for semi-supervised semantic segmentation named GuidedMix-Net.
We first introduce a feature alignment objective between labeled and unlabeled data to capture potentially similar image pairs.
MITrans is shown to be a powerful knowledge module for further progressive refining features of unlabeled data.
Along with supervised learning for labeled data, the prediction of unlabeled data is jointly learned with the generated pseudo masks.
arXiv Detail & Related papers (2021-06-29T02:48:45Z) - Instance-Aware Graph Convolutional Network for Multi-Label
Classification [55.131166957803345]
Graph convolutional neural network (GCN) has effectively boosted the multi-label image recognition task.
We propose an instance-aware graph convolutional neural network (IA-GCN) framework for multi-label classification.
arXiv Detail & Related papers (2020-08-19T12:49:28Z) - ATSO: Asynchronous Teacher-Student Optimization for Semi-Supervised
Medical Image Segmentation [99.90263375737362]
We propose ATSO, an asynchronous version of teacher-student optimization.
ATSO partitions the unlabeled data into two subsets and alternately uses one subset to fine-tune the model and updates the label on the other subset.
We evaluate ATSO on two popular medical image segmentation datasets and show its superior performance in various semi-supervised settings.
arXiv Detail & Related papers (2020-06-24T04:05:12Z) - Generative Adversarial Data Programming [32.2164057862111]
We show how distant supervision signals in the form of labeling functions can be used to obtain labels for given data in near-constant time.
This framework is extended to different setups, including self-supervised labeled image generation, zero-shot text to labeled image generation, transfer learning, and multi-task learning.
arXiv Detail & Related papers (2020-04-30T07:06:44Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.