Tackling the Problem of Limited Data and Annotations in Semantic
Segmentation
- URL: http://arxiv.org/abs/2007.07357v1
- Date: Tue, 14 Jul 2020 21:11:11 GMT
- Title: Tackling the Problem of Limited Data and Annotations in Semantic
Segmentation
- Authors: Ahmadreza Jeddi
- Abstract summary: To tackle the problem of limited data annotations in image segmentation, different pre-trained models and CRF-based methods are applied.
To this end, RotNet, DeeperCluster, and Semi&Weakly Supervised Learning (SWSL) pre-trained models are transferred and fine-tuned in a DeepLab-v2 baseline.
The results of my study show that, on this small dataset, using a pre-trained ResNet50 SWSL model gives results that are 7.4% better than applying an ImageNet pre-trained model.
- Score: 1.0152838128195467
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: In this work, the case of semantic segmentation on a small image dataset
(simulated by 1000 randomly selected images from PASCAL VOC 2012), where only
weak supervision signals (scribbles from user interaction) are available, is
studied. In particular, to tackle the problem of limited data annotations in
image segmentation, different pre-trained models are transferred and CRF-based
methods are applied to enhance the segmentation performance. To this end, RotNet,
DeeperCluster, and Semi&Weakly Supervised Learning (SWSL) pre-trained models
are transferred and fine-tuned in a DeepLab-v2 baseline, and dense CRF is
applied both as a post-processing step and as a loss regularization technique. The
results of my study show that, on this small dataset, using a pre-trained
ResNet50 SWSL model gives results that are 7.4% better than applying an
ImageNet pre-trained model; moreover, for the case of training on the full
PASCAL VOC 2012 training data, this pre-training approach increases the mIoU
results by almost 4%. On the other hand, dense CRF is shown to be very
effective as well, enhancing the results both as a loss regularization
technique in weakly supervised training and as a post-processing tool.
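As a rough illustration of the transfer-and-fine-tune setup described above (a minimal sketch, not the paper's actual code), the snippet below loads the SWSL ResNet50 weights from Facebook's public torch.hub release and attaches a plain 1x1 classifier head in place of the full DeepLab-v2 decoder. The class count, learning rates, and the ignore_index used to mask unlabeled (non-scribble) pixels are assumptions chosen for the example; if the hub entry is unavailable, an ImageNet-pretrained torchvision ResNet50 is a drop-in substitute.

```python
import torch
import torch.nn as nn

NUM_CLASSES = 21  # PASCAL VOC 2012: 20 object classes + background

# SWSL-pretrained ResNet50 (semi- and weakly-supervised ImageNet pre-training)
backbone = torch.hub.load(
    'facebookresearch/semi-supervised-ImageNet1K-models', 'resnet50_swsl')

# Drop the global pooling and fully connected layers to keep spatial feature maps
features = nn.Sequential(*list(backbone.children())[:-2])  # (N, 2048, H/32, W/32)

class SimpleSegNet(nn.Module):
    """SWSL ResNet50 features + 1x1 classifier, upsampled to input resolution.

    A stand-in for the DeepLab-v2 head used in the paper; no ASPP module here.
    """
    def __init__(self, features, num_classes):
        super().__init__()
        self.features = features
        self.classifier = nn.Conv2d(2048, num_classes, kernel_size=1)

    def forward(self, x):
        size = x.shape[-2:]
        x = self.classifier(self.features(x))
        return nn.functional.interpolate(
            x, size=size, mode='bilinear', align_corners=False)

model = SimpleSegNet(features, NUM_CLASSES)

# Fine-tune: smaller learning rate on the pre-trained backbone than on the head
optimizer = torch.optim.SGD([
    {'params': model.features.parameters(), 'lr': 1e-4},
    {'params': model.classifier.parameters(), 'lr': 1e-3},
], momentum=0.9, weight_decay=5e-4)

# Scribble supervision: only annotated pixels contribute to the loss; pixels
# without scribble labels are marked with ignore_index (255 is an assumption).
criterion = nn.CrossEntropyLoss(ignore_index=255)
```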
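The dense CRF post-processing step can be sketched with the pydensecrf package roughly as follows; the pairwise kernel widths and compatibility weights are common defaults, not the paper's tuned values.

```python
import numpy as np
import pydensecrf.densecrf as dcrf
from pydensecrf.utils import unary_from_softmax

def crf_refine(image, probs, iters=5):
    """Refine per-pixel class probabilities with a fully connected (dense) CRF.

    image : HxWx3 uint8 RGB image
    probs : CxHxW float32 softmax output of the segmentation network
    """
    C, H, W = probs.shape
    d = dcrf.DenseCRF2D(W, H, C)

    # Unary potentials come straight from the network's softmax scores
    d.setUnaryEnergy(unary_from_softmax(probs))

    # Pairwise terms: a smoothness (Gaussian) kernel plus an appearance
    # (bilateral) kernel that encourages label agreement between pixels
    # with similar color and position.
    d.addPairwiseGaussian(sxy=3, compat=3)
    d.addPairwiseBilateral(sxy=80, srgb=13,
                           rgbim=np.ascontiguousarray(image), compat=10)

    Q = d.inference(iters)
    return np.argmax(np.array(Q), axis=0).reshape(H, W)
```

In the loss-regularization variant mentioned in the abstract, a CRF-based term is added to the training objective rather than being applied only at inference time.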
Related papers
- Enhancing pretraining efficiency for medical image segmentation via transferability metrics [0.0]
In medical image segmentation tasks, the scarcity of labeled training data poses a significant challenge.
We introduce a novel transferability metric, based on contrastive learning, that measures how robustly a pretrained model is able to represent the target data.
arXiv Detail & Related papers (2024-10-24T12:11:52Z)
- Unsupervised Pre-training with Language-Vision Prompts for Low-Data Instance Segmentation [105.23631749213729]
We propose a novel method for unsupervised pre-training in low-data regimes.
Inspired by the recently successful prompting technique, we introduce a new method, Unsupervised Pre-training with Language-Vision Prompts.
We show that our method can converge faster and perform better than CNN-based models in low-data regimes.
arXiv Detail & Related papers (2024-05-22T06:48:43Z)
- A Large-scale Evaluation of Pretraining Paradigms for the Detection of Defects in Electroluminescence Solar Cell Images [3.729242965449096]
This work is a large-scale evaluation and benchmarking of various pretraining methods for Solar Cell Defect Detection.
We cover supervised training with semantic segmentation, semi-supervised learning, and two self-supervised techniques.
We achieve a new state-of-the-art for SCDD and demonstrate that certain pretraining schemes result in superior performance on underrepresented classes.
arXiv Detail & Related papers (2024-02-27T15:37:15Z)
- Boosting Visual-Language Models by Exploiting Hard Samples [126.35125029639168]
HELIP is a cost-effective strategy tailored to enhance the performance of existing CLIP models.
Our method allows for effortless integration with existing models' training pipelines.
On comprehensive benchmarks, HELIP consistently boosts existing models to achieve leading performance.
arXiv Detail & Related papers (2023-05-09T07:00:17Z)
- Self-Supervised Pre-Training for Transformer-Based Person Re-Identification [54.55281692768765]
Transformer-based supervised pre-training achieves strong performance in person re-identification (ReID).
Due to the domain gap between ImageNet and ReID datasets, it usually needs a larger pre-training dataset to boost the performance.
This work aims to mitigate the gap between the pre-training and ReID datasets from the perspective of data and model structure.
arXiv Detail & Related papers (2021-11-23T18:59:08Z)
- Regularizing Generative Adversarial Networks under Limited Data [88.57330330305535]
This work proposes a regularization approach for training robust GAN models on limited data.
We show a connection between the regularized loss and an f-divergence called LeCam-divergence, which we find is more robust under limited training data.
arXiv Detail & Related papers (2021-04-07T17:59:06Z)
- Contrastive Learning for Label-Efficient Semantic Segmentation [44.10416030868873]
Convolutional Neural Network (CNN) based semantic segmentation approaches have achieved impressive results by using large amounts of labeled data.
Deep CNNs trained with the de facto cross-entropy loss can easily overfit to small amounts of labeled data.
We propose a simple and effective contrastive learning-based training strategy in which we first pretrain the network using a pixel-wise, label-based contrastive loss.
arXiv Detail & Related papers (2020-12-13T07:05:39Z)
- Efficient Conditional Pre-training for Transfer Learning [71.01129334495553]
We propose efficient filtering methods to select relevant subsets from the pre-training dataset.
We validate our techniques by pre-training on ImageNet in both the unsupervised and supervised settings.
We improve standard ImageNet pre-training by 1-3% by tuning available models on our subsets and pre-training on a dataset filtered from a larger-scale dataset.
arXiv Detail & Related papers (2020-11-20T06:16:15Z)
- SML: Semantic Meta-learning for Few-shot Semantic Segmentation [27.773396307292497]
We propose a novel meta-learning framework, Semantic Meta-Learning, which incorporates class-level semantic descriptions in the generated prototypes for this problem.
In addition, we propose to use the well established technique, ridge regression, to not only bring in the class-level semantic information, but also to effectively utilise the information available from multiple images present in the training data for prototype computation.
arXiv Detail & Related papers (2020-09-14T18:26:46Z)
- Cheaper Pre-training Lunch: An Efficient Paradigm for Object Detection [86.0580214485104]
We propose a general and efficient pre-training paradigm, Montage pre-training, for object detection.
Montage pre-training needs only the target detection dataset while taking only 1/4 computational resources compared to the widely adopted ImageNet pre-training.
The efficiency and effectiveness of Montage pre-training are validated by extensive experiments on the MS-COCO dataset.
arXiv Detail & Related papers (2020-04-25T16:09:46Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of its content (including all information) and is not responsible for any consequences.