Scene Clustering Based Pseudo-labeling Strategy for Multi-modal Aerial
View Object Classification
- URL: http://arxiv.org/abs/2205.01920v1
- Date: Wed, 4 May 2022 07:13:01 GMT
- Title: Scene Clustering Based Pseudo-labeling Strategy for Multi-modal Aerial
View Object Classification
- Authors: Jun Yu, Hao Chang, Keda Lu, Liwen Zhang, Shenshen Du
- Abstract summary: Fine-grained data, class imbalance and various shooting conditions preclude the representational ability of general image classification.
By exploiting these properties, we propose Scene Clustering Based Pseudo-labeling Strategy ( SCP-Label)
SCP-Label brings greater accuracy by assigning the same label to objects within the same scene.
- Score: 22.4510980822444
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Multi-modal aerial view object classification (MAVOC) in Automatic target
recognition (ATR), although an important and challenging problem, has been
under studied. This paper firstly finds that fine-grained data, class imbalance
and various shooting conditions preclude the representational ability of
general image classification. Moreover, the MAVOC dataset has scene aggregation
characteristics. By exploiting these properties, we propose Scene Clustering
Based Pseudo-labeling Strategy (SCP-Label), a simple yet effective method to
employ in post-processing. The SCP-Label brings greater accuracy by assigning
the same label to objects within the same scene while also mitigating bias and
confusion with model ensembles. Its performance surpasses the official baseline
by a large margin of +20.57% Accuracy on Track 1 (SAR), and +31.86% Accuracy on
Track 2 (SAR+EO), demonstrating the potential of SCP-Label as post-processing.
Finally, we win the championship both on Track1 and Track2 in the CVPR 2022
Perception Beyond the Visible Spectrum (PBVS) Workshop MAVOC Challenge. Our
code is available at https://github.com/HowieChangchn/SCP-Label.
Related papers
- Label Filling via Mixed Supervision for Medical Image Segmentation from Noisy Annotations [22.910649758574852]
We propose a simple yet effective Label Filling framework, termed as LF-Net.
It predicts the groundtruth segmentation label given only noisy annotations during training.
Results on five datasets show that our LF-Net boosts segmentation accuracy in all datasets compared with state-of-the-art methods.
arXiv Detail & Related papers (2024-10-21T14:36:36Z) - Pseudo-labelling meets Label Smoothing for Noisy Partial Label Learning [8.387189407144403]
Partial label learning (PLL) is a weakly-supervised learning paradigm where each training instance is paired with a set of candidate labels (partial label)
NPLL relaxes this constraint by allowing some partial labels to not contain the true label, enhancing the practicality of the problem.
We present a minimalistic framework that initially assigns pseudo-labels to images by exploiting the noisy partial labels through a weighted nearest neighbour algorithm.
arXiv Detail & Related papers (2024-02-07T13:32:47Z) - Weakly Supervised Semantic Segmentation for Driving Scenes [27.0285166404621]
State-of-the-art techniques in weakly-supervised semantic segmentation (WSSS) exhibit severe performance degradation on driving scene datasets.
We develop a new WSSS framework tailored to driving scene datasets.
arXiv Detail & Related papers (2023-12-21T08:16:26Z) - TagCLIP: A Local-to-Global Framework to Enhance Open-Vocabulary
Multi-Label Classification of CLIP Without Training [29.431698321195814]
Contrastive Language-Image Pre-training (CLIP) has demonstrated impressive capabilities in open-vocabulary classification.
CLIP shows poor performance on multi-label datasets because the global feature tends to be dominated by the most prominent class.
We propose a local-to-global framework to obtain image tags.
arXiv Detail & Related papers (2023-12-20T08:15:40Z) - Pseudo-label Alignment for Semi-supervised Instance Segmentation [67.9616087910363]
Pseudo-labeling is significant for semi-supervised instance segmentation.
In existing pipelines, pseudo-labels that contain valuable information may be filtered out due to mismatches in class and mask quality.
We propose a novel framework, called pseudo-label aligning instance segmentation (PAIS), in this paper.
arXiv Detail & Related papers (2023-08-10T05:56:53Z) - Distilling Self-Supervised Vision Transformers for Weakly-Supervised
Few-Shot Classification & Segmentation [58.03255076119459]
We address the task of weakly-supervised few-shot image classification and segmentation, by leveraging a Vision Transformer (ViT)
Our proposed method takes token representations from the self-supervised ViT and leverages their correlations, via self-attention, to produce classification and segmentation predictions.
Experiments on Pascal-5i and COCO-20i demonstrate significant performance gains in a variety of supervision settings.
arXiv Detail & Related papers (2023-07-07T06:16:43Z) - Multi-Granularity Denoising and Bidirectional Alignment for Weakly
Supervised Semantic Segmentation [75.32213865436442]
We propose an end-to-end multi-granularity denoising and bidirectional alignment (MDBA) model to alleviate the noisy label and multi-class generalization issues.
The MDBA model can reach the mIoU of 69.5% and 70.2% on validation and test sets for the PASCAL VOC 2012 dataset.
arXiv Detail & Related papers (2023-05-09T03:33:43Z) - Ambiguity-Resistant Semi-Supervised Learning for Dense Object Detection [98.66771688028426]
We propose a Ambiguity-Resistant Semi-supervised Learning (ARSL) for one-stage detectors.
Joint-Confidence Estimation (JCE) is proposed to quantifies the classification and localization quality of pseudo labels.
ARSL effectively mitigates the ambiguities and achieves state-of-the-art SSOD performance on MS COCO and PASCAL VOC.
arXiv Detail & Related papers (2023-03-27T07:46:58Z) - Image Understands Point Cloud: Weakly Supervised 3D Semantic
Segmentation via Association Learning [59.64695628433855]
We propose a novel cross-modality weakly supervised method for 3D segmentation, incorporating complementary information from unlabeled images.
Basically, we design a dual-branch network equipped with an active labeling strategy, to maximize the power of tiny parts of labels.
Our method even outperforms the state-of-the-art fully supervised competitors with less than 1% actively selected annotations.
arXiv Detail & Related papers (2022-09-16T07:59:04Z) - Masked Unsupervised Self-training for Zero-shot Image Classification [98.23094305347709]
Masked Unsupervised Self-Training (MUST) is a new approach which leverages two different and complimentary sources of supervision: pseudo-labels and raw images.
MUST improves upon CLIP by a large margin and narrows the performance gap between unsupervised and supervised classification.
arXiv Detail & Related papers (2022-06-07T02:03:06Z) - Threshold Matters in WSSS: Manipulating the Activation for the Robust
and Accurate Segmentation Model Against Thresholds [16.6833745997519]
Weakly-supervised semantic segmentation (WSSS) has recently gained much attention for its promise to train segmentation models only with image-level labels.
Existing WSSS methods commonly argue that the sparse coverage of CAM incurs the performance bottleneck of WSSS.
This paper provides analytical and empirical evidence that the actual bottleneck may not be sparse coverage but a global thresholding scheme applied after CAM.
arXiv Detail & Related papers (2022-03-30T04:26:14Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.