MUM : Mix Image Tiles and UnMix Feature Tiles for Semi-Supervised Object
Detection
- URL: http://arxiv.org/abs/2111.10958v1
- Date: Mon, 22 Nov 2021 02:46:27 GMT
- Title: MUM : Mix Image Tiles and UnMix Feature Tiles for Semi-Supervised Object
Detection
- Authors: JongMok Kim, Jooyoung Jang, Seunghyeon Seo, Jisoo Jeong, Jongkeun Na,
Nojun Kwak
- Abstract summary: We introduce a simple yet effective data augmentation method, Mix/UnMix (MUM), which unmixes feature tiles for the mixed image tiles for the semi-supervised learning (SSL) framework.
Our proposed method makes mixed input image tiles and reconstructs them in the feature space. Thus, MUM can enjoy the recent-regularization effect from non-interpolated pseudo-labels and successfully generate a meaningful weak-strong pair.
- Score: 26.032596415721947
- License: http://creativecommons.org/licenses/by-nc-nd/4.0/
- Abstract: Many recent semi-supervised learning (SSL) studies build teacher-student
architecture and train the student network by the generated supervisory signal
from the teacher. Data augmentation strategy plays a significant role in the
SSL framework since it is hard to create a weak-strong augmented input pair
without losing label information. Especially when extending SSL to
semi-supervised object detection (SSOD), many strong augmentation methodologies
related to image geometry and interpolation-regularization are hard to utilize
since they possibly hurt the location information of the bounding box in the
object detection task. To address this, we introduce a simple yet effective
data augmentation method, Mix/UnMix (MUM), which unmixes feature tiles for the
mixed image tiles for the SSOD framework. Our proposed method makes mixed input
image tiles and reconstructs them in the feature space. Thus, MUM can enjoy the
interpolation-regularization effect from non-interpolated pseudo-labels and
successfully generate a meaningful weak-strong pair. Furthermore, MUM can be
easily equipped on top of various SSOD methods. Extensive experiments on
MS-COCO and PASCAL VOC datasets demonstrate the superiority of MUM by
consistently improving the mAP performance over the baseline in all the tested
SSOD benchmark protocols.
Related papers
- GeoMix: Towards Geometry-Aware Data Augmentation [76.09914619612812]
Mixup has shown considerable success in mitigating the challenges posed by limited labeled data in image classification.
We propose Geometric Mixup (GeoMix), a simple and interpretable Mixup approach leveraging in-place graph editing.
arXiv Detail & Related papers (2024-07-15T12:58:04Z) - Self-Supervised Neuron Segmentation with Multi-Agent Reinforcement
Learning [53.00683059396803]
Mask image model (MIM) has been widely used due to its simplicity and effectiveness in recovering original information from masked images.
We propose a decision-based MIM that utilizes reinforcement learning (RL) to automatically search for optimal image masking ratio and masking strategy.
Our approach has a significant advantage over alternative self-supervised methods on the task of neuron segmentation.
arXiv Detail & Related papers (2023-10-06T10:40:46Z) - Supervised Stochastic Neighbor Embedding Using Contrastive Learning [4.560284382063488]
Clusters of samples belonging to the same class are pulled together in low-dimensional embedding space.
We extend the self-supervised contrastive approach to the fully-supervised setting, allowing us to effectively leverage label information.
arXiv Detail & Related papers (2023-09-15T00:26:21Z) - L-DAWA: Layer-wise Divergence Aware Weight Aggregation in Federated
Self-Supervised Visual Representation Learning [14.888569402903562]
Integration of self-supervised learning (SSL) and federated learning (FL) into one coherent system can potentially offer data privacy guarantees.
We propose a new aggregation strategy termed Layer-wise Divergence Aware Weight Aggregation (L-DAWA) to mitigate the influence of client bias and divergence during FL aggregation.
arXiv Detail & Related papers (2023-07-14T15:07:30Z) - MA2CL:Masked Attentive Contrastive Learning for Multi-Agent
Reinforcement Learning [128.19212716007794]
We propose an effective framework called textbfMulti-textbfAgent textbfMasked textbfAttentive textbfContrastive textbfLearning (MA2CL)
MA2CL encourages learning representation to be both temporal and agent-level predictive by reconstructing the masked agent observation in latent space.
Our method significantly improves the performance and sample efficiency of different MARL algorithms and outperforms other methods in various vision-based and state-based scenarios.
arXiv Detail & Related papers (2023-06-03T05:32:19Z) - Hierarchical Supervision and Shuffle Data Augmentation for 3D
Semi-Supervised Object Detection [90.32180043449263]
State-of-the-art 3D object detectors are usually trained on large-scale datasets with high-quality 3D annotations.
A natural remedy is to adopt semi-supervised learning (SSL) by leveraging a limited amount of labeled samples and abundant unlabeled samples.
This paper introduces a novel approach of Hierarchical Supervision and Shuffle Data Augmentation (HSSDA), which is a simple yet effective teacher-student framework.
arXiv Detail & Related papers (2023-04-04T02:09:32Z) - De-coupling and De-positioning Dense Self-supervised Learning [65.56679416475943]
Dense Self-Supervised Learning (SSL) methods address the limitations of using image-level feature representations when handling images with multiple objects.
We show that they suffer from coupling and positional bias, which arise from the receptive field increasing with layer depth and zero-padding.
We demonstrate the benefits of our method on COCO and on a new challenging benchmark, OpenImage-MINI, for object classification, semantic segmentation, and object detection.
arXiv Detail & Related papers (2023-03-29T18:07:25Z) - Supervision Interpolation via LossMix: Generalizing Mixup for Object
Detection and Beyond [10.25372189905226]
LossMix is a simple yet versatile and effective regularization that enhances the performance and robustness of object detectors.
Empirical results on the PASCAL VOC and MS COCO datasets demonstrate that LossMix can consistently outperform state-of-the-art methods for detection.
arXiv Detail & Related papers (2023-03-18T06:13:30Z) - Boosting Weakly Supervised Object Detection via Learning Bounding Box
Adjusters [76.36104006511684]
Weakly-supervised object detection (WSOD) has emerged as an inspiring recent topic to avoid expensive instance-level object annotations.
We defend the problem setting for improving localization performance by leveraging the bounding box regression knowledge from a well-annotated auxiliary dataset.
Our method performs favorably against state-of-the-art WSOD methods and knowledge transfer model with similar problem setting.
arXiv Detail & Related papers (2021-08-03T13:38:20Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.