SimROD: A Simple Adaptation Method for Robust Object Detection
- URL: http://arxiv.org/abs/2107.13389v1
- Date: Wed, 28 Jul 2021 14:28:32 GMT
- Title: SimROD: A Simple Adaptation Method for Robust Object Detection
- Authors: Rindra Ramamonjison, Amin Banitalebi-Dehkordi, Xinyu Kang, Xiaolong
Bai, Yong Zhang
- Abstract summary: This paper presents a simple and effective unsupervised adaptation method for Robust Object Detection (SimROD)
Our method integrates a novel domain-centric augmentation method, a gradual self-labeling adaptation procedure, and a teacher-guided fine-tuning mechanism.
When applied to image corruptions and high-level cross-domain adaptation benchmarks, our method outperforms prior baselines.
- Score: 8.307942341807152
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: This paper presents a Simple and effective unsupervised adaptation method for
Robust Object Detection (SimROD). To overcome the challenging issues of domain
shift and pseudo-label noise, our method integrates a novel domain-centric
augmentation method, a gradual self-labeling adaptation procedure, and a
teacher-guided fine-tuning mechanism. Using our method, target domain samples
can be leveraged to adapt object detection models without changing the model
architecture or generating synthetic data. When applied to image corruptions
and high-level cross-domain adaptation benchmarks, our method outperforms prior
baselines on multiple domain adaptation benchmarks. SimROD achieves new
state-of-the-art on standard real-to-synthetic and cross-camera setup
benchmarks. On the image corruption benchmark, models adapted with our method
achieved a relative robustness improvement of 15-25% AP50 on Pascal-C and 5-6%
AP on COCO-C and Cityscapes-C. On the cross-domain benchmark, our method
outperformed the best baseline performance by up to 8% AP50 on Comic dataset
and up to 4% on Watercolor dataset.
Related papers
- Align and Distill: Unifying and Improving Domain Adaptive Object Detection [16.879446142847392]
Domain adaptive object detection (DAOD) methods have recently demonstrated strong results on addressing this challenge.
We identify systemic benchmarking pitfalls that call past results into question and hamper further progress.
We address these problems by introducing: A unified benchmarking and implementation framework, Align and Distill (ALDI)
A new benchmark dataset, CFC-DAOD, enabling evaluation on diverse real-world data, and (4) A new method, ALDI++, that achieves state-of-the-art results by a large margin.
arXiv Detail & Related papers (2024-03-18T17:58:02Z) - What, How, and When Should Object Detectors Update in Continually
Changing Test Domains? [34.13756022890991]
Test-time adaptation algorithms have been proposed to adapt the model online while inferring test data.
We propose a novel online adaption approach for object detection in continually changing test domains.
Our approach surpasses baselines on widely used benchmarks, achieving improvements of up to 4.9%p and 7.9%p in mAP.
arXiv Detail & Related papers (2023-12-12T07:13:08Z) - Informative Data Mining for One-Shot Cross-Domain Semantic Segmentation [84.82153655786183]
We propose a novel framework called Informative Data Mining (IDM) to enable efficient one-shot domain adaptation for semantic segmentation.
IDM provides an uncertainty-based selection criterion to identify the most informative samples, which facilitates quick adaptation and reduces redundant training.
Our approach outperforms existing methods and achieves a new state-of-the-art one-shot performance of 56.7%/55.4% on the GTA5/SYNTHIA to Cityscapes adaptation tasks.
arXiv Detail & Related papers (2023-09-25T15:56:01Z) - AR-TTA: A Simple Method for Real-World Continual Test-Time Adaptation [1.4530711901349282]
We propose to validate test-time adaptation methods using datasets for autonomous driving, namely CLAD-C and SHIFT.
We observe that current test-time adaptation methods struggle to effectively handle varying degrees of domain shift.
We enhance the well-established self-training framework by incorporating a small memory buffer to increase model stability.
arXiv Detail & Related papers (2023-09-18T19:34:23Z) - Benchmarking Test-Time Adaptation against Distribution Shifts in Image
Classification [77.0114672086012]
Test-time adaptation (TTA) is a technique aimed at enhancing the generalization performance of models by leveraging unlabeled samples solely during prediction.
We present a benchmark that systematically evaluates 13 prominent TTA methods and their variants on five widely used image classification datasets.
arXiv Detail & Related papers (2023-07-06T16:59:53Z) - Universal Domain Adaptation from Foundation Models: A Baseline Study [58.51162198585434]
We make empirical studies of state-of-the-art UniDA methods using foundation models.
We introduce textitCLIP distillation, a parameter-free method specifically designed to distill target knowledge from CLIP models.
Although simple, our method outperforms previous approaches in most benchmark tasks.
arXiv Detail & Related papers (2023-05-18T16:28:29Z) - Few-Shot Adaptation of Pre-Trained Networks for Domain Shift [17.123505029637055]
Deep networks are prone to performance degradation when there is a domain shift between the source (training) data and target (test) data.
Recent test-time adaptation methods update batch normalization layers of pre-trained source models deployed in new target environments with streaming data to mitigate such performance degradation.
We propose a framework for few-shot domain adaptation to address the practical challenges of data-efficient adaptation.
arXiv Detail & Related papers (2022-05-30T16:49:59Z) - Learning Instance-Specific Adaptation for Cross-Domain Segmentation [79.61787982393238]
We propose a test-time adaptation method for cross-domain image segmentation.
Given a new unseen instance at test time, we adapt a pre-trained model by conducting instance-specific BatchNorm calibration.
arXiv Detail & Related papers (2022-03-30T17:59:45Z) - Frequency Spectrum Augmentation Consistency for Domain Adaptive Object
Detection [107.52026281057343]
We introduce a Frequency Spectrum Augmentation Consistency (FSAC) framework with four different low-frequency filter operations.
In the first stage, we utilize all the original and augmented source data to train an object detector.
In the second stage, augmented source and target data with pseudo labels are adopted to perform the self-training for prediction consistency.
arXiv Detail & Related papers (2021-12-16T04:07:01Z) - Learning Meta Face Recognition in Unseen Domains [74.69681594452125]
We propose a novel face recognition method via meta-learning named Meta Face Recognition (MFR)
MFR synthesizes the source/target domain shift with a meta-optimization objective.
We propose two benchmarks for generalized face recognition evaluation.
arXiv Detail & Related papers (2020-03-17T14:10:30Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.