Improving Transferability for Domain Adaptive Detection Transformers
- URL: http://arxiv.org/abs/2204.14195v1
- Date: Fri, 29 Apr 2022 16:27:10 GMT
- Title: Improving Transferability for Domain Adaptive Detection Transformers
- Authors: Kaixiong Gong, Shuang Li, Shugang Li, Rui Zhang, Chi Harold Liu, Qiang
Chen
- Abstract summary: This paper aims to build a simple but effective baseline with a DETR-style detector on domain shift settings.
For one, mitigating the domain shift on the backbone and the decoder output features excels in getting favorable results.
For another, advanced domain alignment methods in both parts further enhance the performance.
- Score: 34.61314708197079
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: DETR-style detectors stand out amongst in-domain scenarios, but their
properties in domain shift settings are under-explored. This paper aims to
build a simple but effective baseline with a DETR-style detector on domain
shift settings based on two findings. For one, mitigating the domain shift on
the backbone and the decoder output features excels in getting favorable
results. For another, advanced domain alignment methods in both parts further
enhance the performance. Thus, we propose the Object-Aware Alignment (OAA)
module and the Optimal Transport based Alignment (OTA) module to achieve
comprehensive domain alignment on the outputs of the backbone and the detector.
The OAA module aligns the foreground regions identified by pseudo-labels in the
backbone outputs, leading to domain-invariant based features. The OTA module
utilizes sliced Wasserstein distance to maximize the retention of location
information while minimizing the domain gap in the decoder outputs. We
implement the findings and the alignment modules into our adaptation method,
and it benchmarks the DETR-style detector on the domain shift settings.
Experiments on various domain adaptive scenarios validate the effectiveness of
our method.
Related papers
- DATR: Unsupervised Domain Adaptive Detection Transformer with Dataset-Level Adaptation and Prototypical Alignment [7.768332621617199]
We introduce a strong DETR-based detector named Domain Adaptive detection TRansformer ( DATR) for unsupervised domain adaptation of object detection.
Our proposed DATR incorporates a mean-teacher based self-training framework, utilizing pseudo-labels generated by the teacher model to further mitigate domain bias.
Experiments demonstrate superior performance and generalization capabilities of our proposed DATR in multiple domain adaptation scenarios.
arXiv Detail & Related papers (2024-05-20T03:48:45Z) - Plug-and-Play Transformer Modules for Test-Time Adaptation [54.80435317208111]
We introduce PLUTO: a Plug-and-pLay modUlar Test-time domain adaptatiOn strategy.
We pre-train a large set of modules, each specialized for different source domains.
We harness multiple most-relevant source domains in a single inference call.
arXiv Detail & Related papers (2024-01-06T00:24:50Z) - PM-DETR: Domain Adaptive Prompt Memory for Object Detection with
Transformers [25.812325027602252]
Transformer-based detectors (i.e., DETR) have demonstrated impressive performance on end-to-end object detection.
transferring DETR to different data distributions may lead to a significant performance degradation.
We propose a hierarchical Prompt Domain Memory (PDM) for adapting detection transformers to different distributions.
arXiv Detail & Related papers (2023-07-01T12:02:24Z) - Cross-Domain Object Detection with Mean-Teacher Transformer [43.486392965014105]
We propose an end-to-end cross-domain detection transformer based on the mean teacher knowledge transfer (MTKT)
We design three levels of source-target feature alignment strategies based on the architecture of the Transformer, including domain query-based feature alignment (DQFA), bi-level-graph-based prototype alignment (BGPA) and token-wise image feature alignment (TIFA)
Our proposed method achieves state-of-the-art performance on three domain adaptation scenarios, especially the result of Sim10k to Cityscapes scenario is remarkably improved from 52.6 mAP to 57.9 mAP.
arXiv Detail & Related papers (2022-05-03T17:11:55Z) - Amplitude Spectrum Transformation for Open Compound Domain Adaptive
Semantic Segmentation [62.68759523116924]
Open compound domain adaptation (OCDA) has emerged as a practical adaptation setting.
We propose a novel feature space Amplitude Spectrum Transformation (AST)
arXiv Detail & Related papers (2022-02-09T05:40:34Z) - Seeking Similarities over Differences: Similarity-based Domain Alignment
for Adaptive Object Detection [86.98573522894961]
We propose a framework that generalizes the components commonly used by Unsupervised Domain Adaptation (UDA) algorithms for detection.
Specifically, we propose a novel UDA algorithm, ViSGA, that leverages the best design choices and introduces a simple but effective method to aggregate features at instance-level.
We show that both similarity-based grouping and adversarial training allows our model to focus on coarsely aligning feature groups, without being forced to match all instances across loosely aligned domains.
arXiv Detail & Related papers (2021-10-04T13:09:56Z) - Stagewise Unsupervised Domain Adaptation with Adversarial Self-Training
for Road Segmentation of Remote Sensing Images [93.50240389540252]
Road segmentation from remote sensing images is a challenging task with wide ranges of application potentials.
We propose a novel stagewise domain adaptation model called RoadDA to address the domain shift (DS) issue in this field.
Experiment results on two benchmarks demonstrate that RoadDA can efficiently reduce the domain gap and outperforms state-of-the-art methods.
arXiv Detail & Related papers (2021-08-28T09:29:14Z) - Exploring Sequence Feature Alignment for Domain Adaptive Detection
Transformers [141.70707071815653]
We propose a novel Sequence Feature Alignment (SFA) method that is specially designed for the adaptation of detection transformers.
SFA consists of a domain query-based feature alignment (DQFA) module and a token-wise feature alignment (TDA) module.
Experiments on three challenging benchmarks show that SFA outperforms state-of-the-art domain adaptive object detection methods.
arXiv Detail & Related papers (2021-07-27T07:17:12Z) - AFAN: Augmented Feature Alignment Network for Cross-Domain Object
Detection [90.18752912204778]
Unsupervised domain adaptation for object detection is a challenging problem with many real-world applications.
We propose a novel augmented feature alignment network (AFAN) which integrates intermediate domain image generation and domain-adversarial training.
Our approach significantly outperforms the state-of-the-art methods on standard benchmarks for both similar and dissimilar domain adaptations.
arXiv Detail & Related papers (2021-06-10T05:01:20Z) - Domain Adaptive Object Detection via Feature Separation and Alignment [11.4768983507572]
adversarial-based domain adaptive object detection (DAOD) methods have been developed rapidly.
We establish a Feature Separation and Alignment Network (FSANet) which consists of a gray-scale feature separation (GSFS) module, a local-global feature alignment (LGFA) module and a region-instance-level alignment (RILA) module.
Our FSANet achieves better performance on the target domain detection and surpasses the state-of-the-art methods.
arXiv Detail & Related papers (2020-12-16T01:44:34Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.