DETRs with Hybrid Matching
- URL: http://arxiv.org/abs/2207.13080v3
- Date: Tue, 16 May 2023 16:01:50 GMT
- Title: DETRs with Hybrid Matching
- Authors: Ding Jia and Yuhui Yuan and Haodi He and Xiaopei Wu and Haojun Yu and
Weihong Lin and Lei Sun and Chao Zhang and Han Hu
- Abstract summary: One-to-one set matching is a key design for DETR to establish its end-to-end capability.
We propose a hybrid matching scheme that combines the original one-to-one matching branch with an auxiliary one-to-many matching branch during training.
- Score: 21.63116788914251
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: One-to-one set matching is a key design for DETR to establish its end-to-end
capability, so that object detection does not require a hand-crafted NMS
(non-maximum suppression) to remove duplicate detections. This end-to-end
signature is important for the versatility of DETR, and it has been generalized
to broader vision tasks. However, we note that there are few queries assigned
as positive samples and the one-to-one set matching significantly reduces the
training efficacy of positive samples. We propose a simple yet effective method
based on a hybrid matching scheme that combines the original one-to-one
matching branch with an auxiliary one-to-many matching branch during training.
Our hybrid strategy has been shown to significantly improve accuracy. In
inference, only the original one-to-one match branch is used, thus maintaining
the end-to-end merit and the same inference efficiency of DETR. The method is
named H-DETR, and it shows that a wide range of representative DETR methods can
be consistently improved across a wide range of visual tasks, including
DeformableDETR, PETRv2, PETR, and TransTrack, among others. The code is
available at: https://github.com/HDETR
Related papers
- MS-DETR: Efficient DETR Training with Mixed Supervision [74.93329653526952]
MS-DETR places one-to-many supervision to the object queries of the primary decoder that is used for inference.
Our approach does not need additional decoder branches or object queries.
Experimental results show that our approach outperforms related DETR variants.
arXiv Detail & Related papers (2024-01-08T16:08:53Z) - Semi-DETR: Semi-Supervised Object Detection with Detection Transformers [105.45018934087076]
We analyze the DETR-based framework on semi-supervised object detection (SSOD)
We present Semi-DETR, the first transformer-based end-to-end semi-supervised object detector.
Our method outperforms all state-of-the-art methods by clear margins.
arXiv Detail & Related papers (2023-07-16T16:32:14Z) - FeatAug-DETR: Enriching One-to-Many Matching for DETRs with Feature
Augmentation [48.94488166162821]
One-to-one matching is a crucial design in DETR-like object detection frameworks.
We propose two methods that realize one-to-many matching from a different perspective of augmenting images or image features.
We conduct extensive experiments to evaluate the effectiveness of the proposed approach on DETR variants.
arXiv Detail & Related papers (2023-03-02T18:59:48Z) - Semantic-Aligned Matching for Enhanced DETR Convergence and Multi-Scale
Feature Fusion [95.7732308775325]
The proposed DEtection TRansformer (DETR) has established a fully end-to-end paradigm for object detection.
DETR suffers from slow training convergence, which hinders its applicability to various detection tasks.
We design Semantic-Aligned-Matching DETR++ to accelerate DETR's convergence and improve detection performance.
arXiv Detail & Related papers (2022-07-28T15:34:29Z) - Group DETR: Fast DETR Training with Group-Wise One-to-Many Assignment [80.55064790937092]
One-to-many assignment, assigning one ground-truth object to multiple predictions, succeeds in detection methods such as Faster R-CNN and FCOS.
We introduce Group DETR, a simple yet efficient DETR training approach that introduces a group-wise way for one-to-many assignment.
Experiments show that Group DETR significantly speeds up the training convergence and improves the performance of various DETR-based models.
arXiv Detail & Related papers (2022-07-26T17:57:58Z) - Accelerating DETR Convergence via Semantic-Aligned Matching [50.3633635846255]
This paper presents SAM-DETR, a Semantic-Aligned-Matching DETR that greatly accelerates DETR's convergence without sacrificing its accuracy.
It explicitly searches salient points with the most discriminative features for semantic-aligned matching, which further speeds up the convergence and boosts detection accuracy as well.
arXiv Detail & Related papers (2022-03-14T06:50:51Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.