D2Q-DETR: Decoupling and Dynamic Queries for Oriented Object Detection
with Transformers
- URL: http://arxiv.org/abs/2303.00542v1
- Date: Wed, 1 Mar 2023 14:36:19 GMT
- Title: D2Q-DETR: Decoupling and Dynamic Queries for Oriented Object Detection
with Transformers
- Authors: Qiang Zhou, Chaohui Yu, Zhibin Wang, Fan Wang
- Abstract summary: We propose an end-to-end framework for oriented object detection.
Our framework is based on DETR, with the box regression head replaced with a points prediction head.
Experiments on the largest and challenging DOTA-v1.0 and DOTA-v1.5 datasets show that D2Q-DETR outperforms existing NMS-based and NMS-free oriented object detection methods.
- Score: 14.488821968433834
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Despite the promising results, existing oriented object detection methods
usually involve heuristically designed rules, e.g., RRoI generation, rotated
NMS. In this paper, we propose an end-to-end framework for oriented object
detection, which simplifies the model pipeline and obtains superior
performance. Our framework is based on DETR, with the box regression head
replaced with a points prediction head. The learning of points is more
flexible, and the distribution of points can reflect the angle and size of the
target rotated box. We further propose to decouple the query features into
classification and regression features, which significantly improves the model
precision. Aerial images usually contain thousands of instances. To better
balance model precision and efficiency, we propose a novel dynamic query
design, which reduces the number of object queries in stacked decoder layers
without sacrificing model performance. Finally, we rethink the label assignment
strategy of existing DETR-like detectors and propose an effective label
re-assignment strategy for improved performance. We name our method D2Q-DETR.
Experiments on the largest and challenging DOTA-v1.0 and DOTA-v1.5 datasets
show that D2Q-DETR outperforms existing NMS-based and NMS-free oriented object
detection methods and achieves the new state-of-the-art.
Related papers
- SEED: A Simple and Effective 3D DETR in Point Clouds [72.74016394325675]
We argue that the main challenges are challenging due to the high sparsity and uneven distribution of point clouds.
We propose a simple and effective 3D DETR method (SEED) for detecting 3D objects from point clouds.
arXiv Detail & Related papers (2024-07-15T14:21:07Z) - DQ-DETR: DETR with Dynamic Query for Tiny Object Detection [29.559819542066236]
We present a simple yet effective model, named DQ-DETR, which consists of three different components.
DQ-DETR uses the prediction and density maps from the categorical counting module to dynamically adjust the number of object queries.
Our model outperforms previous CNN-based and DETR-like methods, achieving state-of-the-art mAP 30.2% on the AI-TOD-V2 dataset.
arXiv Detail & Related papers (2024-04-04T15:10:24Z) - Innovative Horizons in Aerial Imagery: LSKNet Meets DiffusionDet for
Advanced Object Detection [55.2480439325792]
We present an in-depth evaluation of an object detection model that integrates the LSKNet backbone with the DiffusionDet head.
The proposed model achieves a mean average precision (MAP) of approximately 45.7%, which is a significant improvement.
This advancement underscores the effectiveness of the proposed modifications and sets a new benchmark in aerial image analysis.
arXiv Detail & Related papers (2023-11-21T19:49:13Z) - DEYO: DETR with YOLO for Step-by-Step Object Detection [0.0]
This paper proposes a new two-stage object detection model, named DETR with YOLO (DEYO)
The first stage provides high-quality query and anchor feeding into the second stage, improving the performance and efficiency of the second stage compared to the original DETR model.
Experiments demonstrate that DEYO attains 50.6 AP and 52.1 AP in 12 and 36 epochs, respectively.
arXiv Detail & Related papers (2022-11-12T06:36:17Z) - Plug-and-Play Few-shot Object Detection with Meta Strategy and Explicit
Localization Inference [78.41932738265345]
This paper proposes a plug detector that can accurately detect the objects of novel categories without fine-tuning process.
We introduce two explicit inferences into the localization process to reduce its dependence on annotated data.
It shows a significant lead in both efficiency, precision, and recall under varied evaluation protocols.
arXiv Detail & Related papers (2021-10-26T03:09:57Z) - MRDet: A Multi-Head Network for Accurate Oriented Object Detection in
Aerial Images [51.227489316673484]
We propose an arbitrary-oriented region proposal network (AO-RPN) to generate oriented proposals transformed from horizontal anchors.
To obtain accurate bounding boxes, we decouple the detection task into multiple subtasks and propose a multi-head network.
Each head is specially designed to learn the features optimal for the corresponding task, which allows our network to detect objects accurately.
arXiv Detail & Related papers (2020-12-24T06:36:48Z) - End-to-End Object Detection with Transformers [88.06357745922716]
We present a new method that views object detection as a direct set prediction problem.
Our approach streamlines the detection pipeline, effectively removing the need for many hand-designed components.
The main ingredients of the new framework, called DEtection TRansformer or DETR, are a set-based global loss.
arXiv Detail & Related papers (2020-05-26T17:06:38Z) - Dynamic Refinement Network for Oriented and Densely Packed Object
Detection [75.29088991850958]
We present a dynamic refinement network that consists of two novel components, i.e., a feature selection module (FSM) and a dynamic refinement head (DRH)
Our FSM enables neurons to adjust receptive fields in accordance with the shapes and orientations of target objects, whereas the DRH empowers our model to refine the prediction dynamically in an object-aware manner.
We perform quantitative evaluations on several publicly available benchmarks including DOTA, HRSC2016, SKU110K, and our own SKU110K-R dataset.
arXiv Detail & Related papers (2020-05-20T11:35:50Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.