MRDet: A Multi-Head Network for Accurate Oriented Object Detection in
Aerial Images
- URL: http://arxiv.org/abs/2012.13135v1
- Date: Thu, 24 Dec 2020 06:36:48 GMT
- Title: MRDet: A Multi-Head Network for Accurate Oriented Object Detection in
Aerial Images
- Authors: Ran Qin and Qingjie Liu and Guangshuai Gao and Di Huang and Yunhong
Wang
- Abstract summary: We propose an arbitrary-oriented region proposal network (AO-RPN) to generate oriented proposals transformed from horizontal anchors.
To obtain accurate bounding boxes, we decouple the detection task into multiple subtasks and propose a multi-head network.
Each head is specially designed to learn the features optimal for the corresponding task, which allows our network to detect objects accurately.
- Score: 51.227489316673484
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Objects in aerial images usually have arbitrary orientations and are densely
located over the ground, making them extremely challenge to be detected. Many
recently developed methods attempt to solve these issues by estimating an extra
orientation parameter and placing dense anchors, which will result in high
model complexity and computational costs. In this paper, we propose an
arbitrary-oriented region proposal network (AO-RPN) to generate oriented
proposals transformed from horizontal anchors. The AO-RPN is very efficient
with only a few amounts of parameters increase than the original RPN.
Furthermore, to obtain accurate bounding boxes, we decouple the detection task
into multiple subtasks and propose a multi-head network to accomplish them.
Each head is specially designed to learn the features optimal for the
corresponding task, which allows our network to detect objects accurately. We
name it MRDet short for Multi-head Rotated object Detector for convenience. We
test the proposed MRDet on two challenging benchmarks, i.e., DOTA and HRSC2016,
and compare it with several state-of-the-art methods. Our method achieves very
promising results which clearly demonstrate its effectiveness.
Related papers
- Renormalized Connection for Scale-preferred Object Detection in Satellite Imagery [51.83786195178233]
We design a Knowledge Discovery Network (KDN) to implement the renormalization group theory in terms of efficient feature extraction.
Renormalized connection (RC) on the KDN enables synergistic focusing'' of multi-scale features.
RCs extend the multi-level feature's divide-and-conquer'' mechanism of the FPN-based detectors to a wide range of scale-preferred tasks.
arXiv Detail & Related papers (2024-09-09T13:56:22Z) - PETDet: Proposal Enhancement for Two-Stage Fine-Grained Object Detection [26.843891792018447]
We present PETDet (Proposal Enhancement for Two-stage fine-grained object detection) to better handle the sub-tasks in two-stage FGOD methods.
An anchor-free Quality Oriented Proposal Network (QOPN) is proposed with dynamic label assignment and attention-based decomposition.
A novel Adaptive Recognition Loss (ARL) offers guidance for the R-CNN head to focus on high-quality proposals.
arXiv Detail & Related papers (2023-12-16T18:04:56Z) - Efficient Decoder for End-to-End Oriented Object Detection in Remote Sensing Images [26.37802649901314]
We propose an end-to-end oriented detector equipped with an efficient decoder.
Rotated RoI attention and Selective Distinct Queries (SDQ) are proposed.
Our method achieves state-of-the-art performance on DIOR-R (67.31% mAP), DOTA-v1.5 (67.43% mAP), and DOTA-v2.0 (53.28% mAP) with the ResNet50 backbone.
arXiv Detail & Related papers (2023-11-29T13:43:17Z) - Small Object Detection via Coarse-to-fine Proposal Generation and
Imitation Learning [52.06176253457522]
We propose a two-stage framework tailored for small object detection based on the Coarse-to-fine pipeline and Feature Imitation learning.
CFINet achieves state-of-the-art performance on the large-scale small object detection benchmarks, SODA-D and SODA-A.
arXiv Detail & Related papers (2023-08-18T13:13:09Z) - Anchor-free Oriented Proposal Generator for Object Detection [59.54125119453818]
Oriented object detection is a practical and challenging task in remote sensing image interpretation.
Nowadays, oriented detectors mostly use horizontal boxes as intermedium to derive oriented boxes from them.
We propose a novel Anchor-free Oriented Proposal Generator (AOPG) that abandons the horizontal boxes-related operations from the network architecture.
arXiv Detail & Related papers (2021-10-05T10:45:51Z) - Anchor-free Small-scale Multispectral Pedestrian Detection [88.7497134369344]
We propose a method for effective and efficient multispectral fusion of the two modalities in an adapted single-stage anchor-free base architecture.
We aim at learning pedestrian representations based on object center and scale rather than direct bounding box predictions.
Results show our method's effectiveness in detecting small-scaled pedestrians.
arXiv Detail & Related papers (2020-08-19T13:13:01Z) - Representation Sharing for Fast Object Detector Search and Beyond [38.18583590914755]
We propose Fast And Diverse (FAD) to better explore the optimal configuration of receptive fields and convolution types in the sub-networks for one-stage detectors.
FAD achieves prominent improvements on two types of one-stage detectors with various backbones.
arXiv Detail & Related papers (2020-07-23T15:39:44Z) - FairMOT: On the Fairness of Detection and Re-Identification in Multiple
Object Tracking [92.48078680697311]
Multi-object tracking (MOT) is an important problem in computer vision.
We present a simple yet effective approach termed as FairMOT based on the anchor-free object detection architecture CenterNet.
The approach achieves high accuracy for both detection and tracking.
arXiv Detail & Related papers (2020-04-04T08:18:00Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.