ReDet: A Rotation-equivariant Detector for Aerial Object Detection
- URL: http://arxiv.org/abs/2103.07733v1
- Date: Sat, 13 Mar 2021 15:37:36 GMT
- Title: ReDet: A Rotation-equivariant Detector for Aerial Object Detection
- Authors: Jiaming Han and Jian Ding and Nan Xue and Gui-Song Xia
- Abstract summary: We propose a Rotation-equivariant Detector (ReDet) to address these issues.
We incorporate rotation-equivariant networks into the detector to extract rotation-equivariant features.
Our method can achieve state-of-the-art performance on the task of aerial object detection.
- Score: 27.419045245853706
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Recently, object detection in aerial images has gained much attention in
computer vision. Different from objects in natural images, aerial objects are
often distributed with arbitrary orientation. Therefore, the detector requires
more parameters to encode the orientation information, which are often highly
redundant and inefficient. Moreover, as ordinary CNNs do not explicitly model
the orientation variation, large amounts of rotation augmented data is needed
to train an accurate object detector. In this paper, we propose a
Rotation-equivariant Detector (ReDet) to address these issues, which explicitly
encodes rotation equivariance and rotation invariance. More precisely, we
incorporate rotation-equivariant networks into the detector to extract
rotation-equivariant features, which can accurately predict the orientation and
lead to a huge reduction of model size. Based on the rotation-equivariant
features, we also present Rotation-invariant RoI Align (RiRoI Align), which
adaptively extracts rotation-invariant features from equivariant features
according to the orientation of RoI. Extensive experiments on several
challenging aerial image datasets DOTA-v1.0, DOTA-v1.5 and HRSC2016, show that
our method can achieve state-of-the-art performance on the task of aerial
object detection. Compared with previous best results, our ReDet gains 1.2, 3.5
and 2.6 mAP on DOTA-v1.0, DOTA-v1.5 and HRSC2016 respectively while reducing
the number of parameters by 60\% (313 Mb vs. 121 Mb). The code is available at:
\url{https://github.com/csuhan/ReDet}.
Related papers
- RIDE: Boosting 3D Object Detection for LiDAR Point Clouds via Rotation-Invariant Analysis [15.42293045246587]
RIDE is a pioneering exploration of Rotation-Invariance for the 3D LiDAR-point-based object DEtector.
We design a bi-feature extractor that extracts (i) object-aware features though sensitive to rotation but preserve geometry well, and (ii) rotation-invariant features, which lose geometric information to a certain extent but are robust to rotation.
Our RIDE is compatible and easy to plug into the existing one-stage and two-stage 3D detectors, and boosts both detection performance and rotation robustness.
arXiv Detail & Related papers (2024-08-28T08:53:33Z) - FRED: Towards a Full Rotation-Equivariance in Aerial Image Object
Detection [28.47314201641291]
We introduce a Fully Rotation-Equivariant Oriented Object Detector (FRED)
Our proposed method delivers comparable performance on DOTA-v1.0 and outperforms by 1.5 mAP on DOTA-v1.5, all while significantly reducing the model parameters to 16%.
arXiv Detail & Related papers (2023-12-22T09:31:43Z) - Rotation Invariant Transformer for Recognizing Object in UAVs [66.1564328237299]
We propose a novel rotation invariant vision transformer (RotTrans) forRecognizing targets of interest from UAVs.
RotTrans greatly outperforms the current state-of-the-arts, which is 5.9% and 4.8% higher than the highest mAP and Rank1.
Our solution wins the first place in the UAV-based person re-recognition track in the Multi-Modal Video Reasoning and Analyzing Competition.
arXiv Detail & Related papers (2023-11-05T03:55:08Z) - Adaptive Rotated Convolution for Rotated Object Detection [96.94590550217718]
We present Adaptive Rotated Convolution (ARC) module to handle rotated object detection problem.
In our ARC module, the convolution kernels rotate adaptively to extract object features with varying orientations in different images.
The proposed approach achieves state-of-the-art performance on the DOTA dataset with 81.77% mAP.
arXiv Detail & Related papers (2023-03-14T11:53:12Z) - Category-Level 6D Object Pose Estimation with Flexible Vector-Based
Rotation Representation [51.67545893892129]
We propose a novel 3D graph convolution based pipeline for category-level 6D pose and size estimation from monocular RGB-D images.
We first design an orientation-aware autoencoder with 3D graph convolution for latent feature learning.
Then, to efficiently decode the rotation information from the latent feature, we design a novel flexible vector-based decomposable rotation representation.
arXiv Detail & Related papers (2022-12-09T02:13:43Z) - DARDet: A Dense Anchor-free Rotated Object Detector in Aerial Images [11.45718985586972]
We propose a dense anchor-free rotated object detector (DARDet) for rotated object detection in aerial images.
Our DARDet directly predicts five parameters of rotated boxes at each foreground pixel of feature maps.
Our method achieves state-of-the-art performance on three commonly used aerial objects datasets.
arXiv Detail & Related papers (2021-10-03T15:28:14Z) - RSDet++: Point-based Modulated Loss for More Accurate Rotated Object
Detection [53.57176614020894]
We classify the discontinuity of loss in both five-param and eight-param rotated object detection methods as rotation sensitivity error (RSE)
We introduce a novel modulated rotation loss to alleviate the problem and propose a rotation sensitivity detection network (RSDet)
To further improve the accuracy of our method on objects smaller than 10 pixels, we introduce a novel RSDet++.
arXiv Detail & Related papers (2021-09-24T11:57:53Z) - Learning High-Precision Bounding Box for Rotated Object Detection via
Kullback-Leibler Divergence [100.6913091147422]
Existing rotated object detectors are mostly inherited from the horizontal detection paradigm.
In this paper, we are motivated to change the design of rotation regression loss from induction paradigm to deduction methodology.
arXiv Detail & Related papers (2021-06-03T14:29:19Z) - Rotation Equivariant Feature Image Pyramid Network for Object Detection
in Optical Remote Sensing Imagery [39.25541709228373]
We propose the rotation equivariant feature image pyramid network (REFIPN), an image pyramid network based on rotation equivariance convolution.
The proposed pyramid network extracts features in a wide range of scales and orientations by using novel convolution filters.
The detection performance of the proposed model is validated on two commonly used aerial benchmarks.
arXiv Detail & Related papers (2021-06-02T01:33:49Z) - On the Arbitrary-Oriented Object Detection: Classification based
Approaches Revisited [94.5455251250471]
We first show that the boundary problem suffered in existing dominant regression-based rotation detectors, is caused by angular periodicity or corner ordering.
We transform the angular prediction task from a regression problem to a classification one.
For the resulting circularly distributed angle classification problem, we first devise a Circular Smooth Label technique to handle the periodicity of angle and increase the error tolerance to adjacent angles.
arXiv Detail & Related papers (2020-03-12T03:23:54Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.