A General Gaussian Heatmap Labeling for Arbitrary-Oriented Object
Detection
- URL: http://arxiv.org/abs/2109.12848v1
- Date: Mon, 27 Sep 2021 07:46:09 GMT
- Title: A General Gaussian Heatmap Labeling for Arbitrary-Oriented Object
Detection
- Authors: Zhanchao Huang, Wei Li, Xiang-Gen Xia, and Ran Tao
- Abstract summary: An anchor-free object-adaptation label assignment (OLA) strategy is presented to define the positive candidates.
An oriented-bounding-box (OBB) representation component (ORC) is developed to indicate OBBs.
A joint-optimization loss (JOL) with area normalization and dynamic confidence weighting is designed to refine the misalign optimal results.
- Score: 11.954992010840833
- License: http://creativecommons.org/licenses/by-nc-nd/4.0/
- Abstract: Recently, many arbitrary-oriented object detection (AOOD) methods have been
proposed and attracted widespread attention in many fields. However, most of
them are based on anchor-boxes or standard Gaussian heatmaps. Such label
assignment strategy may not only fail to reflect the shape and direction
characteristics of arbitrary-oriented objects, but also have high
parameter-tuning efforts. In this paper, a novel AOOD method called General
Gaussian Heatmap Labeling (GGHL) is proposed. Specifically, an anchor-free
object-adaptation label assignment (OLA) strategy is presented to define the
positive candidates based on two-dimensional (2-D) oriented Gaussian heatmaps,
which reflect the shape and direction features of arbitrary-oriented objects.
Based on OLA, an oriented-bounding-box (OBB) representation component (ORC) is
developed to indicate OBBs and adjust the Gaussian center prior weights to fit
the characteristics of different objects adaptively through neural network
learning. Moreover, a joint-optimization loss (JOL) with area normalization and
dynamic confidence weighting is designed to refine the misalign optimal results
of different subtasks. Extensive experiments on public datasets demonstrate
that the proposed GGHL improves the AOOD performance with low parameter-tuning
and time costs. Furthermore, it is generally applicable to most AOOD methods to
improve their performance including lightweight models on embedded platforms.
Related papers
- SOOD++: Leveraging Unlabeled Data to Boost Oriented Object Detection [59.868772767818975]
We propose a simple yet effective Semi-supervised Oriented Object Detection method termed SOOD++.
Specifically, we observe that objects from aerial images are usually arbitrary orientations, small scales, and aggregation.
Extensive experiments conducted on various multi-oriented object datasets under various labeled settings demonstrate the effectiveness of our method.
arXiv Detail & Related papers (2024-07-01T07:03:51Z) - GRA: Detecting Oriented Objects through Group-wise Rotating and Attention [64.21917568525764]
Group-wise Rotating and Attention (GRA) module is proposed to replace the convolution operations in backbone networks for oriented object detection.
GRA can adaptively capture fine-grained features of objects with diverse orientations, comprising two key components: Group-wise Rotating and Group-wise Attention.
GRA achieves a new state-of-the-art (SOTA) on the DOTA-v2.0 benchmark, while saving the parameters by nearly 50% compared to the previous SOTA method.
arXiv Detail & Related papers (2024-03-17T07:29:32Z) - PointOBB: Learning Oriented Object Detection via Single Point
Supervision [55.88982271340328]
This paper proposes PointOBB, the first single Point-based OBB generation method, for oriented object detection.
PointOBB operates through the collaborative utilization of three distinctive views: an original view, a resized view, and a rotated/flipped (rot/flp) view.
Experimental results on the DIOR-R and DOTA-v1.0 datasets demonstrate that PointOBB achieves promising performance.
arXiv Detail & Related papers (2023-11-23T15:51:50Z) - SOOD: Towards Semi-Supervised Oriented Object Detection [57.05141794402972]
This paper proposes a novel Semi-supervised Oriented Object Detection model, termed SOOD, built upon the mainstream pseudo-labeling framework.
Our experiments show that when trained with the two proposed losses, SOOD surpasses the state-of-the-art SSOD methods under various settings on the DOTA-v1.5 benchmark.
arXiv Detail & Related papers (2023-04-10T11:10:42Z) - Detecting Rotated Objects as Gaussian Distributions and Its 3-D
Generalization [81.29406957201458]
Existing detection methods commonly use a parameterized bounding box (BBox) to model and detect (horizontal) objects.
We argue that such a mechanism has fundamental limitations in building an effective regression loss for rotation detection.
We propose to model the rotated objects as Gaussian distributions.
We extend our approach from 2-D to 3-D with a tailored algorithm design to handle the heading estimation.
arXiv Detail & Related papers (2022-09-22T07:50:48Z) - AO2-DETR: Arbitrary-Oriented Object Detection Transformer [17.287517988299925]
We propose an Arbitrary-Oriented Object DEtection TRansformer framework, termed AO2-DETR.
An oriented proposal generation mechanism is proposed to explicitly generate oriented proposals.
And a rotation-aware set matching loss is used to ensure the one-to-one matching process for direct set prediction.
arXiv Detail & Related papers (2022-05-25T13:57:13Z) - Dynamic Anchor Learning for Arbitrary-Oriented Object Detection [4.247967690041766]
Arbitrary-oriented objects widely appear in natural scenes, aerial photographs, remote sensing images, etc.
Current rotation detectors use plenty of anchors with different orientations to achieve spatial alignment with ground truth boxes.
We propose a dynamic anchor learning (DAL) method, which utilizes the newly defined matching degree.
arXiv Detail & Related papers (2020-12-08T01:30:06Z) - Align Deep Features for Oriented Object Detection [40.28244152216309]
We propose a single-shot Alignment Network (S$2$A-Net) consisting of two modules: a Feature Alignment Module (FAM) and an Oriented Detection Module (ODM)
The FAM can generate high-quality anchors with an Anchor Refinement Network and adaptively align the convolutional features according to the anchor boxes with a novel Alignment Convolution.
The ODM first adopts active rotating filters to encode the orientation information and then produces orientation-sensitive and orientation-invariant features to alleviate the inconsistency between classification score and localization accuracy.
arXiv Detail & Related papers (2020-08-21T09:55:13Z) - Dynamic Refinement Network for Oriented and Densely Packed Object
Detection [75.29088991850958]
We present a dynamic refinement network that consists of two novel components, i.e., a feature selection module (FSM) and a dynamic refinement head (DRH)
Our FSM enables neurons to adjust receptive fields in accordance with the shapes and orientations of target objects, whereas the DRH empowers our model to refine the prediction dynamically in an object-aware manner.
We perform quantitative evaluations on several publicly available benchmarks including DOTA, HRSC2016, SKU110K, and our own SKU110K-R dataset.
arXiv Detail & Related papers (2020-05-20T11:35:50Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.