Related papers: H2RBox-v2: Incorporating Symmetry for Boosting Horizontal Box Supervised Oriented Object Detection

URL: http://arxiv.org/abs/2304.04403v4
Date: Mon, 16 Oct 2023 15:12:19 GMT
Title: H2RBox-v2: Incorporating Symmetry for Boosting Horizontal Box Supervised Oriented Object Detection
Authors: Yi Yu, Xue Yang, Qingyun Li, Yue Zhou, Gefan Zhang, Feipeng Da, Junchi Yan
Abstract summary: We present H2RBox-v2, to bridge the gap between HBox-supervised and RBox-supervised oriented object detection. To our best knowledge, H2RBox-v2 is the first symmetry-aware self-supervised paradigm for oriented object detection.
Score: 55.3948651109885
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: With the rapidly increasing demand for oriented object detection, e.g. in autonomous driving and remote sensing, the recently proposed paradigm involving weakly-supervised detector H2RBox for learning rotated box (RBox) from the more readily-available horizontal box (HBox) has shown promise. This paper presents H2RBox-v2, to further bridge the gap between HBox-supervised and RBox-supervised oriented object detection. Specifically, we propose to leverage the reflection symmetry via flip and rotate consistencies, using a weakly-supervised network branch similar to H2RBox, together with a novel self-supervised branch that learns orientations from the symmetry inherent in visual objects. The detector is further stabilized and enhanced by practical techniques to cope with peripheral issues e.g. angular periodicity. To our best knowledge, H2RBox-v2 is the first symmetry-aware self-supervised paradigm for oriented object detection. In particular, our method shows less susceptibility to low-quality annotation and insufficient training data compared to H2RBox. Specifically, H2RBox-v2 achieves very close performance to a rotation annotation trained counterpart -- Rotated FCOS: 1) DOTA-v1.0/1.5/2.0: 72.31%/64.76%/50.33% vs. 72.44%/64.53%/51.77%; 2) HRSC: 89.66% vs. 88.99%; 3) FAIR1M: 42.27% vs. 41.25%.

Related papers

Wholly-WOOD: Wholly Leveraging Diversified-quality Labels for Weakly-supervised Oriented Object Detection [57.26265276035267]
Wholly-WOOD is a weakly-supervised OOD framework capable of wholly leveraging various labeling forms. By only using HBox for training, our Wholly-WOOD achieves performance very close to that of the RBox-trained counterpart on remote sensing.
arXiv Detail & Related papers (2025-02-13T16:34:59Z)
Point2RBox-v2: Rethinking Point-supervised Oriented Object Detection with Spatial Layout Among Instances [50.80161958767447]
We present Point2RBox-v2, an approach to explore the spatial layout among instances for learning point-supervised OOD. Our solution is elegant and lightweight, yet it is expected to give a competitive performance especially in densely packed scenes.
arXiv Detail & Related papers (2025-02-06T18:07:25Z)
Point2RBox: Combine Knowledge from Synthetic Visual Patterns for End-to-end Oriented Object Detection with Single Point Supervision [81.60564776995682]
We present Point2RBox, an end-to-end solution for point-supervised object detection. Our method uses a lightweight paradigm, yet it achieves a competitive performance among point-supervised alternatives.
arXiv Detail & Related papers (2023-11-23T15:57:41Z)
P2RBox: Point Prompt Oriented Object Detection with SAM [28.96914721062631]
We introduce P2RBox, which employs point prompt to generate rotated box (RBox) annotation for oriented object detection. P2RBox incorporates two advanced guidance cues: Boundary Sensitive Mask guidance, and Centrality guidance, which utilize spatial information to reduce granularity ambiguity. Compared to the state-of-the-art point-annotated generative method PointOBB, P2RBox outperforms by about 29% mAP on DOTA-v1.0 dataset.
arXiv Detail & Related papers (2023-11-22T03:33:00Z)
RD-VIO: Robust Visual-Inertial Odometry for Mobile Augmented Reality in Dynamic Environments [55.864869961717424]
It is typically challenging for visual or visual-inertial odometry systems to handle the problems of dynamic scenes and pure rotation. We design a novel visual-inertial odometry (VIO) system called RD-VIO to handle both of these problems.
arXiv Detail & Related papers (2023-10-23T16:30:39Z)
SOOD: Towards Semi-Supervised Oriented Object Detection [57.05141794402972]
This paper proposes a novel Semi-supervised Oriented Object Detection model, termed SOOD, built upon the mainstream pseudo-labeling framework. Our experiments show that when trained with the two proposed losses, SOOD surpasses the state-of-the-art SSOD methods under various settings on the DOTA-v1.5 benchmark.
arXiv Detail & Related papers (2023-04-10T11:10:42Z)
H2RBox: Horizontal Box Annotation is All You Need for Oriented Object Detection [63.66553556240689]
Oriented object detection emerges in many applications from aerial images to autonomous driving. Many existing detection benchmarks are annotated with horizontal bounding boxes only, which are also less costly than fine-grained rotated boxes. This paper proposes a simple yet effective oriented object detection approach called H2RBox.
arXiv Detail & Related papers (2022-10-13T05:12:45Z)
Point RCNN: An Angle-Free Framework for Rotated Object Detection [13.209895262511015]
Rotated object detection in aerial images is still challenging due to arbitrary orientations, large scale and aspect ratio variations, and extreme density of objects. We propose a purely angle-free framework for rotated object detection, called Point RCNN, which mainly consists of PointRPN and PointReg. Experiments demonstrate that our Point RCNN achieves the new state-of-the-art detection performance on commonly used aerial datasets.
arXiv Detail & Related papers (2022-05-28T04:07:37Z)
MRDet: A Multi-Head Network for Accurate Oriented Object Detection in Aerial Images [51.227489316673484]
We propose an arbitrary-oriented region proposal network (AO-RPN) to generate oriented proposals transformed from horizontal anchors. To obtain accurate bounding boxes, we decouple the detection task into multiple subtasks and propose a multi-head network. Each head is specially designed to learn the features optimal for the corresponding task, which allows our network to detect objects accurately.
arXiv Detail & Related papers (2020-12-24T06:36:48Z)

This list is automatically generated from the titles and abstracts of the papers on this site.