Point2RBox: Combine Knowledge from Synthetic Visual Patterns for End-to-end Oriented Object Detection with Single Point Supervision
- URL: http://arxiv.org/abs/2311.14758v2
- Date: Thu, 21 Mar 2024 12:43:32 GMT
- Title: Point2RBox: Combine Knowledge from Synthetic Visual Patterns for End-to-end Oriented Object Detection with Single Point Supervision
- Authors: Yi Yu, Xue Yang, Qingyun Li, Feipeng Da, Jifeng Dai, Yu Qiao, Junchi Yan,
- Abstract summary: We present Point2RBox, an end-to-end solution for point-supervised object detection.
Our method uses a lightweight paradigm, yet it achieves a competitive performance among point-supervised alternatives.
In particular, our method uses a lightweight paradigm, yet it achieves a competitive performance among point-supervised alternatives.
- Score: 81.60564776995682
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: With the rapidly increasing demand for oriented object detection (OOD), recent research involving weakly-supervised detectors for learning rotated box (RBox) from the horizontal box (HBox) has attracted more and more attention. In this paper, we explore a more challenging yet label-efficient setting, namely single point-supervised OOD, and present our approach called Point2RBox. Specifically, we propose to leverage two principles: 1) Synthetic pattern knowledge combination: By sampling around each labeled point on the image, we spread the object feature to synthetic visual patterns with known boxes to provide the knowledge for box regression. 2) Transform self-supervision: With a transformed input image (e.g. scaled/rotated), the output RBoxes are trained to follow the same transformation so that the network can perceive the relative size/rotation between objects. The detector is further enhanced by a few devised techniques to cope with peripheral issues, e.g. the anchor/layer assignment as the size of the object is not available in our point supervision setting. To our best knowledge, Point2RBox is the first end-to-end solution for point-supervised OOD. In particular, our method uses a lightweight paradigm, yet it achieves a competitive performance among point-supervised alternatives, 41.05%/27.62%/80.01% on DOTA/DIOR/HRSC datasets.
Related papers
- Wholly-WOOD: Wholly Leveraging Diversified-quality Labels for Weakly-supervised Oriented Object Detection [57.26265276035267]
Wholly-WOOD is a weakly-supervised OOD framework capable of wholly leveraging various labeling forms.
By only using HBox for training, our Wholly-WOOD achieves performance very close to that of the RBox-trained counterpart on remote sensing.
arXiv Detail & Related papers (2025-02-13T16:34:59Z) - Point2RBox-v2: Rethinking Point-supervised Oriented Object Detection with Spatial Layout Among Instances [50.80161958767447]
We present Point2RBox-v2, an approach to explore the spatial layout among instances for learning point-supervised OOD.
Our solution is elegant and lightweight, yet it is expected to give a competitive performance especially in densely packed scenes.
arXiv Detail & Related papers (2025-02-06T18:07:25Z) - PointOBB: Learning Oriented Object Detection via Single Point
Supervision [55.88982271340328]
This paper proposes PointOBB, the first single Point-based OBB generation method, for oriented object detection.
PointOBB operates through the collaborative utilization of three distinctive views: an original view, a resized view, and a rotated/flipped (rot/flp) view.
Experimental results on the DIOR-R and DOTA-v1.0 datasets demonstrate that PointOBB achieves promising performance.
arXiv Detail & Related papers (2023-11-23T15:51:50Z) - P2RBox: Point Prompt Oriented Object Detection with SAM [28.96914721062631]
We introduce P2RBox, which employs point prompt to generate rotated box (RBox) annotation for oriented object detection.
P2RBox incorporates two advanced guidance cues: Boundary Sensitive Mask guidance, and Centrality guidance, which utilize spatial information to reduce granularity ambiguity.
Compared to the state-of-the-art point-annotated generative method PointOBB, P2RBox outperforms by about 29% mAP on DOTA-v1.0 dataset.
arXiv Detail & Related papers (2023-11-22T03:33:00Z) - H2RBox-v2: Incorporating Symmetry for Boosting Horizontal Box Supervised
Oriented Object Detection [55.3948651109885]
We present H2RBox-v2, to bridge the gap between HBox-supervised and RBox-supervised oriented object detection.
To our best knowledge, H2RBox-v2 is the first symmetry-aware self-supervised paradigm for oriented object detection.
arXiv Detail & Related papers (2023-04-10T06:11:40Z) - H2RBox: Horizonal Box Annotation is All You Need for Oriented Object
Detection [63.66553556240689]
Oriented object detection emerges in many applications from aerial images to autonomous driving.
Many existing detection benchmarks are annotated with horizontal bounding box only which is also less costive than fine-grained rotated box.
This paper proposes a simple yet effective oriented object detection approach called H2RBox.
arXiv Detail & Related papers (2022-10-13T05:12:45Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.