Oriented RepPoints for Aerial Object Detection
- URL: http://arxiv.org/abs/2105.11111v1
- Date: Mon, 24 May 2021 06:18:23 GMT
- Title: Oriented RepPoints for Aerial Object Detection
- Authors: Wentong Li, Jianke Zhu
- Abstract summary: In this paper, we propose a novel approach to aerial object detection, named Oriented RepPoints.
Specifically, we suggest to employ a set of adaptive points to capture the geometric and spatial information of the arbitrary-oriented objects.
To facilitate the supervised learning, the oriented conversion function is proposed to explicitly map the adaptive point set into an oriented bounding box.
- Score: 10.818838437018682
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: In contrast to the oriented bounding boxes, point set representation has
great potential to capture the detailed structure of instances with the
arbitrary orientations, large aspect ratios and dense distribution in aerial
images. However, the conventional point set-based approaches are handcrafted
with the fixed locations using points-to-points supervision, which hurts their
flexibility on the fine-grained feature extraction. To address these
limitations, in this paper, we propose a novel approach to aerial object
detection, named Oriented RepPoints. Specifically, we suggest to employ a set
of adaptive points to capture the geometric and spatial information of the
arbitrary-oriented objects, which is able to automatically arrange themselves
over the object in a spatial and semantic scenario. To facilitate the
supervised learning, the oriented conversion function is proposed to explicitly
map the adaptive point set into an oriented bounding box. Moreover, we
introduce an effective quality assessment measure to select the point set
samples for training, which can choose the representative items with respect to
their potentials on orientated object detection. Furthermore, we suggest a
spatial constraint to penalize the outlier points outside the ground-truth
bounding box. In addition to the traditional evaluation metric mAP focusing on
overlap ratio, we propose a new metric mAOE to measure the orientation accuracy
that is usually neglected in the previous studies on oriented object detection.
Experiments on three widely used datasets including DOTA, HRSC2016 and UCAS-AOD
demonstrate that our proposed approach is effective.
Related papers
- PointOBB: Learning Oriented Object Detection via Single Point
Supervision [55.88982271340328]
This paper proposes PointOBB, the first single Point-based OBB generation method, for oriented object detection.
PointOBB operates through the collaborative utilization of three distinctive views: an original view, a resized view, and a rotated/flipped (rot/flp) view.
Experimental results on the DIOR-R and DOTA-v1.0 datasets demonstrate that PointOBB achieves promising performance.
arXiv Detail & Related papers (2023-11-23T15:51:50Z) - Weakly-supervised Contrastive Learning for Unsupervised Object Discovery [52.696041556640516]
Unsupervised object discovery is promising due to its ability to discover objects in a generic manner.
We design a semantic-guided self-supervised learning model to extract high-level semantic features from images.
We introduce Principal Component Analysis (PCA) to localize object regions.
arXiv Detail & Related papers (2023-07-07T04:03:48Z) - ROIFormer: Semantic-Aware Region of Interest Transformer for Efficient
Self-Supervised Monocular Depth Estimation [6.923035780685481]
We propose an efficient local adaptive attention method for geometric aware representation enhancement.
We leverage geometric cues from semantic information to learn local adaptive bounding boxes to guide unsupervised feature aggregation.
Our proposed method establishes a new state-of-the-art in self-supervised monocular depth estimation task.
arXiv Detail & Related papers (2022-12-12T06:38:35Z) - ProposalContrast: Unsupervised Pre-training for LiDAR-based 3D Object
Detection [114.54835359657707]
ProposalContrast is an unsupervised point cloud pre-training framework.
It learns robust 3D representations by contrasting region proposals.
ProposalContrast is verified on various 3D detectors.
arXiv Detail & Related papers (2022-07-26T04:45:49Z) - SASA: Semantics-Augmented Set Abstraction for Point-based 3D Object
Detection [78.90102636266276]
We propose a novel set abstraction method named Semantics-Augmented Set Abstraction (SASA)
Based on the estimated point-wise foreground scores, we then propose a semantics-guided point sampling algorithm to help retain more important foreground points during down-sampling.
In practice, SASA shows to be effective in identifying valuable points related to foreground objects and improving feature learning for point-based 3D detection.
arXiv Detail & Related papers (2022-01-06T08:54:47Z) - A Self-Training Approach for Point-Supervised Object Detection and
Counting in Crowds [54.73161039445703]
We propose a novel self-training approach that enables a typical object detector trained only with point-level annotations.
During training, we utilize the available point annotations to supervise the estimation of the center points of objects.
Experimental results show that our approach significantly outperforms state-of-the-art point-supervised methods under both detection and counting tasks.
arXiv Detail & Related papers (2020-07-25T02:14:42Z) - Point-Set Anchors for Object Detection, Instance Segmentation and Pose
Estimation [85.96410825961966]
We argue that the image features extracted at a central point contain limited information for predicting distant keypoints or bounding box boundaries.
To facilitate inference, we propose to instead perform regression from a set of points placed at more advantageous positions.
We apply this proposed framework, called Point-Set Anchors, to object detection, instance segmentation, and human pose estimation.
arXiv Detail & Related papers (2020-07-06T15:59:56Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.