Oriented Bounding Boxes for Small and Freely Rotated Objects
- URL: http://arxiv.org/abs/2104.11854v1
- Date: Sat, 24 Apr 2021 02:04:49 GMT
- Title: Oriented Bounding Boxes for Small and Freely Rotated Objects
- Authors: Mohsen Zand, Ali Etemad, and Michael Greenspan
- Abstract summary: A novel object detection method is presented that handles freely rotated objects of arbitrary sizes.
The method encodes the precise location and orientation of features of the target objects at grid cell locations.
Evaluations on the xView and DOTA datasets show that the proposed method uniformly improves performance over existing state-of-the-art methods.
- Score: 7.6997148655751895
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: A novel object detection method is presented that handles freely rotated
objects of arbitrary sizes, including tiny objects as small as $2\times 2$
pixels. Such tiny objects appear frequently in remotely sensed images, and
present a challenge to recent object detection algorithms. More importantly,
current object detection methods have been designed originally to accommodate
axis-aligned bounding box detection, and therefore fail to accurately localize
oriented boxes that best describe freely rotated objects. In contrast, the
proposed CNN-based approach uses potential pixel information at multiple scale
levels without the need for any external resources, such as anchor boxes.The
method encodes the precise location and orientation of features of the target
objects at grid cell locations. Unlike existing methods which regress the
bounding box location and dimension,the proposed method learns all the required
information by classification, which has the added benefit of enabling oriented
bounding box detection without any extra computation. It thus infers the
bounding boxes only at inference time by finding the minimum surrounding box
for every set of the same predicted class labels. Moreover, a
rotation-invariant feature representation is applied to each scale, which
imposes a regularization constraint to enforce covering the 360 degree range of
in-plane rotation of the training samples to share similar features.
Evaluations on the xView and DOTA datasets show that the proposed method
uniformly improves performance over existing state-of-the-art methods.
Related papers
- Find n' Propagate: Open-Vocabulary 3D Object Detection in Urban Environments [67.83787474506073]
We tackle the limitations of current LiDAR-based 3D object detection systems.
We introduce a universal textscFind n' Propagate approach for 3D OV tasks.
We achieve up to a 3.97-fold increase in Average Precision (AP) for novel object classes.
arXiv Detail & Related papers (2024-03-20T12:51:30Z) - Linear Object Detection in Document Images using Multiple Object
Tracking [58.720142291102135]
Linear objects convey substantial information about document structure.
Many approaches can recover some vector representation, but only one closed-source technique introduced in 1994.
We propose a framework for accurate instance segmentation of linear objects in document images using Multiple Object Tracking.
arXiv Detail & Related papers (2023-05-26T14:22:03Z) - One-Shot General Object Localization [43.88712478006662]
OneLoc is a general one-shot object localization algorithm.
OneLoc efficiently finds the object center and bounding box size by a special voting scheme.
Experiments show that the proposed method achieves state-of-the-art overall performance on two datasets.
arXiv Detail & Related papers (2022-11-24T03:14:04Z) - H2RBox: Horizonal Box Annotation is All You Need for Oriented Object
Detection [63.66553556240689]
Oriented object detection emerges in many applications from aerial images to autonomous driving.
Many existing detection benchmarks are annotated with horizontal bounding box only which is also less costive than fine-grained rotated box.
This paper proposes a simple yet effective oriented object detection approach called H2RBox.
arXiv Detail & Related papers (2022-10-13T05:12:45Z) - BoxMask: Revisiting Bounding Box Supervision for Video Object Detection [11.255962936937744]
We propose BoxMask, which learns discriminative representations by incorporating class-aware pixel-level information.
The proposed module can be effortlessly integrated into any region-based detector to boost detection.
arXiv Detail & Related papers (2022-10-12T08:25:27Z) - ImpDet: Exploring Implicit Fields for 3D Object Detection [74.63774221984725]
We introduce a new perspective that views bounding box regression as an implicit function.
This leads to our proposed framework, termed Implicit Detection or ImpDet.
Our ImpDet assigns specific values to points in different local 3D spaces, thereby high-quality boundaries can be generated.
arXiv Detail & Related papers (2022-03-31T17:52:12Z) - Oriented RepPoints for Aerial Object Detection [10.818838437018682]
In this paper, we propose a novel approach to aerial object detection, named Oriented RepPoints.
Specifically, we suggest to employ a set of adaptive points to capture the geometric and spatial information of the arbitrary-oriented objects.
To facilitate the supervised learning, the oriented conversion function is proposed to explicitly map the adaptive point set into an oriented bounding box.
arXiv Detail & Related papers (2021-05-24T06:18:23Z) - End-to-end Deep Object Tracking with Circular Loss Function for Rotated
Bounding Box [68.8204255655161]
We introduce a novel end-to-end deep learning method based on the Transformer Multi-Head Attention architecture.
We also present a new type of loss function, which takes into account the bounding box overlap and orientation.
arXiv Detail & Related papers (2020-12-17T17:29:29Z) - Oriented Object Detection in Aerial Images with Box Boundary-Aware
Vectors [21.484827209503823]
Oriented object detection in aerial images is a challenging task as the objects in aerial images are displayed in arbitrary directions.
Current oriented object detection methods mainly rely on two-stage anchor-based detectors.
We extend the horizontal keypoint-based object to the oriented object detection task.
arXiv Detail & Related papers (2020-08-17T00:56:50Z) - Dive Deeper Into Box for Object Detection [49.923586776690115]
We propose a box reorganization method(DDBNet), which can dive deeper into the box for more accurate localization.
Experimental results show that our method is effective which leads to state-of-the-art performance for object detection.
arXiv Detail & Related papers (2020-07-15T07:49:05Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.