Linear Gaussian Bounding Box Representation and Ring-Shaped Rotated
Convolution for Oriented Object Detection
- URL: http://arxiv.org/abs/2311.05410v2
- Date: Tue, 14 Nov 2023 02:26:09 GMT
- Title: Linear Gaussian Bounding Box Representation and Ring-Shaped Rotated
Convolution for Oriented Object Detection
- Authors: Zhen Zhou, Yunkai Ma, Junfeng Fan, Zhaoyang Liu, Fengshui Jing and Min
Tan
- Abstract summary: In oriented object detection, current representations of oriented bounding boxes (OBBs) often suffer from boundary discontinuity problem.
We propose linear GBB (LGBB), a novel OBB representation. By linearly transforming the elements of GBB, LGBB avoids the boundary discontinuity problem and has high numerical stability.
We also propose ring-shaped rotated convolution (RRC), which adaptively rotates feature maps to arbitrary orientations to extract rotation-sensitive features under a ring-shaped receptive field.
- Score: 13.946780253720616
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: In oriented object detection, current representations of oriented bounding
boxes (OBBs) often suffer from boundary discontinuity problem. Methods of
designing continuous regression losses do not essentially solve this problem.
Although Gaussian bounding box (GBB) representation avoids this problem,
directly regressing GBB is susceptible to numerical instability. We propose
linear GBB (LGBB), a novel OBB representation. By linearly transforming the
elements of GBB, LGBB avoids the boundary discontinuity problem and has high
numerical stability. In addition, existing convolution-based rotation-sensitive
feature extraction methods only have local receptive fields, resulting in slow
feature aggregation. We propose ring-shaped rotated convolution (RRC), which
adaptively rotates feature maps to arbitrary orientations to extract
rotation-sensitive features under a ring-shaped receptive field, rapidly
aggregating features and contextual information. Experimental results
demonstrate that LGBB and RRC achieve state-of-the-art performance.
Furthermore, integrating LGBB and RRC into various models effectively improves
detection accuracy.
Related papers
- Constrained Optimization via Exact Augmented Lagrangian and Randomized
Iterative Sketching [55.28394191394675]
We develop an adaptive inexact Newton method for equality-constrained nonlinear, nonIBS optimization problems.
We demonstrate the superior performance of our method on benchmark nonlinear problems, constrained logistic regression with data from LVM, and a PDE-constrained problem.
arXiv Detail & Related papers (2023-05-28T06:33:37Z) - Detecting Rotated Objects as Gaussian Distributions and Its 3-D
Generalization [81.29406957201458]
Existing detection methods commonly use a parameterized bounding box (BBox) to model and detect (horizontal) objects.
We argue that such a mechanism has fundamental limitations in building an effective regression loss for rotation detection.
We propose to model the rotated objects as Gaussian distributions.
We extend our approach from 2-D to 3-D with a tailored algorithm design to handle the heading estimation.
arXiv Detail & Related papers (2022-09-22T07:50:48Z) - Implicit Neural Representation for Mesh-Free Inverse Obstacle Scattering [21.459567997723376]
Implicit representation of shapes as level sets of multilayer perceptrons has recently flourished in different shape analysis, compression, and reconstruction tasks.
We introduce an implicit neural representation-based framework for solving the inverse obstacle scattering problem in a mesh-free fashion.
arXiv Detail & Related papers (2022-06-04T17:16:09Z) - G-Rep: Gaussian Representation for Arbitrary-Oriented Object Detection [13.112764410519981]
We propose a unified Gaussian representation called G-Rep to construct Gaussian distributions for OBB, QBB, and PointSet.
G-Rep achieves a unified solution to various representations and problems.
arXiv Detail & Related papers (2022-05-24T05:28:08Z) - Learning Polar Encodings for Arbitrary-Oriented Ship Detection in SAR
Images [22.93997487064683]
Methods based on oriented bounding box (OBB) have gradually received attention from researchers.
Most of the recently proposed deep learning-based methods for OBB detection encounter the boundary discontinuity problem in angle or key point regression.
In this paper, we propose to detect SAR ships by learning polar encodings.
arXiv Detail & Related papers (2021-03-24T12:52:54Z) - Optimization for Oriented Object Detection via Representation Invariance
Loss [2.501282372971187]
mainstream rotation detectors use oriented bounding boxes (OBB) or quadrilateral bounding boxes (QBB) to represent the rotating objects.
We propose a Representation Invariance Loss (RIL) to optimize the bounding box regression for the rotating objects.
Our method achieves consistent and substantial improvement in experiments on remote sensing datasets and scene text datasets.
arXiv Detail & Related papers (2021-03-22T07:55:33Z) - Rethinking Rotated Object Detection with Gaussian Wasserstein Distance
Loss [111.8807588392563]
Boundary discontinuity and its inconsistency to the final detection metric have been the bottleneck for rotating detection regression loss design.
We propose a novel regression loss based on Gaussian Wasserstein distance as a fundamental approach to solve the problem.
arXiv Detail & Related papers (2021-01-28T12:04:35Z) - Hybrid Trilinear and Bilinear Programming for Aligning Partially
Overlapping Point Sets [85.71360365315128]
In many applications, we need algorithms which can align partially overlapping point sets are invariant to the corresponding corresponding RPM algorithm.
We first show that the objective is a cubic bound function. We then utilize the convex envelopes of trilinear and bilinear monomial transformations to derive its lower bound.
We next develop a branch-and-bound (BnB) algorithm which only branches over the transformation variables and runs efficiently.
arXiv Detail & Related papers (2021-01-19T04:24:23Z) - Canny-VO: Visual Odometry with RGB-D Cameras based on Geometric 3D-2D
Edge Alignment [85.32080531133799]
This paper reviews the classical problem of free-form curve registration and applies it to an efficient RGBD visual odometry system called Canny-VO.
Two replacements for the distance transformation commonly used in edge registration are proposed: Approximate Nearest Neighbour Fields and Oriented Nearest Neighbour Fields.
3D2D edge alignment benefits from these alternative formulations in terms of both efficiency and accuracy.
arXiv Detail & Related papers (2020-12-15T11:42:17Z) - Fast Gravitational Approach for Rigid Point Set Registration with
Ordinary Differential Equations [79.71184760864507]
This article introduces a new physics-based method for rigid point set alignment called Fast Gravitational Approach (FGA)
In FGA, the source and target point sets are interpreted as rigid particle swarms with masses interacting in a globally multiply-linked manner while moving in a simulated gravitational force field.
We show that the new method class has characteristics not found in previous alignment methods.
arXiv Detail & Related papers (2020-09-28T15:05:39Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.