Edge Wasserstein Distance Loss for Oriented Object Detection
- URL: http://arxiv.org/abs/2312.07048v1
- Date: Tue, 12 Dec 2023 08:00:40 GMT
- Title: Edge Wasserstein Distance Loss for Oriented Object Detection
- Authors: Yuke Zhu, Yumeng Ruan, Zihua Xiong, Sheng Guo
- Abstract summary: We propose a novel oriented regression loss, Wasserstein Distance(EWD) loss, to alleviate the square-like problem.
Specifically, for the oriented box(OBox) representation, we choose a specially-designed distribution whose probability density function is only nonzero over the edges.
- Score: 30.63435516524413
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Regression loss design is an essential topic for oriented object detection.
Due to the periodicity of the angle and the ambiguity of width and height
definition, traditional L1-distance loss and its variants have been suffered
from the metric discontinuity and the square-like problem. As a solution, the
distribution based methods show significant advantages by representing oriented
boxes as distributions. Differing from exploited the Gaussian distribution to
get analytical form of distance measure, we propose a novel oriented regression
loss, Wasserstein Distance(EWD) loss, to alleviate the square-like problem.
Specifically, for the oriented box(OBox) representation, we choose a
specially-designed distribution whose probability density function is only
nonzero over the edges. On this basis, we develop Wasserstein distance as the
measure. Besides, based on the edge representation of OBox, the EWD loss can be
generalized to quadrilateral and polynomial regression scenarios. Experiments
on multiple popular datasets and different detectors show the effectiveness of
the proposed method.
Related papers
- Refined Risk Bounds for Unbounded Losses via Transductive Priors [58.967816314671296]
We revisit the sequential variants of linear regression with the squared loss, classification problems with hinge loss, and logistic regression.
Our key tools are based on the exponential weights algorithm with carefully chosen transductive priors.
arXiv Detail & Related papers (2024-10-29T00:01:04Z) - Rate-Distortion-Perception Tradeoff Based on the
Conditional-Distribution Perception Measure [33.084834042565895]
We study the rate-distortionperception (RDP) tradeoff for a memoryless source model in the limit of large blocklengths.
Our perception measure is based on a divergence between the distributions of the source and reconstruction sequences conditioned on the encoder output.
arXiv Detail & Related papers (2024-01-22T18:49:56Z) - Energy-Based Sliced Wasserstein Distance [47.18652387199418]
A key component of the sliced Wasserstein (SW) distance is the slicing distribution.
We propose to design the slicing distribution as an energy-based distribution that is parameter-free.
We then derive a novel sliced Wasserstein metric, energy-based sliced Waserstein (EBSW) distance.
arXiv Detail & Related papers (2023-04-26T14:28:45Z) - Detecting Rotated Objects as Gaussian Distributions and Its 3-D
Generalization [81.29406957201458]
Existing detection methods commonly use a parameterized bounding box (BBox) to model and detect (horizontal) objects.
We argue that such a mechanism has fundamental limitations in building an effective regression loss for rotation detection.
We propose to model the rotated objects as Gaussian distributions.
We extend our approach from 2-D to 3-D with a tailored algorithm design to handle the heading estimation.
arXiv Detail & Related papers (2022-09-22T07:50:48Z) - MKIoU Loss: Towards Accurate Oriented Object Detection in Aerial Images [0.0]
A modulated Kalman IoU loss of approximate SkewIoU is proposed, named MKIoU.
The proposed method can be easily extended to other Gaussian-based methods.
arXiv Detail & Related papers (2022-06-30T08:17:01Z) - Optimization for Oriented Object Detection via Representation Invariance
Loss [2.501282372971187]
mainstream rotation detectors use oriented bounding boxes (OBB) or quadrilateral bounding boxes (QBB) to represent the rotating objects.
We propose a Representation Invariance Loss (RIL) to optimize the bounding box regression for the rotating objects.
Our method achieves consistent and substantial improvement in experiments on remote sensing datasets and scene text datasets.
arXiv Detail & Related papers (2021-03-22T07:55:33Z) - Regressive Domain Adaptation for Unsupervised Keypoint Detection [67.2950306888855]
Domain adaptation (DA) aims at transferring knowledge from a labeled source domain to an unlabeled target domain.
We present a method of regressive domain adaptation (RegDA) for unsupervised keypoint detection.
Our method brings large improvement by 8% to 11% in terms of PCK on different datasets.
arXiv Detail & Related papers (2021-03-10T16:45:22Z) - Rethinking Rotated Object Detection with Gaussian Wasserstein Distance
Loss [111.8807588392563]
Boundary discontinuity and its inconsistency to the final detection metric have been the bottleneck for rotating detection regression loss design.
We propose a novel regression loss based on Gaussian Wasserstein distance as a fundamental approach to solve the problem.
arXiv Detail & Related papers (2021-01-28T12:04:35Z) - Wasserstein Distances for Stereo Disparity Estimation [62.09272563885437]
Existing approaches to depth or disparity estimation output a distribution over a set of pre-defined discrete values.
This leads to inaccurate results when the true depth or disparity does not match any of these values.
We address these issues using a new neural network architecture that is capable of outputting arbitrary depth values.
arXiv Detail & Related papers (2020-07-06T21:37:50Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.