Confidence-driven Bounding Box Localization for Small Object Detection
- URL: http://arxiv.org/abs/2303.01803v1
- Date: Fri, 3 Mar 2023 09:19:08 GMT
- Title: Confidence-driven Bounding Box Localization for Small Object Detection
- Authors: Huixin Sun, Baochang Zhang, Yanjing Li, Xianbin Cao
- Abstract summary: We present Confidence-driven Bounding Box localization (C-BBL) method to rectify the gradients.
C-BBL quantizes continuous labels into grids and formulates two-hot ground truth labels.
We demonstrate the generalizability of C-BBL to different label systems and effectiveness for high resolution detection.
- Score: 30.906712428887147
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Despite advancements in generic object detection, there remains a performance
gap in detecting small objects compared to normal-scale objects. We for the
first time observe that existing bounding box regression methods tend to
produce distorted gradients for small objects and result in less accurate
localization. To address this issue, we present a novel Confidence-driven
Bounding Box Localization (C-BBL) method to rectify the gradients. C-BBL
quantizes continuous labels into grids and formulates two-hot ground truth
labels. In prediction, the bounding box head generates a confidence
distribution over the grids. Unlike the bounding box regression paradigms in
conventional detectors, we introduce a classification-based localization
objective through cross entropy between ground truth and predicted confidence
distribution, generating confidence-driven gradients. Additionally, C-BBL
describes a uncertainty loss based on distribution entropy in labels and
predictions to further reduce the uncertainty in small object localization. The
method is evaluated on multiple detectors using three object detection
benchmarks and consistently improves baseline detectors, achieving
state-of-the-art performance. We also demonstrate the generalizability of C-BBL
to different label systems and effectiveness for high resolution detection,
which validates its prospect as a general solution.
Related papers
- An Uncertainty-aware DETR Enhancement Framework for Object Detection [10.102900613370817]
We propose an uncertainty-aware enhancement framework for DETR-based object detectors.<n>We derive a Bayes Risk formulation to filter high-risk information and improve detection reliability.<n> Experiments on the COCO benchmark show that our method can be effectively integrated into existing DETR variants.
arXiv Detail & Related papers (2025-07-20T07:53:04Z) - Robust Distribution Alignment for Industrial Anomaly Detection under Distribution Shift [51.24522135151649]
Anomaly detection plays a crucial role in quality control for industrial applications.
Existing methods attempt to address domain shifts by training generalizable models.
Our proposed method demonstrates superior results compared with state-of-the-art anomaly detection and domain adaptation methods.
arXiv Detail & Related papers (2025-03-19T05:25:52Z) - Harnessing Uncertainty-aware Bounding Boxes for Unsupervised 3D Object Detection [22.297964850282177]
Unsupervised 3D object detection aims to identify objects of interest from unlabeled raw data, such as LiDAR points.
Recent approaches usually adopt pseudo 3D bounding boxes (3D bboxes) from clustering algorithm to initialize the model training.
We introduce a new uncertainty-aware framework for unsupervised 3D object detection, dubbed UA3D.
arXiv Detail & Related papers (2024-08-01T15:01:07Z) - Stable Neighbor Denoising for Source-free Domain Adaptive Segmentation [91.83820250747935]
Pseudo-label noise is mainly contained in unstable samples in which predictions of most pixels undergo significant variations during self-training.
We introduce the Stable Neighbor Denoising (SND) approach, which effectively discovers highly correlated stable and unstable samples.
SND consistently outperforms state-of-the-art methods in various SFUDA semantic segmentation settings.
arXiv Detail & Related papers (2024-06-10T21:44:52Z) - Better Sampling, towards Better End-to-end Small Object Detection [7.7473020808686694]
Small object detection remains unsatisfactory due to limited characteristics and high density and mutual overlap.
We propose methods enhancing sampling within an end-to-end framework.
Our model demonstrates a significant enhancement, achieving a 2.9% increase in average precision (AP) over the state-of-the-art (SOTA) on the VisDrone dataset.
arXiv Detail & Related papers (2024-05-17T04:37:44Z) - Bi-discriminator Domain Adversarial Neural Networks with Class-Level
Gradient Alignment [87.8301166955305]
We propose a novel bi-discriminator domain adversarial neural network with class-level gradient alignment.
BACG resorts to gradient signals and second-order probability estimation for better alignment of domain distributions.
In addition, inspired by contrastive learning, we develop a memory bank-based variant, i.e. Fast-BACG, which can greatly shorten the training process.
arXiv Detail & Related papers (2023-10-21T09:53:17Z) - Exploiting Low-confidence Pseudo-labels for Source-free Object Detection [54.98300313452037]
Source-free object detection (SFOD) aims to adapt a source-trained detector to an unlabeled target domain without access to the labeled source data.
Current SFOD methods utilize a threshold-based pseudo-label approach in the adaptation phase.
We propose a new approach to take full advantage of pseudo-labels by introducing high and low confidence thresholds.
arXiv Detail & Related papers (2023-10-19T12:59:55Z) - Provably Accelerating Ill-Conditioned Low-rank Estimation via Scaled
Gradient Descent, Even with Overparameterization [48.65416821017865]
This chapter introduces a new algorithmic approach, dubbed scaled gradient (ScaledGD)
It converges linearly at a constant rate independent of the condition number of the low-rank object.
It maintains the low periteration cost of gradient descent for a variety of tasks.
arXiv Detail & Related papers (2023-10-09T21:16:57Z) - Localization-Guided Track: A Deep Association Multi-Object Tracking
Framework Based on Localization Confidence of Detections [4.565826090373598]
localization confidence is applied in MOT for the first time, with appearance clarity and localization accuracy of detection boxes taken into account.
Our proposed method outperforms the compared state-of-art tracking methods.
arXiv Detail & Related papers (2023-09-18T13:45:35Z) - Small Object Detection via Coarse-to-fine Proposal Generation and
Imitation Learning [52.06176253457522]
We propose a two-stage framework tailored for small object detection based on the Coarse-to-fine pipeline and Feature Imitation learning.
CFINet achieves state-of-the-art performance on the large-scale small object detection benchmarks, SODA-D and SODA-A.
arXiv Detail & Related papers (2023-08-18T13:13:09Z) - GLENet: Boosting 3D Object Detectors with Generative Label Uncertainty Estimation [70.75100533512021]
In this paper, we formulate the label uncertainty problem as the diversity of potentially plausible bounding boxes of objects.
We propose GLENet, a generative framework adapted from conditional variational autoencoders, to model the one-to-many relationship between a typical 3D object and its potential ground-truth bounding boxes with latent variables.
The label uncertainty generated by GLENet is a plug-and-play module and can be conveniently integrated into existing deep 3D detectors.
arXiv Detail & Related papers (2022-07-06T06:26:17Z) - Beyond the Edge of Stability via Two-step Gradient Updates [49.03389279816152]
Gradient Descent (GD) is a powerful workhorse of modern machine learning.
GD's ability to find local minimisers is only guaranteed for losses with Lipschitz gradients.
This work focuses on simple, yet representative, learning problems via analysis of two-step gradient updates.
arXiv Detail & Related papers (2022-06-08T21:32:50Z) - Mitigating the Mutual Error Amplification for Semi-Supervised Object
Detection [92.52505195585925]
We propose a Cross Teaching (CT) method, aiming to mitigate the mutual error amplification by introducing a rectification mechanism of pseudo labels.
In contrast to existing mutual teaching methods that directly treat predictions from other detectors as pseudo labels, we propose the Label Rectification Module (LRM)
arXiv Detail & Related papers (2022-01-26T03:34:57Z) - Boundary Distribution Estimation for Precise Object Detection [12.247010914825971]
In the field of object detection, the task of object localization is typically accomplished through a dedicated that emphasizes bounding box regression.
This traditionally predicts the object's position by regressing the box's center position and scaling factors.
In this paper, we address the shortcomings of previous methods through theoretical analysis and experimental verification.
Our approach enhances the accuracy of bounding box localization by refining the box edges based on the estimated distribution at the object's boundary.
arXiv Detail & Related papers (2021-11-02T06:58:22Z) - Uncertainty-Aware Model Adaptation for Unsupervised Cross-Domain Object
Detection [12.807987076435928]
This work tackles the unsupervised cross-domain object detection problem.
It aims to generalize a pre-trained object detector to a new target domain without labels.
arXiv Detail & Related papers (2021-08-28T09:37:18Z) - Localization Uncertainty Estimation for Anchor-Free Object Detection [48.931731695431374]
There are several limitations of the existing uncertainty estimation methods for anchor-based object detection.
We propose a new localization uncertainty estimation method called UAD for anchor-free object detection.
Our method captures the uncertainty in four directions of box offsets that are homogeneous, so that it can tell which direction is uncertain.
arXiv Detail & Related papers (2020-06-28T13:49:30Z) - Fine-Grained Analysis of Stability and Generalization for Stochastic
Gradient Descent [55.85456985750134]
We introduce a new stability measure called on-average model stability, for which we develop novel bounds controlled by the risks of SGD iterates.
This yields generalization bounds depending on the behavior of the best model, and leads to the first-ever-known fast bounds in the low-noise setting.
To our best knowledge, this gives the firstever-known stability and generalization for SGD with even non-differentiable loss functions.
arXiv Detail & Related papers (2020-06-15T06:30:19Z) - Scope Head for Accurate Localization in Object Detection [135.9979405835606]
We propose a novel detector coined as ScopeNet, which models anchors of each location as a mutually dependent relationship.
With our concise and effective design, the proposed ScopeNet achieves state-of-the-art results on COCO.
arXiv Detail & Related papers (2020-05-11T04:00:09Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.