Confluence: A Robust Non-IoU Alternative to Non-Maxima Suppression in
Object Detection
- URL: http://arxiv.org/abs/2012.00257v1
- Date: Tue, 1 Dec 2020 04:22:01 GMT
- Title: Confluence: A Robust Non-IoU Alternative to Non-Maxima Suppression in
Object Detection
- Authors: Andrew Shepley, Greg Falzon, Paul Kwan
- Abstract summary: Confluence is an algorithm which does not rely solely on individual confidence scores to select optimal bounding boxes.
It is experimentally validated on RetinaNet, YOLOv3 and Mask-RCNN, using both the MS COCO and PASCAL VOC 2007 datasets.
Confluence outperforms Greedy NMS in both mAP and recall on both datasets.
- Score: 0.570896453969985
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: This paper presents a novel alternative to Greedy Non-Maxima Suppression
(NMS) in the task of bounding box selection and suppression in object
detection. It proposes Confluence, an algorithm which does not rely solely on
individual confidence scores to select optimal bounding boxes, nor does it rely
on Intersection Over Union (IoU) to remove false positives. Using Manhattan
Distance, it selects the bounding box which is closest to every other bounding
box within the cluster and removes highly confluent neighboring boxes. Thus,
Confluence represents a paradigm shift in bounding box selection and
suppression as it is based on fundamentally different theoretical principles to
Greedy NMS and its variants. Confluence is experimentally validated on
RetinaNet, YOLOv3 and Mask-RCNN, using both the MS COCO and PASCAL VOC 2007
datasets. Confluence outperforms Greedy NMS in both mAP and recall on both
datasets, using the challenging 0.50:0.95 mAP evaluation metric. On each
detector and dataset, mAP was improved by 0.3-0.7% while recall was improved by
1.4-2.5%. A theoretical comparison of Greedy NMS and the Confluence Algorithm
is provided, and quantitative results are supported by extensive qualitative
results analysis. Furthermore, sensitivity analysis experiments across mAP
thresholds support the conclusion that Confluence is more robust than NMS.
Related papers
- Theoretically Achieving Continuous Representation of Oriented Bounding Boxes [64.15627958879053]
This paper endeavors to completely solve the issue of discontinuity in Oriented Bounding Box representation.
We propose a novel representation method called Continuous OBB (COBB) which can be readily integrated into existing detectors.
For fairness and transparency of experiments, we have developed a modularized benchmark based on the open-source deep learning framework Jittor's detection toolbox JDet for OOD evaluation.
arXiv Detail & Related papers (2024-02-29T09:27:40Z) - Fast Semisupervised Unmixing Using Nonconvex Optimization [80.11512905623417]
We introduce a novel convex convex model for semi/library-based unmixing.
We demonstrate the efficacy of Alternating Methods of sparse unsupervised unmixing.
arXiv Detail & Related papers (2024-01-23T10:07:41Z) - Fuzzy-NMS: Improving 3D Object Detection with Fuzzy Classification in
NMS [19.452760776980472]
Non-maximum suppression (NMS) is an essential post-processing module used in many 3D object detection frameworks.
We introduce fuzzy learning into NMS and propose a novel generalized Fuzzy-NMS module to achieve finer candidate bounding box filtering.
arXiv Detail & Related papers (2023-10-21T09:09:03Z) - DIFUSCO: Graph-based Diffusion Solvers for Combinatorial Optimization [51.517956081644186]
We introduce a new graph-based diffusion framework, namely DIFUSCO.
Our framework casts NPC problems as discrete 0, 1-vector optimization problems.
For the MIS problem, DIFUSCO outperforms the previous state-of-the-art neural solver on the challenging SATLIB benchmark.
arXiv Detail & Related papers (2023-02-16T11:13:36Z) - Differentially Private Federated Clustering over Non-IID Data [59.611244450530315]
clustering clusters (FedC) problem aims to accurately partition unlabeled data samples distributed over massive clients into finite clients under the orchestration of a server.
We propose a novel FedC algorithm using differential privacy convergence technique, referred to as DP-Fed, in which partial participation and multiple clients are also considered.
Various attributes of the proposed DP-Fed are obtained through theoretical analyses of privacy protection, especially for the case of non-identically and independently distributed (non-i.i.d.) data.
arXiv Detail & Related papers (2023-01-03T05:38:43Z) - A Large-scale Multiple-objective Method for Black-box Attack against
Object Detection [70.00150794625053]
We propose to minimize the true positive rate and maximize the false positive rate, which can encourage more false positive objects to block the generation of new true positive bounding boxes.
We extend the standard Genetic Algorithm with Random Subset selection and Divide-and-Conquer, called GARSDC, which significantly improves the efficiency.
Compared with the state-of-art attack methods, GARSDC decreases by an average 12.0 in the mAP and queries by about 1000 times in extensive experiments.
arXiv Detail & Related papers (2022-09-16T08:36:42Z) - Confidence Propagation Cluster: Unleash Full Potential of Object
Detectors [8.996530151621661]
Most object detection methods obtain objects by using the non-maximum suppression (NMS) and its improved versions like Soft-NMS to remove redundant bounding boxes.
We challenge those NMS-based methods from three aspects: 1) The bounding box with highest confidence value may not be the true positive having the biggest overlap with the ground-truth box, 2) Not only suppression is required for redundant boxes, but also confidence enhancement is needed for those true positives, and 3) Sorting candidate boxes by confidence values is not necessary so that full parallelism is achievable.
In this paper, inspired by belief propagation (BP), we propose the
arXiv Detail & Related papers (2021-12-01T08:22:00Z) - Localization Uncertainty-Based Attention for Object Detection [8.154943252001848]
We propose a more efficient uncertainty-aware dense detector (UADET) that predicts four-directional localization uncertainties via Gaussian modeling.
Experiments using the MS COCO benchmark show that our UADET consistently surpasses baseline FCOS, and that our best model, ResNext-64x4d-101-DCN, obtains a single model, single-scale AP of 48.3% on COCO test-dev.
arXiv Detail & Related papers (2021-08-25T04:32:39Z) - NMS-Loss: Learning with Non-Maximum Suppression for Crowded Pedestrian
Detection [39.417540296897194]
We propose a novel NMS-Loss making the NMS procedure can be trained end-to-end without any additional network parameters.
Our NMS-Loss punishes two cases when FP is not suppressed and FN is wrongly eliminated by NMS.
With the help of NMS-Loss, our detector, namely NMS-Ped, achieves impressive results with Miss Rate of 5.92% on Caltech dataset and 10.08% on CityPersons dataset.
arXiv Detail & Related papers (2021-06-04T12:06:46Z) - LSDAT: Low-Rank and Sparse Decomposition for Decision-based Adversarial
Attack [74.5144793386864]
LSDAT crafts perturbations in the low-dimensional subspace formed by the sparse component of the input sample and that of an adversarial sample.
LSD works directly in the image pixel domain to guarantee that non-$ell$ constraints, such as sparsity, are satisfied.
arXiv Detail & Related papers (2021-03-19T13:10:47Z) - Quantum-soft QUBO Suppression for Accurate Object Detection [8.871042314510788]
Non-maximum suppression (NMS) has been adopted by default for removing redundant object detections for decades.
We propose Quantum-soft QUBO Suppression (QSQS) algorithm for fast and accurate detection by exploiting quantum computing advantages.
arXiv Detail & Related papers (2020-07-28T05:12:51Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.