Related papers: ASAP-NMS: Accelerating Non-Maximum Suppression Using Spatially Aware Priors

ASAP-NMS: Accelerating Non-Maximum Suppression Using Spatially Aware Priors

URL: http://arxiv.org/abs/2007.09785v2
Date: Fri, 21 Aug 2020 15:18:57 GMT
Title: ASAP-NMS: Accelerating Non-Maximum Suppression Using Spatially Aware Priors
Authors: Rohun Tripathi, Vasu Singla, Mahyar Najibi, Bharat Singh, Abhishek Sharma and Larry Davis
Abstract summary: Non Maximum Suppression (or Greedy-NMS) is a crucial module for object-detection pipelines. For the region proposal stage of two/multi-stage detectors, NMS is turning out to be a latency bottleneck due to its sequential nature. We use ASAP-NMS to improve the latency of the NMS step from 13.6ms to 1.2 ms on a CPU without sacrificing the accuracy of a state-of-the-art two-stage detector.
Score: 26.835571059909007
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: The widely adopted sequential variant of Non Maximum Suppression (or Greedy-NMS) is a crucial module for object-detection pipelines. Unfortunately, for the region proposal stage of two/multi-stage detectors, NMS is turning out to be a latency bottleneck due to its sequential nature. In this article, we carefully profile Greedy-NMS iterations to find that a major chunk of computation is wasted in comparing proposals that are already far-away and have a small chance of suppressing each other. We address this issue by comparing only those proposals that are generated from nearby anchors. The translation-invariant property of the anchor lattice affords generation of a lookup table, which provides an efficient access to nearby proposals, during NMS. This leads to an Accelerated NMS algorithm which leverages Spatially Aware Priors, or ASAP-NMS, and improves the latency of the NMS step from 13.6ms to 1.2 ms on a CPU without sacrificing the accuracy of a state-of-the-art two-stage detector on COCO and VOC datasets. Importantly, ASAP-NMS is agnostic to image resolution and can be used as a simple drop-in module during inference. Using ASAP-NMS at run-time only, we obtain an mAP of 44.2\%@25Hz on the COCO dataset with a V100 GPU.

Related papers

Work-Efficient Parallel Non-Maximum Suppression Kernels [8.872464006522929]
Non-Maximum Suppression (NMS) is the process of selecting a single representative candidate within a cluster of detections. We present a highly scalable NMS algorithm for embedded GPU architectures that is designed from scratch to handle workloads featuring thousands of simultaneous detections. Our proposed parallel greedy NMS algorithm yields a 14x-40x speed up when compared to state-of-the-art NMS methods.
arXiv Detail & Related papers (2025-02-01T19:23:13Z)
Accelerating Non-Maximum Suppression: A Graph Theory Perspective [24.34791528442417]
Non-maximum suppression (NMS) is an indispensable post-processing step in object detection. This paper systematically analyzes NMS from a graph theory perspective for the first time, revealing its intrinsic structure. We introduce NMS-Bench, the first benchmark designed to comprehensively assess various NMS methods.
arXiv Detail & Related papers (2024-09-30T17:20:49Z)
Fuzzy-NMS: Improving 3D Object Detection with Fuzzy Classification in NMS [19.452760776980472]
Non-maximum suppression (NMS) is an essential post-processing module used in many 3D object detection frameworks. We introduce fuzzy learning into NMS and propose a novel generalized Fuzzy-NMS module to achieve finer candidate bounding box filtering.
arXiv Detail & Related papers (2023-10-21T09:09:03Z)
ANMS: Asynchronous Non-Maximum Suppression in Event Stream [15.355579943905585]
Non-maximum suppression (NMS) is widely used in frame-based tasks as an essential post-processing algorithm. This paper proposes a general-purpose asynchronous non-maximum suppression pipeline (ANMS) The proposed pipeline extract fine feature stream from the output of original detectors and adapts to the speed of motion.
arXiv Detail & Related papers (2023-03-19T05:33:32Z)
ISDA: Position-Aware Instance Segmentation with Deformable Attention [4.188555841288538]
We propose a novel end-to-end instance segmentation method termed ISDA. It reshapes the task into predicting a set of object masks, which are generated via traditional convolution operation. Thanks to the introduced set-prediction mechanism, the proposed method is NMS-free.
arXiv Detail & Related papers (2022-02-23T12:30:18Z)
PSRR-MaxpoolNMS: Pyramid Shifted MaxpoolNMS with Relationship Recovery [17.704037442897004]
Non-maximum Suppression (NMS) is an essential postprocessing step in modern convolutional neural networks for object detection. The de-facto standard for NMS, namely GreedyNMS, cannot be easily parallelized. MaxpoolNMS is introduced as a parallelizable alternative to GreedyNMS.
arXiv Detail & Related papers (2021-05-27T08:24:21Z)
VL-NMS: Breaking Proposal Bottlenecks in Two-Stage Visual-Language Matching [75.71523183166799]
The prevailing framework for matching multimodal inputs is based on a two-stage process. We argue that these methods overlook an obvious emphmismatch between the roles of proposals in the two stages. We propose VL-NMS, which is the first method to yield query-aware proposals at the first stage.
arXiv Detail & Related papers (2021-05-12T13:05:25Z)
Object Detection Made Simpler by Eliminating Heuristic NMS [70.93004137521946]
We show a simple NMS-free, end-to-end object detection framework. We attain on par or even improved detection accuracy compared with the original one-stage detector.
arXiv Detail & Related papers (2021-01-28T02:38:29Z)
End-to-End Object Detection with Fully Convolutional Network [71.56728221604158]
We introduce a Prediction-aware One-To-One (POTO) label assignment for classification to enable end-to-end detection. A simple 3D Max Filtering (3DMF) is proposed to utilize the multi-scale features and improve the discriminability of convolutions in the local region. Our end-to-end framework achieves competitive performance against many state-of-the-art detectors with NMS on COCO and CrowdHuman datasets.
arXiv Detail & Related papers (2020-12-07T09:14:55Z)
Ref-NMS: Breaking Proposal Bottlenecks in Two-Stage Referring Expression Grounding [80.46288064284084]
Ref-NMS is the first method to yield expression-aware proposals at the first stage. Ref-NMS regards all nouns in the expression as critical objects, and introduces a lightweight module to predict a score for aligning each box with a critical object. Since Ref- NMS is agnostic to the grounding step, it can be easily integrated into any state-of-the-art two-stage method.
arXiv Detail & Related papers (2020-09-03T05:04:12Z)
FCOS: A simple and strong anchor-free object detector [111.87691210818194]
We propose a fully convolutional one-stage object detector (FCOS) to solve object detection in a per-pixel prediction fashion. Almost all state-of-the-art object detectors such as RetinaNet, SSD, YOLOv3, and Faster R-CNN rely on pre-defined anchor boxes. In contrast, our proposed detector FCOS is anchor box free, as well as proposal free.
arXiv Detail & Related papers (2020-06-14T01:03:39Z)
Hashing-based Non-Maximum Suppression for Crowded Object Detection [63.761451382081844]
We propose an algorithm, named hashing-based non-maximum suppression (HNMS) to efficiently suppress the non-maximum boxes for object detection. For two-stage detectors, we replace NMS in region proposal network with HNMS, and observe significant speed-up with comparable accuracy. Experiments are conducted on CARPK, SKU-110K, CrowdHuman datasets to demonstrate the efficiency and effectiveness of HNMS.
arXiv Detail & Related papers (2020-05-22T23:45:59Z)

This list is automatically generated from the titles and abstracts of the papers in this site.