Seeing without Looking: Contextual Rescoring of Object Detections for AP
Maximization
- URL: http://arxiv.org/abs/1912.12290v2
- Date: Mon, 30 Mar 2020 16:09:20 GMT
- Title: Seeing without Looking: Contextual Rescoring of Object Detections for AP
Maximization
- Authors: Louren\c{c}o V. Pato, Renato Negrinho, Pedro M. Q. Aguiar
- Abstract summary: We propose to incorporate context in object detection by post-processing the output of an arbitrary detector.
Rescoring is done by conditioning on contextual information from the entire set of detections.
We show that AP can be improved by simply reassigning the detection confidence values.
- Score: 4.346179456029563
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: The majority of current object detectors lack context: class predictions are
made independently from other detections. We propose to incorporate context in
object detection by post-processing the output of an arbitrary detector to
rescore the confidences of its detections. Rescoring is done by conditioning on
contextual information from the entire set of detections: their confidences,
predicted classes, and positions. We show that AP can be improved by simply
reassigning the detection confidence values such that true positives that
survive longer (i.e., those with the correct class and large IoU) are scored
higher than false positives or detections with small IoU. In this setting, we
use a bidirectional RNN with attention for contextual rescoring and introduce a
training target that uses the IoU with ground truth to maximize AP for the
given set of detections. The fact that our approach does not require access to
visual features makes it computationally inexpensive and agnostic to the
detection architecture. In spite of this simplicity, our model consistently
improves AP over strong pre-trained baselines (Cascade R-CNN and Faster R-CNN
with several backbones), particularly by reducing the confidence of duplicate
detections (a learned form of non-maximum suppression) and removing
out-of-context objects by conditioning on the confidences, classes, positions,
and sizes of the co-occurrent detections. Code is available at
https://github.com/LourencoVazPato/seeing-without-looking/
Related papers
- Small Object Detection via Coarse-to-fine Proposal Generation and
Imitation Learning [52.06176253457522]
We propose a two-stage framework tailored for small object detection based on the Coarse-to-fine pipeline and Feature Imitation learning.
CFINet achieves state-of-the-art performance on the large-scale small object detection benchmarks, SODA-D and SODA-A.
arXiv Detail & Related papers (2023-08-18T13:13:09Z) - Label-Efficient Object Detection via Region Proposal Network
Pre-Training [58.50615557874024]
We propose a simple pretext task that provides an effective pre-training for the region proposal network (RPN)
In comparison with multi-stage detectors without RPN pre-training, our approach is able to consistently improve downstream task performance.
arXiv Detail & Related papers (2022-11-16T16:28:18Z) - Detecting tiny objects in aerial images: A normalized Wasserstein
distance and a new benchmark [45.10513110142015]
We propose a new evaluation metric dubbed Normalized Wasserstein Distance (NWD) and a new RanKing-based Assigning (RKA) strategy for tiny object detection.
The proposed NWD-RKA strategy can be easily embedded into all kinds of anchor-based detectors to replace the standard IoU threshold-based one.
Tested on four datasets, NWD-RKA can consistently improve tiny object detection performance by a large margin.
arXiv Detail & Related papers (2022-06-28T13:33:06Z) - Double-Dot Network for Antipodal Grasp Detection [20.21384585441404]
This paper proposes a new deep learning approach to antipodal grasp detection, named Double-Dot Network (DD-Net)
It follows the recent anchor-free object detection framework, which does not depend on empirically pre-set anchors.
An effective CNN architecture is introduced to localize such fingertips, and with the help of auxiliary centers for refinement, it accurately and robustly infers grasp candidates.
arXiv Detail & Related papers (2021-08-03T14:21:17Z) - Robust and Accurate Object Detection via Adversarial Learning [111.36192453882195]
This work augments the fine-tuning stage for object detectors by exploring adversarial examples.
Our approach boosts the performance of state-of-the-art EfficientDets by +1.1 mAP on the object detection benchmark.
arXiv Detail & Related papers (2021-03-23T19:45:26Z) - Detection of Adversarial Supports in Few-shot Classifiers Using Feature
Preserving Autoencoders and Self-Similarity [89.26308254637702]
We propose a detection strategy to highlight adversarial support sets.
We make use of feature preserving autoencoder filtering and also the concept of self-similarity of a support set to perform this detection.
Our method is attack-agnostic and also the first to explore detection for few-shot classifiers to the best of our knowledge.
arXiv Detail & Related papers (2020-12-09T14:13:41Z) - CIA-SSD: Confident IoU-Aware Single-Stage Object Detector From Point
Cloud [40.871850077528485]
We present a new single-stage detector named the Confident IoU-Aware Single-Stage object Detector (CIA-SSD)
CIA-SSD adaptively fuses high-level abstract semantic features and low-level spatial features for accurate predictions of bounding boxes and classification confidence.
It attains top performance in terms of the official ranking metric and above 32 FPS inference speed.
arXiv Detail & Related papers (2020-12-05T12:00:08Z) - Transformer-Encoder Detector Module: Using Context to Improve Robustness
to Adversarial Attacks on Object Detection [12.521662223741673]
This article proposes a new context module that can be applied to an object detector to improve the labeling of object instances.
The proposed model achieves higher mAP, F1 scores and AUC average score of up to 13% compared to the baseline Faster-RCNN detector.
arXiv Detail & Related papers (2020-11-13T15:52:53Z) - Anomaly Detection-Based Unknown Face Presentation Attack Detection [74.4918294453537]
Anomaly detection-based spoof attack detection is a recent development in face Presentation Attack Detection.
In this paper, we present a deep-learning solution for anomaly detection-based spoof attack detection.
The proposed approach benefits from the representation learning power of the CNNs and learns better features for fPAD task.
arXiv Detail & Related papers (2020-07-11T21:20:55Z) - Scope Head for Accurate Localization in Object Detection [135.9979405835606]
We propose a novel detector coined as ScopeNet, which models anchors of each location as a mutually dependent relationship.
With our concise and effective design, the proposed ScopeNet achieves state-of-the-art results on COCO.
arXiv Detail & Related papers (2020-05-11T04:00:09Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.