Anchor-free Small-scale Multispectral Pedestrian Detection
- URL: http://arxiv.org/abs/2008.08418v2
- Date: Thu, 20 Aug 2020 15:01:59 GMT
- Title: Anchor-free Small-scale Multispectral Pedestrian Detection
- Authors: Alexander Wolpert, Michael Teutsch, M. Saquib Sarfraz, Rainer
Stiefelhagen
- Abstract summary: We propose a method for effective and efficient multispectral fusion of the two modalities in an adapted single-stage anchor-free base architecture.
We aim at learning pedestrian representations based on object center and scale rather than direct bounding box predictions.
Results show our method's effectiveness in detecting small-scaled pedestrians.
- Score: 88.7497134369344
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Multispectral images consisting of aligned visual-optical (VIS) and thermal
infrared (IR) image pairs are well-suited for practical applications like
autonomous driving or visual surveillance. Such data can be used to increase
the performance of pedestrian detection especially for weakly illuminated,
small-scaled, or partially occluded instances. The current state-of-the-art is
based on variants of Faster R-CNN and thus passes through two stages: a
proposal generator network with handcrafted anchor boxes for object
localization and a classification network for verifying the object category. In
this paper we propose a method for effective and efficient multispectral fusion
of the two modalities in an adapted single-stage anchor-free base architecture.
We aim at learning pedestrian representations based on object center and scale
rather than direct bounding box predictions. In this way, we can both simplify
the network architecture and achieve higher detection performance, especially
for pedestrians under occlusion or at low object resolution. In addition, we
provide a study on well-suited multispectral data augmentation techniques that
improve the commonly used augmentations. The results show our method's
effectiveness in detecting small-scaled pedestrians. We achieve 5.68%
log-average miss rate in comparison to the best current state-of-the-art of
7.49% (25% improvement) on the challenging KAIST Multispectral Pedestrian
Detection Benchmark.
Code: https://github.com/HensoldtOptronicsCV/MultispectralPedestrianDetection
Related papers
- A Dual Attentive Generative Adversarial Network for Remote Sensing Image
Change Detection [6.906936669510404]
We propose a dual attentive generative adversarial network for achieving very high-resolution remote sensing image change detection tasks.
The DAGAN framework has better performance with 85.01% mean IoU and 91.48% mean F1 score than advanced methods on the LEVIR dataset.
arXiv Detail & Related papers (2023-10-03T08:26:27Z) - A Multi-task Framework for Infrared Small Target Detection and
Segmentation [9.033048310220346]
We propose a novel end-to-end framework for infrared small target detection and segmentation.
We use UNet as the backbone to maintain resolution and semantic information.
We develop a multi-task framework for infrared small target detection and segmentation.
arXiv Detail & Related papers (2022-06-14T15:43:34Z) - Activation to Saliency: Forming High-Quality Labels for Unsupervised
Salient Object Detection [54.92703325989853]
We propose a two-stage Activation-to-Saliency (A2S) framework that effectively generates high-quality saliency cues.
No human annotations are involved in our framework during the whole training process.
Our framework reports significant performance compared with existing USOD methods.
arXiv Detail & Related papers (2021-12-07T11:54:06Z) - Enhancing Object Detection for Autonomous Driving by Optimizing Anchor
Generation and Addressing Class Imbalance [0.0]
This study presents an enhanced 2D object detector based on Faster R-CNN that is better suited for the context of autonomous vehicles.
The proposed modifications over the Faster R-CNN do not increase computational cost and can easily be extended to optimize other anchor-based detection frameworks.
arXiv Detail & Related papers (2021-04-08T16:58:31Z) - Bidirectional Multi-scale Attention Networks for Semantic Segmentation
of Oblique UAV Imagery [30.524771772192757]
We propose the novel bidirectional multi-scale attention networks, which fuse features from multiple scales bidirectionally for more adaptive and effective feature extraction.
Our model achieved the state-of-the-art (SOTA) result with a mean intersection over union (mIoU) score of 70.80%.
arXiv Detail & Related papers (2021-02-05T11:02:15Z) - MRDet: A Multi-Head Network for Accurate Oriented Object Detection in
Aerial Images [51.227489316673484]
We propose an arbitrary-oriented region proposal network (AO-RPN) to generate oriented proposals transformed from horizontal anchors.
To obtain accurate bounding boxes, we decouple the detection task into multiple subtasks and propose a multi-head network.
Each head is specially designed to learn the features optimal for the corresponding task, which allows our network to detect objects accurately.
arXiv Detail & Related papers (2020-12-24T06:36:48Z) - One-Shot Object Detection without Fine-Tuning [62.39210447209698]
We introduce a two-stage model consisting of a first stage Matching-FCOS network and a second stage Structure-Aware Relation Module.
We also propose novel training strategies that effectively improve detection performance.
Our method exceeds the state-of-the-art one-shot performance consistently on multiple datasets.
arXiv Detail & Related papers (2020-05-08T01:59:23Z) - FairMOT: On the Fairness of Detection and Re-Identification in Multiple
Object Tracking [92.48078680697311]
Multi-object tracking (MOT) is an important problem in computer vision.
We present a simple yet effective approach termed as FairMOT based on the anchor-free object detection architecture CenterNet.
The approach achieves high accuracy for both detection and tracking.
arXiv Detail & Related papers (2020-04-04T08:18:00Z) - Depthwise Non-local Module for Fast Salient Object Detection Using a
Single Thread [136.2224792151324]
We propose a new deep learning algorithm for fast salient object detection.
The proposed algorithm achieves competitive accuracy and high inference efficiency simultaneously with a single CPU thread.
arXiv Detail & Related papers (2020-01-22T15:23:48Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.