Illumination and Temperature-Aware Multispectral Networks for
Edge-Computing-Enabled Pedestrian Detection
- URL: http://arxiv.org/abs/2112.05053v1
- Date: Thu, 9 Dec 2021 17:27:23 GMT
- Title: Illumination and Temperature-Aware Multispectral Networks for
Edge-Computing-Enabled Pedestrian Detection
- Authors: Yifan Zhuang, Ziyuan Pu, Jia Hu, Yinhai Wang
- Abstract summary: This study proposes a lightweight Illumination and Temperature-aware Multispectral Network (IT-MN) for accurate and efficient pedestrian detection.
The proposed algorithm is evaluated by comparing with the selected state-of-the-art algorithms using a public dataset collected by in-vehicle cameras.
The results show that the proposed algorithm achieves a low miss rate and inference time at 14.19% and 0.03 seconds per image pair on GPU.
- Score: 10.454696553567809
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Accurate and efficient pedestrian detection is crucial for the intelligent
transportation system regarding pedestrian safety and mobility, e.g., Advanced
Driver Assistance Systems, and smart pedestrian crosswalk systems. Among all
pedestrian detection methods, vision-based detection method is demonstrated to
be the most effective in previous studies. However, the existing vision-based
pedestrian detection algorithms still have two limitations that restrict their
implementations, those being real-time performance as well as the resistance to
the impacts of environmental factors, e.g., low illumination conditions. To
address these issues, this study proposes a lightweight Illumination and
Temperature-aware Multispectral Network (IT-MN) for accurate and efficient
pedestrian detection. The proposed IT-MN is an efficient one-stage detector.
For accommodating the impacts of environmental factors and enhancing the
sensing accuracy, thermal image data is fused by the proposed IT-MN with visual
images to enrich useful information when visual image quality is limited. In
addition, an innovative and effective late fusion strategy is also developed to
optimize the image fusion performance. To make the proposed model implementable
for edge computing, the model quantization is applied to reduce the model size
by 75% while shortening the inference time significantly. The proposed
algorithm is evaluated by comparing with the selected state-of-the-art
algorithms using a public dataset collected by in-vehicle cameras. The results
show that the proposed algorithm achieves a low miss rate and inference time at
14.19% and 0.03 seconds per image pair on GPU. Besides, the quantized IT-MN
achieves an inference time of 0.21 seconds per image pair on the edge device,
which also demonstrates the potentiality of deploying the proposed model on
edge devices as a highly efficient pedestrian detection algorithm.
Related papers
- MambaST: A Plug-and-Play Cross-Spectral Spatial-Temporal Fuser for Efficient Pedestrian Detection [0.5898893619901381]
This paper proposes MambaST, a plug-and-play cross-spectral spatial-temporal fusion pipeline for efficient pedestrian detection.
It is difficult to perform accurate detection using RGB cameras under dark or low-light conditions.
Our proposed model also achieves superior performance on small-scale pedestrian detection.
arXiv Detail & Related papers (2024-08-02T06:20:48Z) - Optimizing ship detection efficiency in SAR images [12.829941550630776]
The speed and compute cost of vessel detection are essential for a timely intervention to prevent illegal fishing.
We trained an object detection model based on a convolutional neural network (CNN) using a dataset of satellite images.
We show that by using a classification model the average precision of the detection model can be approximated to 99.5% in 44% of the time or to 92.7% in 25% of the time.
arXiv Detail & Related papers (2022-12-12T12:04:10Z) - Real-Time Driver Monitoring Systems through Modality and View Analysis [28.18784311981388]
Driver distractions are known to be the dominant cause of road accidents.
State-of-the-art methods prioritize accuracy while ignoring latency.
We propose time-effective detection models by neglecting the temporal relation between video frames.
arXiv Detail & Related papers (2022-10-17T21:22:41Z) - ETAD: A Unified Framework for Efficient Temporal Action Detection [70.21104995731085]
Untrimmed video understanding such as temporal action detection (TAD) often suffers from the pain of huge demand for computing resources.
We build a unified framework for efficient end-to-end temporal action detection (ETAD)
ETAD achieves state-of-the-art performance on both THUMOS-14 and ActivityNet-1.3.
arXiv Detail & Related papers (2022-05-14T21:16:21Z) - Meta-UDA: Unsupervised Domain Adaptive Thermal Object Detection using
Meta-Learning [64.92447072894055]
Infrared (IR) cameras are robust under adverse illumination and lighting conditions.
We propose an algorithm meta-learning framework to improve existing UDA methods.
We produce a state-of-the-art thermal detector for the KAIST and DSIAC datasets.
arXiv Detail & Related papers (2021-10-07T02:28:18Z) - An Efficient Approach for Anomaly Detection in Traffic Videos [30.83924581439373]
We propose an efficient approach for a video anomaly detection system which is capable of running at the edge devices.
The proposed approach comprises a pre-processing module that detects changes in the scene and removes the corrupted frames.
We also propose a sequential change detection algorithm that can quickly adapt to a new scene and detect changes in the similarity statistic.
arXiv Detail & Related papers (2021-04-20T04:43:18Z) - Anchor-free Small-scale Multispectral Pedestrian Detection [88.7497134369344]
We propose a method for effective and efficient multispectral fusion of the two modalities in an adapted single-stage anchor-free base architecture.
We aim at learning pedestrian representations based on object center and scale rather than direct bounding box predictions.
Results show our method's effectiveness in detecting small-scaled pedestrians.
arXiv Detail & Related papers (2020-08-19T13:13:01Z) - Identity-Aware Attribute Recognition via Real-Time Distributed Inference
in Mobile Edge Clouds [53.07042574352251]
We design novel models for pedestrian attribute recognition with re-ID in an MEC-enabled camera monitoring system.
We propose a novel inference framework with a set of distributed modules, by jointly considering the attribute recognition and person re-ID.
We then devise a learning-based algorithm for the distributions of the modules of the proposed distributed inference framework.
arXiv Detail & Related papers (2020-08-12T12:03:27Z) - SADet: Learning An Efficient and Accurate Pedestrian Detector [68.66857832440897]
This paper proposes a series of systematic optimization strategies for the detection pipeline of one-stage detector.
It forms a single shot anchor-based detector (SADet) for efficient and accurate pedestrian detection.
Though structurally simple, it presents state-of-the-art result and real-time speed of $20$ FPS for VGA-resolution images.
arXiv Detail & Related papers (2020-07-26T12:32:38Z) - End-to-end Learning for Inter-Vehicle Distance and Relative Velocity
Estimation in ADAS with a Monocular Camera [81.66569124029313]
We propose a camera-based inter-vehicle distance and relative velocity estimation method based on end-to-end training of a deep neural network.
The key novelty of our method is the integration of multiple visual clues provided by any two time-consecutive monocular frames.
We also propose a vehicle-centric sampling mechanism to alleviate the effect of perspective distortion in the motion field.
arXiv Detail & Related papers (2020-06-07T08:18:31Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.