Evaluation of Thermal Imaging on Embedded GPU Platforms for Application
in Vehicular Assistance Systems
- URL: http://arxiv.org/abs/2201.01661v1
- Date: Wed, 5 Jan 2022 15:36:25 GMT
- Title: Evaluation of Thermal Imaging on Embedded GPU Platforms for Application
in Vehicular Assistance Systems
- Authors: Muhammad Ali Farooq, Waseem Shariff, Peter Corcoran
- Abstract summary: This study is focused on evaluating the real-time performance of thermal object detection for smart and safe vehicular systems.
A novel large-scale thermal dataset comprising of > 35,000 distinct frames is acquired.
The effectiveness of trained networks is validated on extensive test data using various quantitative metrics.
- Score: 0.5156484100374058
- License: http://creativecommons.org/licenses/by-nc-nd/4.0/
- Abstract: This study is focused on evaluating the real-time performance of thermal
object detection for smart and safe vehicular systems by deploying the trained
networks on GPU & single-board EDGE-GPU computing platforms for onboard
automotive sensor suite testing. A novel large-scale thermal dataset comprising
of > 35,000 distinct frames is acquired, processed, and open-sourced in
challenging weather and environmental scenarios. The dataset is a recorded from
lost-cost yet effective uncooled LWIR thermal camera, mounted stand-alone and
on an electric vehicle to minimize mechanical vibrations. State-of-the-art
YOLO-V5 networks variants are trained using four different public datasets as
well newly acquired local dataset for optimal generalization of DNN by
employing SGD optimizer. The effectiveness of trained networks is validated on
extensive test data using various quantitative metrics which include precision,
recall curve, mean average precision, and frames per second. The smaller
network variant of YOLO is further optimized using TensorRT inference
accelerator to explicitly boost the frames per second rate. Optimized network
engine increases the frames per second rate by 3.5 times when testing on low
power edge devices thus achieving 11 fps on Nvidia Jetson Nano and 60 fps on
Nvidia Xavier NX development boards.
Related papers
- EdgeYOLO: An Edge-Real-Time Object Detector [69.41688769991482]
This paper proposes an efficient, low-complexity and anchor-free object detector based on the state-of-the-art YOLO framework.
We develop an enhanced data augmentation method to effectively suppress overfitting during training, and design a hybrid random loss function to improve the detection accuracy of small objects.
Our baseline model can reach the accuracy of 50.6% AP50:95 and 69.8% AP50 in MS 2017 dataset, 26.4% AP50:95 and 44.8% AP50 in VisDrone 2019-DET dataset, and it meets real-time requirements (FPS>=30) on edge-computing device Nvidia
arXiv Detail & Related papers (2023-02-15T06:05:14Z) - Development, Optimization, and Deployment of Thermal Forward Vision
Systems for Advance Vehicular Applications on Edge Devices [0.3058685580689604]
We have proposed a thermal tiny-YOLO multi-class object detection (TTYMOD) system as a smart forward sensing system using an end-to-end YOLO deep learning framework.
The system is trained on large-scale thermal public as well as newly gathered novel open-sourced dataset comprising of more than 35,000 distinct thermal frames.
The efficacy of a thermally tuned nano network is quantified using various qualitative metrics which include mean precision, frames per second rate, and average inference time.
arXiv Detail & Related papers (2023-01-18T15:45:33Z) - Benchmarking Edge Computing Devices for Grape Bunches and Trunks
Detection using Accelerated Object Detection Single Shot MultiBox Deep
Learning Models [2.1922186455344796]
This work benchmarks the performance of different platforms for object detection in real-time.
Authors used the RetinaNet ResNet-50 fine-tuned using the natural Vine dataset.
arXiv Detail & Related papers (2022-11-21T17:02:33Z) - FasterX: Real-Time Object Detection Based on Edge GPUs for UAV
Applications [16.51060054575739]
We propose a novel lightweight deep learning architectures named FasterX based on YOLOX model for real-time object detection on edge GPU.
First, we design an effective and lightweight PixSF head to replace the original head of YOLOX to better detect small objects.
Then, a slimmer structure in the Neck layer termed as SlimFPN is developed to reduce parameters of the network, which is a trade-off between accuracy and speed.
arXiv Detail & Related papers (2022-09-07T13:52:25Z) - A lightweight and accurate YOLO-like network for small target detection
in Aerial Imagery [94.78943497436492]
We present YOLO-S, a simple, fast and efficient network for small target detection.
YOLO-S exploits a small feature extractor based on Darknet20, as well as skip connection, via both bypass and concatenation.
YOLO-S has an 87% decrease of parameter size and almost one half FLOPs of YOLOv3, making practical the deployment for low-power industrial applications.
arXiv Detail & Related papers (2022-04-05T16:29:49Z) - FPGA-optimized Hardware acceleration for Spiking Neural Networks [69.49429223251178]
This work presents the development of a hardware accelerator for an SNN, with off-line training, applied to an image recognition task.
The design targets a Xilinx Artix-7 FPGA, using in total around the 40% of the available hardware resources.
It reduces the classification time by three orders of magnitude, with a small 4.5% impact on the accuracy, if compared to its software, full precision counterpart.
arXiv Detail & Related papers (2022-01-18T13:59:22Z) - Accelerating Training and Inference of Graph Neural Networks with Fast
Sampling and Pipelining [58.10436813430554]
Mini-batch training of graph neural networks (GNNs) requires a lot of computation and data movement.
We argue in favor of performing mini-batch training with neighborhood sampling in a distributed multi-GPU environment.
We present a sequence of improvements to mitigate these bottlenecks, including a performance-engineered neighborhood sampler.
We also conduct an empirical analysis that supports the use of sampling for inference, showing that test accuracies are not materially compromised.
arXiv Detail & Related papers (2021-10-16T02:41:35Z) - Meta-UDA: Unsupervised Domain Adaptive Thermal Object Detection using
Meta-Learning [64.92447072894055]
Infrared (IR) cameras are robust under adverse illumination and lighting conditions.
We propose an algorithm meta-learning framework to improve existing UDA methods.
We produce a state-of-the-art thermal detector for the KAIST and DSIAC datasets.
arXiv Detail & Related papers (2021-10-07T02:28:18Z) - Object Detection in Thermal Spectrum for Advanced Driver-Assistance
Systems (ADAS) [0.5156484100374058]
Object detection in thermal infrared spectrum provides more reliable data source in low-lighting conditions and different weather conditions.
This paper is about exploring and adapting state-of-the-art object and vision framework on thermal vision with seven distinct classes for advanced driver-assistance systems (ADAS)
The trained network variants on public datasets are validated on test data with three different test approaches.
The efficacy of trained networks is tested on locally gathered novel test-data captured with an uncooled LWIR prototype thermal camera.
arXiv Detail & Related papers (2021-09-20T21:38:55Z) - ASFD: Automatic and Scalable Face Detector [129.82350993748258]
We propose a novel Automatic and Scalable Face Detector (ASFD)
ASFD is based on a combination of neural architecture search techniques as well as a new loss design.
Our ASFD-D6 outperforms the prior strong competitors, and our lightweight ASFD-D0 runs at more than 120 FPS with Mobilenet for VGA-resolution images.
arXiv Detail & Related papers (2020-03-25T06:00:47Z) - Learning to Exploit Multiple Vision Modalities by Using Grafted Networks [16.562442770255032]
Novel vision sensors provide information that is not available from conventional intensity cameras.
An obstacle to using these sensors with current powerful deep neural networks is the lack of large labeled training datasets.
This paper proposes a Network Grafting Algorithm (NGA), where a new front end network driven by unconventional visual inputs replaces the front end network of a pretrained deep network that processes intensity frames.
arXiv Detail & Related papers (2020-03-24T16:37:52Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.