FasterX: Real-Time Object Detection Based on Edge GPUs for UAV
Applications
- URL: http://arxiv.org/abs/2209.03157v1
- Date: Wed, 7 Sep 2022 13:52:25 GMT
- Title: FasterX: Real-Time Object Detection Based on Edge GPUs for UAV
Applications
- Authors: Wei Zhou, Xuanlin Min, Rui Hu, Yiwen Long, Huan Luo, and JunYi
- Abstract summary: We propose a novel lightweight deep learning architectures named FasterX based on YOLOX model for real-time object detection on edge GPU.
First, we design an effective and lightweight PixSF head to replace the original head of YOLOX to better detect small objects.
Then, a slimmer structure in the Neck layer termed as SlimFPN is developed to reduce parameters of the network, which is a trade-off between accuracy and speed.
- Score: 16.51060054575739
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Real-time object detection on Unmanned Aerial Vehicles (UAVs) is a
challenging issue due to the limited computing resources of edge GPU devices as
Internet of Things (IoT) nodes. To solve this problem, in this paper, we
propose a novel lightweight deep learning architectures named FasterX based on
YOLOX model for real-time object detection on edge GPU. First, we design an
effective and lightweight PixSF head to replace the original head of YOLOX to
better detect small objects, which can be further embedded in the depthwise
separable convolution (DS Conv) to achieve a lighter head. Then, a slimmer
structure in the Neck layer termed as SlimFPN is developed to reduce parameters
of the network, which is a trade-off between accuracy and speed. Furthermore,
we embed attention module in the Head layer to improve the feature extraction
effect of the prediction head. Meanwhile, we also improve the label assignment
strategy and loss function to alleviate category imbalance and box optimization
problems of the UAV dataset. Finally, auxiliary heads are presented for online
distillation to improve the ability of position embedding and feature
extraction in PixSF head. The performance of our lightweight models are
validated experimentally on the NVIDIA Jetson NX and Jetson Nano GPU embedded
platforms.Extensive experiments show that FasterX models achieve better
trade-off between accuracy and latency on VisDrone2021 dataset compared to
state-of-the-art models.
Related papers
- Task-Oriented Real-time Visual Inference for IoVT Systems: A Co-design Framework of Neural Networks and Edge Deployment [61.20689382879937]
Task-oriented edge computing addresses this by shifting data analysis to the edge.
Existing methods struggle to balance high model performance with low resource consumption.
We propose a novel co-design framework to optimize neural network architecture.
arXiv Detail & Related papers (2024-10-29T19:02:54Z) - DM3D: Distortion-Minimized Weight Pruning for Lossless 3D Object Detection [42.07920565812081]
We propose a novel post-training weight pruning scheme for 3D object detection.
It determines redundant parameters in the pretrained model that lead to minimal distortion in both locality and confidence.
This framework aims to minimize detection distortion of network output to maximally maintain detection precision.
arXiv Detail & Related papers (2024-07-02T09:33:32Z) - Object Detection in Thermal Images Using Deep Learning for Unmanned
Aerial Vehicles [0.9208007322096533]
This work presents a neural network model capable of recognizing small and tiny objects in thermal images collected by unmanned aerial vehicles.
The backbone is developed based on the structure of YOLOv5 combined with the use of a transformer encoder at the end.
The neck includes a BI-FPN block combined with the use of a sliding window and a transformer to increase the information fed into the prediction head.
arXiv Detail & Related papers (2024-02-13T06:40:55Z) - SATAY: A Streaming Architecture Toolflow for Accelerating YOLO Models on
FPGA Devices [48.47320494918925]
This work tackles the challenges of deploying stateof-the-art object detection models onto FPGA devices for ultralow latency applications.
We employ a streaming architecture design for our YOLO accelerators, implementing the complete model on-chip in a deeply pipelined fashion.
We introduce novel hardware components to support the operations of YOLO models in a dataflow manner, and off-chip memory buffering to address the limited on-chip memory resources.
arXiv Detail & Related papers (2023-09-04T13:15:01Z) - EdgeYOLO: An Edge-Real-Time Object Detector [69.41688769991482]
This paper proposes an efficient, low-complexity and anchor-free object detector based on the state-of-the-art YOLO framework.
We develop an enhanced data augmentation method to effectively suppress overfitting during training, and design a hybrid random loss function to improve the detection accuracy of small objects.
Our baseline model can reach the accuracy of 50.6% AP50:95 and 69.8% AP50 in MS 2017 dataset, 26.4% AP50:95 and 44.8% AP50 in VisDrone 2019-DET dataset, and it meets real-time requirements (FPS>=30) on edge-computing device Nvidia
arXiv Detail & Related papers (2023-02-15T06:05:14Z) - Rethinking Voxelization and Classification for 3D Object Detection [68.8204255655161]
The main challenge in 3D object detection from LiDAR point clouds is achieving real-time performance without affecting the reliability of the network.
We present a solution to improve network inference speed and precision at the same time by implementing a fast dynamic voxelizer.
In addition, we propose a lightweight detection sub-head model for classifying predicted objects and filter out false detected objects.
arXiv Detail & Related papers (2023-01-10T16:22:04Z) - DAMO-YOLO : A Report on Real-Time Object Detection Design [19.06518351354291]
We present a fast and accurate object detection method dubbed DAMO-YOLO, which achieves higher performance than the state-of-the-art YOLO series.
We use MAE-NAS, a method guided by the principle of maximum entropy, to search our detection backbone.
In the design of necks and heads, we follow the rule of large neck, small head''
arXiv Detail & Related papers (2022-11-23T17:59:12Z) - Evaluation of Thermal Imaging on Embedded GPU Platforms for Application
in Vehicular Assistance Systems [0.5156484100374058]
This study is focused on evaluating the real-time performance of thermal object detection for smart and safe vehicular systems.
A novel large-scale thermal dataset comprising of > 35,000 distinct frames is acquired.
The effectiveness of trained networks is validated on extensive test data using various quantitative metrics.
arXiv Detail & Related papers (2022-01-05T15:36:25Z) - FastFlowNet: A Lightweight Network for Fast Optical Flow Estimation [81.76975488010213]
Dense optical flow estimation plays a key role in many robotic vision tasks.
Current networks often occupy large number of parameters and require heavy computation costs.
Our proposed FastFlowNet works in the well-known coarse-to-fine manner with following innovations.
arXiv Detail & Related papers (2021-03-08T03:09:37Z) - ASFD: Automatic and Scalable Face Detector [129.82350993748258]
We propose a novel Automatic and Scalable Face Detector (ASFD)
ASFD is based on a combination of neural architecture search techniques as well as a new loss design.
Our ASFD-D6 outperforms the prior strong competitors, and our lightweight ASFD-D0 runs at more than 120 FPS with Mobilenet for VGA-resolution images.
arXiv Detail & Related papers (2020-03-25T06:00:47Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.