R-TOD: Real-Time Object Detector with Minimized End-to-End Delay for
  Autonomous Driving
- URL: http://arxiv.org/abs/2011.06372v1
- Date: Fri, 23 Oct 2020 01:03:46 GMT
- Title: R-TOD: Real-Time Object Detector with Minimized End-to-End Delay for
  Autonomous Driving
- Authors: Wonseok Jang, Hansaem Jeong, Kyungtae Kang, Nikil Dutt, Jong-Chan Kim
- Abstract summary: This paper aims to provide more comprehensive understanding of the end-to-end delay.
Three optimization methods are implemented: (i) on-demand capture, (ii) zero-slack pipeline, and (iii) contention-free pipeline.
Our experimental results show a 76% reduction in the end-to-end delay of Darknet YOLO v3.
- Score: 3.366875318492424
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: For realizing safe autonomous driving, the end-to-end delays of real-time
object detection systems should be thoroughly analyzed and minimized. However,
despite recent development of neural networks with minimized inference delays,
surprisingly little attention has been paid to their end-to-end delays from an
object's appearance until its detection is reported. With this motivation, this
paper aims to provide more comprehensive understanding of the end-to-end delay,
through which precise best- and worst-case delay predictions are formulated,
and three optimization methods are implemented: (i) on-demand capture, (ii)
zero-slack pipeline, and (iii) contention-free pipeline. Our experimental
results show a 76% reduction in the end-to-end delay of Darknet YOLO (You Only
Look Once) v3 (from 1070 ms to 261 ms), thereby demonstrating the great
potential of exploiting the end-to-end delay analysis for autonomous driving.
Furthermore, as we only modify the system architecture and do not change the
neural network architecture itself, our approach incurs no penalty on the
detection accuracy.
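
The reported 76% figure can be checked directly from the two delays quoted in the abstract. The following is a minimal sketch of that arithmetic only; the function name `relative_reduction` and the variable names are illustrative assumptions, not anything from the paper or its code.

```python
def relative_reduction(before_ms: float, after_ms: float) -> float:
    """Return the fractional reduction when a delay drops from before_ms to after_ms."""
    if before_ms <= 0:
        raise ValueError("before_ms must be positive")
    return (before_ms - after_ms) / before_ms


# Both values are the end-to-end delays quoted in the abstract.
baseline_ms = 1070.0   # Darknet YOLO v3 before the optimizations
optimized_ms = 261.0   # after the three optimizations

reduction = relative_reduction(baseline_ms, optimized_ms)
print(f"{reduction:.1%}")  # prints 75.6%, which rounds to the reported 76%
```

The absolute saving is 809 ms, so the rounded 76% in the abstract is consistent with the quoted delays.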
 
      
Related papers
- On the Fragility of Multimodal Perception to Temporal Misalignment in Autonomous Driving [26.809693071623272]
 We introduce DejaVu, a novel attack that exploits network-induced delays to create subtle temporal misalignments across sensor streams.
With a single-frame LiDAR delay, an attacker can reduce the car detection mAP by up to 88.5%, while with a three-frame camera delay, multiple object tracking accuracy (MOTA) for car drops by 73%.
We propose AION, a patch that can work alongside the existing perception model to monitor temporal alignment through cross-modal temporal consistency.
 arXiv  Detail & Related papers  (2025-07-12T00:44:26Z)
- CorrDiff: Adaptive Delay-aware Detector with Temporal Cue Inputs for Real-time Object Detection [11.714072240331518]
 CorrDiff is designed to tackle the challenge of delays in real-time detection systems.
It is able to utilize runtime-estimated temporal cues to predict objects' locations for multiple future frames.
It meets the stringent real-time processing requirements on all kinds of devices.
 arXiv  Detail & Related papers  (2025-01-09T10:34:25Z)
- VALO: A Versatile Anytime Framework for LiDAR-based Object Detection Deep Neural Networks [4.953750672237398]
 This work addresses the challenge of adapting to dynamic deadline requirements for LiDAR object detection deep neural networks (DNNs).
We introduce VALO (Versatile Anytime algorithm for LiDAR Object detection), a novel data-centric approach that enables anytime computing of 3D LiDAR object detection DNNs.
We implement VALO on state-of-the-art 3D LiDAR object detection networks, namely CenterPoint and VoxelNext, and demonstrate its dynamic adaptability to a wide range of time constraints.
 arXiv  Detail & Related papers  (2024-09-17T20:30:35Z)
- PNAS-MOT: Multi-Modal Object Tracking with Pareto Neural Architecture Search [64.28335667655129]
 Multiple object tracking is a critical task in autonomous driving.
As tracking accuracy improves, neural networks become increasingly complex, posing challenges for their practical application in real driving scenarios due to the high level of latency.
In this paper, we explore the use of the neural architecture search (NAS) methods to search for efficient architectures for tracking, aiming for low real-time latency while maintaining relatively high accuracy.
 arXiv  Detail & Related papers  (2024-03-23T04:18:49Z)
- MTD: Multi-Timestep Detector for Delayed Streaming Perception [0.5439020425819]
 Streaming perception is a task of reporting the current state of the world, which is used to evaluate the delay and accuracy of autonomous driving systems.
This paper proposes the Multi-Timestep Detector (MTD), an end-to-end detector which uses dynamic routing for multi-branch future prediction.
The proposed method has been evaluated on the Argoverse-HD dataset, and the experimental results show that it has achieved state-of-the-art performance across various delay settings.
 arXiv  Detail & Related papers  (2023-09-13T06:23:58Z)
- MonoPIC -- A Monocular Low-Latency Pedestrian Intention Classification
  Framework for IoT Edges Using ID3 Modelled Decision Trees [0.0]
 We propose an algorithm that classifies the intent of a single arbitrarily chosen pedestrian in a two dimensional frame into logic states.
This bypasses the need to employ any relatively high latency deep-learning algorithms.
The model was able to achieve an average testing accuracy of 83.56% with a reliable variance of 0.0042 while operating with an average latency of 48 milliseconds.
 arXiv  Detail & Related papers  (2023-04-01T02:42:24Z)
- Achieving Real-Time Object Detection on Mobile Devices with Neural
  Pruning Search [45.20331644857981]
 We propose a compiler-aware neural pruning search framework to achieve high-speed inference on autonomous vehicles for 2D and 3D object detection.
For the first time, the proposed method achieves (close-to) real-time inference, with 55ms and 99ms inference times for YOLOv4-based 2D object detection and PointPillars-based 3D detection, respectively.
 arXiv  Detail & Related papers  (2021-06-28T18:59:20Z)
- Efficient and Robust LiDAR-Based End-to-End Navigation [132.52661670308606]
 We present an efficient and robust LiDAR-based end-to-end navigation framework.
We propose Fast-LiDARNet that is based on sparse convolution kernel optimization and hardware-aware model design.
We then propose Hybrid Evidential Fusion that directly estimates the uncertainty of the prediction from only a single forward pass.
 arXiv  Detail & Related papers  (2021-05-20T17:52:37Z)
- Achieving Real-Time LiDAR 3D Object Detection on a Mobile Device [53.323878851563414]
 We propose a compiler-aware unified framework incorporating network enhancement and pruning search with the reinforcement learning techniques.
Specifically, a generator Recurrent Neural Network (RNN) is employed to provide the unified scheme for both network enhancement and pruning search automatically.
The proposed framework achieves real-time 3D object detection on mobile devices with competitive detection performance.
 arXiv  Detail & Related papers  (2020-12-26T19:41:15Z)
- FastEmit: Low-latency Streaming ASR with Sequence-level Emission
  Regularization [78.46088089185156]
 Streaming automatic speech recognition (ASR) aims to emit each hypothesized word as quickly and accurately as possible.
Existing approaches penalize emission delay by manipulating per-token or per-frame probability prediction in sequence transducer models.
We propose a sequence-level emission regularization method, named FastEmit, that applies latency regularization directly on per-sequence probability in training transducer models.
 arXiv  Detail & Related papers  (2020-10-21T17:05:01Z)
- PnPNet: End-to-End Perception and Prediction with Tracking in the Loop [82.97006521937101]
 We tackle the problem of joint perception and motion forecasting in the context of self-driving vehicles.
We propose PnPNet, an end-to-end model that takes sensor data as input and outputs, at each time step, object tracks and their future trajectories.
 arXiv  Detail & Related papers  (2020-05-29T17:57:25Z)
- Streaming Object Detection for 3-D Point Clouds [29.465873948076766]
 LiDAR provides a prominent sensory modality that informs many existing perceptual systems.
The latency for perceptual systems based on point cloud data can be dominated by the amount of time for a complete rotational scan.
We show how operating on LiDAR data in its native streaming formulation offers several advantages for self driving object detection.
 arXiv  Detail & Related papers  (2020-05-04T21:55:15Z)
- Adaptive Anomaly Detection for IoT Data in Hierarchical Edge Computing [71.86955275376604]
 We propose an adaptive anomaly detection approach for hierarchical edge computing (HEC) systems.
We design an adaptive scheme to select one of the models based on the contextual information extracted from input data, to perform anomaly detection.
We evaluate our proposed approach using a real IoT dataset, and demonstrate that it reduces detection delay by 84% while maintaining almost the same accuracy as compared to offloading detection tasks to the cloud.
 arXiv  Detail & Related papers  (2020-01-10T05:29:17Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
       
     