StrObe: Streaming Object Detection from LiDAR Packets
- URL: http://arxiv.org/abs/2011.06425v2
- Date: Fri, 13 Nov 2020 17:59:04 GMT
- Title: StrObe: Streaming Object Detection from LiDAR Packets
- Authors: Davi Frossard, Simon Suo, Sergio Casas, James Tu, Rui Hu, Raquel
Urtasun
- Abstract summary: Rolling shutter LiDARs emitted as a stream of packets, each covering a sector of the 360deg coverage.
Modern perception algorithms wait for the full sweep to be built before processing the data, which introduces an additional latency.
In this paper we propose StrObe, a novel approach that minimizes latency by ingesting LiDAR packets and emitting a stream of detections without waiting for the full sweep to be built.
- Score: 73.27333924964306
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Many modern robotics systems employ LiDAR as their main sensing modality due
to its geometrical richness. Rolling shutter LiDARs are particularly common, in
which an array of lasers scans the scene from a rotating base. Points are
emitted as a stream of packets, each covering a sector of the 360{\deg}
coverage. Modern perception algorithms wait for the full sweep to be built
before processing the data, which introduces an additional latency. For typical
10Hz LiDARs this will be 100ms. As a consequence, by the time an output is
produced, it no longer accurately reflects the state of the world. This poses a
challenge, as robotics applications require minimal reaction times, such that
maneuvers can be quickly planned in the event of a safety-critical situation.
In this paper we propose StrObe, a novel approach that minimizes latency by
ingesting LiDAR packets and emitting a stream of detections without waiting for
the full sweep to be built. StrObe reuses computations from previous packets
and iteratively updates a latent spatial representation of the scene, which
acts as a memory, as new evidence comes in, resulting in accurate low-latency
perception. We demonstrate the effectiveness of our approach on a large scale
real-world dataset, showing that StrObe far outperforms the state-of-the-art
when latency is taken into account, and matches the performance in the
traditional setting.
Related papers
- Characterization of point-source transient events with a rolling-shutter compressed sensing system [0.0]
Point-source transient events (PSTEs) pose several challenges to an imaging system.
Traditional imaging systems that meet these requirements are costly in terms of price, size, weight, power consumption, and data bandwidth.
We develop a novel compressed sensing algorithm adapted to the rolling shutter readout of an imaging system.
arXiv Detail & Related papers (2024-08-29T19:22:37Z) - Semi-supervised Open-World Object Detection [74.95267079505145]
We introduce a more realistic formulation, named semi-supervised open-world detection (SS-OWOD)
We demonstrate that the performance of the state-of-the-art OWOD detector dramatically deteriorates in the proposed SS-OWOD setting.
Our experiments on 4 datasets including MS COCO, PASCAL, Objects365 and DOTA demonstrate the effectiveness of our approach.
arXiv Detail & Related papers (2024-02-25T07:12:51Z) - TimePillars: Temporally-Recurrent 3D LiDAR Object Detection [8.955064958311517]
TimePillars is a temporally-recurrent object detection pipeline.
It exploits the pillar representation of LiDAR data across time.
We show how basic building blocks are enough to achieve robust and efficient results.
arXiv Detail & Related papers (2023-12-22T10:25:27Z) - UnLoc: A Universal Localization Method for Autonomous Vehicles using
LiDAR, Radar and/or Camera Input [51.150605800173366]
UnLoc is a novel unified neural modeling approach for localization with multi-sensor input in all weather conditions.
Our method is extensively evaluated on Oxford Radar RobotCar, ApolloSouthBay and Perth-WA datasets.
arXiv Detail & Related papers (2023-07-03T04:10:55Z) - Multi-modal Streaming 3D Object Detection [20.01800869678355]
We propose an innovative camera-LiDAR streaming 3D object detection framework.
It uses camera images instead of past LiDAR slices to provide an up-to-date, dense, and wide context for streaming perception.
Our method is shown to be robust to missing camera images, narrow LiDAR slices, and small camera-LiDAR miscalibration.
arXiv Detail & Related papers (2022-09-12T00:30:52Z) - Real-Time GPU-Accelerated Machine Learning Based Multiuser Detection for
5G and Beyond [70.81551587109833]
nonlinear beamforming filters can significantly outperform linear approaches in stationary scenarios with massive connectivity.
One of the main challenges comes from the real-time implementation of these algorithms.
This paper explores the acceleration of APSM-based algorithms through massive parallelization.
arXiv Detail & Related papers (2022-01-13T15:20:45Z) - Rethinking Drone-Based Search and Rescue with Aerial Person Detection [79.76669658740902]
The visual inspection of aerial drone footage is an integral part of land search and rescue (SAR) operations today.
We propose a novel deep learning algorithm to automate this aerial person detection (APD) task.
We present the novel Aerial Inspection RetinaNet (AIR) algorithm as the combination of these contributions.
arXiv Detail & Related papers (2021-11-17T21:48:31Z) - FOVEA: Foveated Image Magnification for Autonomous Navigation [53.69803081925454]
We propose an attentional approach that elastically magnifies certain regions while maintaining a small input canvas.
Our proposed method boosts the detection AP over standard Faster R-CNN, with and without finetuning.
On the autonomous driving datasets Argoverse-HD and BDD100K, we show our proposed method boosts the detection AP over standard Faster R-CNN, with and without finetuning.
arXiv Detail & Related papers (2021-08-27T03:07:55Z) - Streaming Object Detection for 3-D Point Clouds [29.465873948076766]
LiDAR provides a prominent sensory modality that informs many existing perceptual systems.
The latency for perceptual systems based on point cloud data can be dominated by the amount of time for a complete rotational scan.
We show how operating on LiDAR data in its native streaming formulation offers several advantages for self driving object detection.
arXiv Detail & Related papers (2020-05-04T21:55:15Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.