LOPR: Latent Occupancy PRediction using Generative Models
- URL: http://arxiv.org/abs/2210.01249v3
- Date: Thu, 24 Aug 2023 17:30:57 GMT
- Title: LOPR: Latent Occupancy PRediction using Generative Models
- Authors: Bernard Lange, Masha Itkina, Mykel J. Kochenderfer
- Abstract summary: LiDAR generated occupancy grid maps (L-OGMs) offer a robust bird's eye-view scene representation.
We propose a framework that decouples occupancy prediction into: representation learning and prediction within the learned latent space.
- Score: 49.15687400958916
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Environment prediction frameworks are integral for autonomous vehicles,
enabling safe navigation in dynamic environments. LiDAR generated occupancy
grid maps (L-OGMs) offer a robust bird's eye-view scene representation that
facilitates joint scene predictions without relying on manual labeling unlike
commonly used trajectory prediction frameworks. Prior approaches have optimized
deterministic L-OGM prediction architectures directly in grid cell space. While
these methods have achieved some degree of success in prediction, they
occasionally grapple with unrealistic and incorrect predictions. We claim that
the quality and realism of the forecasted occupancy grids can be enhanced with
the use of generative models. We propose a framework that decouples occupancy
prediction into: representation learning and stochastic prediction within the
learned latent space. Our approach allows for conditioning the model on other
available sensor modalities such as RGB-cameras and high definition maps. We
demonstrate that our approach achieves state-of-the-art performance and is
readily transferable between different robotic platforms on the real-world
NuScenes, Waymo Open, and a custom dataset we collected on an experimental
vehicle platform.
Related papers
- OPUS: Occupancy Prediction Using a Sparse Set [64.60854562502523]
We present a framework to simultaneously predict occupied locations and classes using a set of learnable queries.
OPUS incorporates a suite of non-trivial strategies to enhance model performance.
Our lightest model achieves superior RayIoU on the Occ3D-nuScenes dataset at near 2x FPS, while our heaviest model surpasses previous best results by 6.1 RayIoU.
arXiv Detail & Related papers (2024-09-14T07:44:22Z) - Self-supervised Multi-future Occupancy Forecasting for Autonomous Driving [45.886941596233974]
LiDAR-generated occupancy grid maps (L-OGMs) offer a robust bird's-eye view for the scene representation.
Our proposed framework performs L-OGM prediction in the latent space of a generative architecture.
We decode predictions using either a single-step decoder, which provides high-quality predictions in real-time, or a diffusion-based batch decoder.
arXiv Detail & Related papers (2024-07-30T18:37:59Z) - Vehicle Motion Forecasting using Prior Information and Semantic-assisted
Occupancy Grid Maps [6.99274104609965]
Motion is a challenging task for autonomous vehicles due to uncertainty in the sensor data, the non-deterministic nature of future, and complex behavior.
In this paper, we tackle this problem by representing the scene as dynamic occupancy grid maps (DOGMs)
We propose a novel framework that combines deep-temporal and probabilistic approaches to predict vehicle behaviors.
arXiv Detail & Related papers (2023-08-08T14:49:44Z) - Conditioned Human Trajectory Prediction using Iterative Attention Blocks [70.36888514074022]
We present a simple yet effective pedestrian trajectory prediction model aimed at pedestrians positions prediction in urban-like environments.
Our model is a neural-based architecture that can run several layers of attention blocks and transformers in an iterative sequential fashion.
We show that without explicit introduction of social masks, dynamical models, social pooling layers, or complicated graph-like structures, it is possible to produce on par results with SoTA models.
arXiv Detail & Related papers (2022-06-29T07:49:48Z) - Predicting Future Occupancy Grids in Dynamic Environment with
Spatio-Temporal Learning [63.25627328308978]
We propose a-temporal prediction network pipeline to generate future occupancy predictions.
Compared to current SOTA, our approach predicts occupancy for a longer horizon of 3 seconds.
We publicly release our grid occupancy dataset based on nulis to support further research.
arXiv Detail & Related papers (2022-05-06T13:45:32Z) - SLPC: a VRNN-based approach for stochastic lidar prediction and
completion in autonomous driving [63.87272273293804]
We propose a new LiDAR prediction framework that is based on generative models namely Variational Recurrent Neural Networks (VRNNs)
Our algorithm is able to address the limitations of previous video prediction frameworks when dealing with sparse data by spatially inpainting the depth maps in the upcoming frames.
We present a sparse version of VRNNs and an effective self-supervised training method that does not require any labels.
arXiv Detail & Related papers (2021-02-19T11:56:44Z) - Spatio-Temporal Graph Dual-Attention Network for Multi-Agent Prediction
and Tracking [23.608125748229174]
We propose a generic generative neural system for multi-agent trajectory prediction involving heterogeneous agents.
The proposed system is evaluated on three public benchmark datasets for trajectory prediction.
arXiv Detail & Related papers (2021-02-18T02:25:35Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.