Imitative Planning using Conditional Normalizing Flow
- URL: http://arxiv.org/abs/2007.16162v3
- Date: Thu, 13 Oct 2022 16:07:40 GMT
- Title: Imitative Planning using Conditional Normalizing Flow
- Authors: Shubhankar Agarwal, Harshit Sikchi, Cole Gulino, Eric Wilkinson and
Shivam Gautam
- Abstract summary: A popular way to plan trajectories in dynamic urban scenarios for Autonomous Vehicles is to rely on explicitly specified and hand crafted cost functions.
We explore the application of normalizing flows for improving the performance of trajectory planning for autonomous vehicles (AVs)
By modeling the trajectory planner's cost manifold as an energy function, we learn a scene conditioned mapping from the prior to a Boltzmann distribution over the AV control space.
- Score: 2.8978926857710263
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: A popular way to plan trajectories in dynamic urban scenarios for Autonomous
Vehicles is to rely on explicitly specified and hand crafted cost functions,
coupled with random sampling in the trajectory space to find the minimum cost
trajectory. Such methods require a high number of samples to find a low-cost
trajectory and might end up with a highly suboptimal trajectory given the
planning time budget. We explore the application of normalizing flows for
improving the performance of trajectory planning for autonomous vehicles (AVs).
Our key insight is to learn a sampling policy in a low-dimensional latent space
of expert-like trajectories, out of which the best sample is selected for
execution. By modeling the trajectory planner's cost manifold as an energy
function, we learn a scene conditioned mapping from the prior to a Boltzmann
distribution over the AV control space. Finally, we demonstrate the
effectiveness of our approach on real-world datasets over IL and
hand-constructed trajectory sampling techniques.
Related papers
- Truncating Trajectories in Monte Carlo Policy Evaluation: an Adaptive Approach [51.76826149868971]
Policy evaluation via Monte Carlo simulation is at the core of many MC Reinforcement Learning (RL) algorithms.
We propose as a quality index a surrogate of the mean squared error of a return estimator that uses trajectories of different lengths.
We present an adaptive algorithm called Robust and Iterative Data collection strategy Optimization (RIDO)
arXiv Detail & Related papers (2024-10-17T11:47:56Z) - TrajDiffuse: A Conditional Diffusion Model for Environment-Aware Trajectory Prediction [16.188078087197106]
We propose TrajDiffuse, a planning-based trajectory prediction method using a novel guided conditional diffusion model.
We form the trajectory prediction problem as a denoising impaint task and design a map-based guidance term for the diffusion process.
TrajDiffuse is able to generate trajectory predictions that match or exceed the accuracy and diversity of the SOTA, while adhering almost perfectly to environmental constraints.
arXiv Detail & Related papers (2024-10-14T17:59:03Z) - Rocket Landing Control with Grid Fins and Path-following using MPC [0.0]
The goal is to minimize the total fuel consumption during the landing process using different techniques.
Once the optimal and feasible trajectory is generated using batch approach, we attempt to follow the path using a Model Predictive Control (MPC) based algorithm.
We show that TOPED can follow a demonstration trajectory well in practice under model mismatch and different initial states.
arXiv Detail & Related papers (2024-05-25T11:42:29Z) - D-Cubed: Latent Diffusion Trajectory Optimisation for Dexterous Deformable Manipulation [15.680133621889809]
D-Cubed is a novel trajectory optimisation method using a latent diffusion model (LDM) trained from a task-agnostic play dataset.
We demonstrate that D-Cubed outperforms traditional trajectory optimisation and competitive baseline approaches by a significant margin.
arXiv Detail & Related papers (2024-03-19T16:05:51Z) - Pre-training on Synthetic Driving Data for Trajectory Prediction [61.520225216107306]
We propose a pipeline-level solution to mitigate the issue of data scarcity in trajectory forecasting.
We adopt HD map augmentation and trajectory synthesis for generating driving data, and then we learn representations by pre-training on them.
We conduct extensive experiments to demonstrate the effectiveness of our data expansion and pre-training strategies.
arXiv Detail & Related papers (2023-09-18T19:49:22Z) - DDPEN: Trajectory Optimisation With Sub Goal Generation Model [70.36888514074022]
In this paper, we produce a novel Differential Dynamic Programming with Escape Network (DDPEN)
We propose to utilize a deep model that takes as an input map of the environment in the form of a costmap together with the desired position.
The model produces possible future directions that will lead to the goal, avoiding local minima which is possible to run in real time conditions.
arXiv Detail & Related papers (2023-01-18T11:02:06Z) - Learning Space Partitions for Path Planning [54.475949279050596]
PlaLaM outperforms existing path planning methods in 2D navigation tasks, especially in the presence of difficult-to-escape local optima.
These gains transfer to highly multimodal real-world tasks, where we outperform strong baselines in compiler phase ordering by up to 245% and in molecular design by up to 0.4 on properties on a 0-1 scale.
arXiv Detail & Related papers (2021-06-19T18:06:11Z) - Autonomous Drone Racing with Deep Reinforcement Learning [39.757652701917166]
In many robotic tasks, such as drone racing, the goal is to travel through a set of waypoints as fast as possible.
A key challenge is planning the minimum-time trajectory, which is typically solved by assuming perfect knowledge of the waypoints to pass in advance.
In this work, a new approach to minimum-time trajectory generation for quadrotors is presented.
arXiv Detail & Related papers (2021-03-15T18:05:49Z) - End-to-end Interpretable Neural Motion Planner [78.69295676456085]
We propose a neural motion planner (NMP) for learning to drive autonomously in complex urban scenarios.
We design a holistic model that takes as input raw LIDAR data and a HD map and produces interpretable intermediate representations.
We demonstrate the effectiveness of our approach in real-world driving data captured in several cities in North America.
arXiv Detail & Related papers (2021-01-17T14:16:12Z) - Diverse Sampling for Normalizing Flow Based Trajectory Forecasting [34.01303881881315]
We propose Diversity Sampling for Flow (DSF) to improve the quality and diversity of trajectory samples from a pre-trained flow model.
DSF is easy to implement, and we show that it offers a simple plug-in improvement for several existing flow-based forecasting models.
arXiv Detail & Related papers (2020-11-30T18:23:29Z) - Path Planning Followed by Kinodynamic Smoothing for Multirotor Aerial
Vehicles (MAVs) [61.94975011711275]
We propose a geometrically based motion planning technique textquotedblleft RRT*textquotedblright; for this purpose.
In the proposed technique, we modified original RRT* introducing an adaptive search space and a steering function.
We have tested the proposed technique in various simulated environments.
arXiv Detail & Related papers (2020-08-29T09:55:49Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.