Related papers: The Impact of Missing Velocity Information in Dynamic Obstacle Avoidance based on Deep Reinforcement Learning

URL: http://arxiv.org/abs/2112.12465v1
Date: Thu, 23 Dec 2021 11:07:00 GMT
Title: The Impact of Missing Velocity Information in Dynamic Obstacle Avoidance based on Deep Reinforcement Learning
Authors: Fabian Hart, Martin Waltz, Ostap Okhrin
Abstract summary: We introduce a novel approach to dynamic obstacle avoidance based on Deep Reinforcement Learning. We thoroughly investigate the effect of missing velocity information on an agent's performance in obstacle avoidance tasks.
Score: 0.0
License: http://creativecommons.org/licenses/by/4.0/
Abstract: We introduce a novel approach to dynamic obstacle avoidance based on Deep Reinforcement Learning by defining a traffic type independent environment with variable complexity. Filling a gap in the current literature, we thoroughly investigate the effect of missing velocity information on an agent's performance in obstacle avoidance tasks. This is a crucial issue in practice since several sensors yield only positional information of objects or vehicles. We evaluate frequently-applied approaches in scenarios of partial observability, namely the incorporation of recurrency in the deep neural networks and simple frame-stacking. For our analysis, we rely on state-of-the-art model-free deep RL algorithms. The lack of velocity information is found to significantly impact the performance of an agent. Both approaches - recurrency and frame-stacking - cannot consistently replace missing velocity information in the observation space. However, in simplified scenarios, they can significantly boost performance and stabilize the overall training procedure.

Related papers

When Context Is Not Enough: Modeling Unexplained Variability in Car-Following Behavior [22.102157707436884]
Traditional deterministic models often fail to capture the full extent of variability and unpredictability in human driving. This study introduces an interpretable modeling framework that captures not only context-dependent dynamics but also residual variability beyond what context can explain. The integration of interpretability and accuracy makes this framework a promising tool for traffic analysis and safety-critical applications.
arXiv Detail & Related papers (2025-07-09T16:42:41Z)
Trajectory Entropy Reinforcement Learning for Predictable and Robust Control [12.289021814766539]
We introduce a novel inductive bias towards simple policies in reinforcement learning. The simplicity inductive bias is introduced by minimizing the entropy of entire action trajectories. We show that our learned policies produce more cyclical and consistent action trajectories.
arXiv Detail & Related papers (2025-05-07T07:41:29Z)
Decoupled Dynamics Framework with Neural Fields for 3D Spatio-temporal Prediction of Vehicle Collisions [1.474723404975345]
This study proposes a neural framework that predicts 3D vehicle collision dynamics by independently modeling global rigid-body motion and local structural deformation. Two specialized networks form the core of the framework: a quaternion-based Rigid Net for rigid motion and a coordinate-based Deformation Net for local deformation. The model, trained on only 10% of available simulation data, significantly outperforms baseline models, with prediction errors reduced by up to 83%.
arXiv Detail & Related papers (2025-03-25T14:38:37Z)
Vision-Based Deep Reinforcement Learning of UAV Autonomous Navigation Using Privileged Information [6.371251946803415]
DPRL is an end-to-end policy designed to address the challenge of high-speed autonomous UAV navigation under partially observable environmental conditions. We leverage an asymmetric Actor-Critic architecture to provide the agent with privileged information during training. We conduct extensive simulations across various scenarios, benchmarking our DPRL algorithm against the state-of-the-art navigation algorithms.
arXiv Detail & Related papers (2024-12-09T09:05:52Z)
Localized Gaussians as Self-Attention Weights for Point Clouds Correspondence [92.07601770031236]
We investigate semantically meaningful patterns in the attention heads of an encoder-only Transformer architecture. We find that fixing the attention weights not only accelerates the training process but also enhances the stability of the optimization.
arXiv Detail & Related papers (2024-09-20T07:41:47Z)
Analyzing Adversarial Inputs in Deep Reinforcement Learning [53.3760591018817]
We present a comprehensive analysis of the characterization of adversarial inputs, through the lens of formal verification. We introduce a novel metric, the Adversarial Rate, to classify models based on their susceptibility to such perturbations. Our analysis empirically demonstrates how adversarial inputs can affect the safety of a given DRL system with respect to such perturbations.
arXiv Detail & Related papers (2024-02-07T21:58:40Z)
Video Dynamics Prior: An Internal Learning Approach for Robust Video Enhancements [83.5820690348833]
We present a framework for low-level vision tasks that does not require any external training data corpus. Our approach learns the weights of neural modules by optimizing over a corrupted test sequence, leveraging spatio-temporal coherence and internal statistics.
arXiv Detail & Related papers (2023-12-13T01:57:11Z)
Robustness Benchmark of Road User Trajectory Prediction Models for Automated Driving [0.0]
We benchmark machine learning models against perturbations that simulate functional insufficiencies observed during model deployment in a vehicle. Training the models with similar perturbations effectively reduces performance degradation, with error increases of up to +87.5%. We argue that despite being an effective mitigation strategy, data augmentation through perturbations during training does not guarantee robustness towards unforeseen perturbations.
arXiv Detail & Related papers (2023-04-04T15:47:42Z)
Enhancing Multiple Reliability Measures via Nuisance-extended Information Bottleneck [77.37409441129995]
In practical scenarios where training data is limited, many predictive signals in the data may instead stem from biases in data acquisition. We consider an adversarial threat model under a mutual information constraint to cover a wider class of perturbations in training. We propose an autoencoder-based training to implement the objective, as well as practical encoder designs to facilitate the proposed hybrid discriminative-generative training.
arXiv Detail & Related papers (2023-03-24T16:03:21Z)
RIOT: Recursive Inertial Odometry Transformer for Localisation from Low-Cost IMU Measurements [5.770538064283154]
We present two end-to-end frameworks for pose invariant deep inertial odometry that utilise self-attention to capture both spatial features and long-range dependencies in inertial data. We evaluate our approaches against a custom 2-layer Gated Recurrent Unit, trained in the same manner on the same data, and test each approach on a number of different users, devices and activities.
arXiv Detail & Related papers (2023-03-03T00:20:01Z)
Robust Longitudinal Control for Vehicular Autonomous Platoons Using Deep Reinforcement Learning [3.0552168294716298]
We propose an approach to generalize the training process of a vehicular platoon, such that the acceleration command of each agent becomes independent of the network topology. We illustrate the effectiveness of our proposal with experiments using different network topologies, uncertain parameters, and external forces.
arXiv Detail & Related papers (2022-05-31T20:38:12Z)
Improving robustness of jet tagging algorithms with adversarial training [56.79800815519762]
We investigate the vulnerability of flavor tagging algorithms via application of adversarial attacks. We present an adversarial training strategy that mitigates the impact of such simulated attacks.
arXiv Detail & Related papers (2022-03-25T19:57:19Z)
Spatial-Temporal Conv-sequence Learning with Accident Encoding for Traffic Flow Prediction [17.94199362114272]
In intelligent transportation systems, the key problem of traffic forecasting is how to extract periodic temporal dependencies and complex spatial correlations. We propose Spatial-Temporal Conv-sequence Learning (STCL), in which a focused temporal block uses unidirectional convolution to effectively capture short-term periodic temporal dependence. We conduct extensive experiments on large-scale real-world tasks and verify the effectiveness of our proposed method.
arXiv Detail & Related papers (2021-05-21T17:43:07Z)
Attribute-Guided Adversarial Training for Robustness to Natural Perturbations [64.35805267250682]
We propose an adversarial training approach which learns to generate new samples so as to maximize exposure of the classifier to the attribute space. Our approach enables deep neural networks to be robust against a wide range of naturally occurring perturbations.
arXiv Detail & Related papers (2020-12-03T10:17:30Z)

This list is automatically generated from the titles and abstracts of the papers on this site.