Exploring Social Posterior Collapse in Variational Autoencoder for
Interaction Modeling
- URL: http://arxiv.org/abs/2112.00298v1
- Date: Wed, 1 Dec 2021 06:20:58 GMT
- Title: Exploring Social Posterior Collapse in Variational Autoencoder for
Interaction Modeling
- Authors: Chen Tang, Wei Zhan, Masayoshi Tomizuka
- Abstract summary: Variational Autoencoder (VAE) has been widely applied in multi-agent interaction modeling.
VAE is prone to ignoring historical social context when predicting the future trajectory of an agent.
We propose a novel sparse graph attention message-passing layer, which helps us detect social posterior collapse.
- Score: 26.01824780050843
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Multi-agent behavior modeling and trajectory forecasting are crucial for the
safe navigation of autonomous agents in interactive scenarios. Variational
Autoencoder (VAE) has been widely applied in multi-agent interaction modeling
to generate diverse behavior and learn a low-dimensional representation for
interacting systems. However, existing literature did not formally discuss if a
VAE-based model can properly encode interaction into its latent space. In this
work, we argue that one of the typical formulations of VAEs in multi-agent
modeling suffers from an issue we refer to as social posterior collapse, i.e.,
the model is prone to ignoring historical social context when predicting the
future trajectory of an agent. It could cause significant prediction errors and
poor generalization performance. We analyze the reason behind this
under-explored phenomenon and propose several measures to tackle it. Afterward,
we implement the proposed framework and experiment on real-world datasets for
multi-agent trajectory prediction. In particular, we propose a novel sparse
graph attention message-passing (sparse-GAMP) layer, which helps us detect
social posterior collapse in our experiments. In the experiments, we verify
that social posterior collapse indeed occurs. Also, the proposed measures are
effective in alleviating the issue. As a result, the model attains better
generalization performance when historical social context is informative for
prediction.
Related papers
- PMM-Net: Single-stage Multi-agent Trajectory Prediction with Patching-based Embedding and Explicit Modal Modulation [6.793915571620126]
In this letter, we aim to explore a distinct formulation for multi-agent trajectory prediction framework.
We propose a patching-based temporal feature extraction module and a graph-based social feature extraction module.
We present a novel method based on explicit modality modulation to integrate temporal and social features, thereby constructing an efficient single-stage inference pipeline.
arXiv Detail & Related papers (2024-10-25T13:16:27Z) - Spatial and social situation-aware transformer-based trajectory prediction of autonomous systems [2.498836880652668]
Anticipating the behavior of an agent in a given situation is required to adequately react to it in time.
Deep learning-based models has become the dominant approach to motion prediction recently.
For longer prediction horizons, the deviation of the predicted trajectory from the ground truth is lower compared to a spatially and socially agnostic model.
arXiv Detail & Related papers (2024-06-04T20:36:16Z) - JRDB-Traj: A Dataset and Benchmark for Trajectory Forecasting in Crowds [79.00975648564483]
Trajectory forecasting models, employed in fields such as robotics, autonomous vehicles, and navigation, face challenges in real-world scenarios.
This dataset provides comprehensive data, including the locations of all agents, scene images, and point clouds, all from the robot's perspective.
The objective is to predict the future positions of agents relative to the robot using raw sensory input data.
arXiv Detail & Related papers (2023-11-05T18:59:31Z) - Conditioned Human Trajectory Prediction using Iterative Attention Blocks [70.36888514074022]
We present a simple yet effective pedestrian trajectory prediction model aimed at pedestrians positions prediction in urban-like environments.
Our model is a neural-based architecture that can run several layers of attention blocks and transformers in an iterative sequential fashion.
We show that without explicit introduction of social masks, dynamical models, social pooling layers, or complicated graph-like structures, it is possible to produce on par results with SoTA models.
arXiv Detail & Related papers (2022-06-29T07:49:48Z) - LatentFormer: Multi-Agent Transformer-Based Interaction Modeling and
Trajectory Prediction [12.84508682310717]
We propose LatentFormer, a transformer-based model for predicting future vehicle trajectories.
We evaluate the proposed method on the nuScenes benchmark dataset and show that our approach achieves state-of-the-art performance and improves upon trajectory metrics by up to 40%.
arXiv Detail & Related papers (2022-03-03T17:44:58Z) - You Mostly Walk Alone: Analyzing Feature Attribution in Trajectory
Prediction [52.442129609979794]
Recent deep learning approaches for trajectory prediction show promising performance.
It remains unclear which features such black-box models actually learn to use for making predictions.
This paper proposes a procedure that quantifies the contributions of different cues to model performance.
arXiv Detail & Related papers (2021-10-11T14:24:15Z) - Test-time Collective Prediction [73.74982509510961]
Multiple parties in machine learning want to jointly make predictions on future test points.
Agents wish to benefit from the collective expertise of the full set of agents, but may not be willing to release their data or model parameters.
We explore a decentralized mechanism to make collective predictions at test time, leveraging each agent's pre-trained model.
arXiv Detail & Related papers (2021-06-22T18:29:58Z) - SMART: Simultaneous Multi-Agent Recurrent Trajectory Prediction [72.37440317774556]
We propose advances that address two key challenges in future trajectory prediction.
multimodality in both training data and predictions and constant time inference regardless of number of agents.
arXiv Detail & Related papers (2020-07-26T08:17:10Z) - Causal Discovery in Physical Systems from Videos [123.79211190669821]
Causal discovery is at the core of human cognition.
We consider the task of causal discovery from videos in an end-to-end fashion without supervision on the ground-truth graph structure.
arXiv Detail & Related papers (2020-07-01T17:29:57Z) - Learning Opinion Dynamics From Social Traces [25.161493874783584]
We propose an inference mechanism for fitting a generative, agent-like model of opinion dynamics to real-world social traces.
We showcase our proposal by translating a classical agent-based model of opinion dynamics into its generative counterpart.
We apply our model to real-world data from Reddit to explore the long-standing question about the impact of backfire effect.
arXiv Detail & Related papers (2020-06-02T14:48:17Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.