MSN: Multi-Style Network for Trajectory Prediction
- URL: http://arxiv.org/abs/2107.00932v5
- Date: Mon, 8 May 2023 07:30:35 GMT
- Title: MSN: Multi-Style Network for Trajectory Prediction
- Authors: Conghao Wong, Beihao Xia, Qinmu Peng, Wei Yuan and Xinge You
- Abstract summary: Trajectory prediction aims to forecast agents' possible future locations considering their observations along with the video context.
This paper proposes the Multi-Style Network (MSN), which utilizes style proposal and stylized prediction using two sub-networks.
Experiments show that the proposed MSN outperforms current state-of-the-art methods up to 10% quantitatively on two widely used datasets.
- Score: 14.861532983777133
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Trajectory prediction aims to forecast agents' possible future locations
considering their observations along with the video context. It is strongly
needed by many autonomous platforms like tracking, detection, robot navigation,
and self-driving cars. Whether it is agents' internal personality factors,
interactive behaviors with the neighborhood, or the influence of surroundings,
they all impact agents' future planning. However, many previous methods model
and predict agents' behaviors with the same strategy or feature distribution,
making them challenging to make predictions with sufficient style differences.
This paper proposes the Multi-Style Network (MSN), which utilizes style
proposal and stylized prediction using two sub-networks, to provide multi-style
predictions in a novel categorical way adaptively. The proposed network
contains a series of style channels, and each channel is bound to a unique and
specific behavior style. We use agents' end-point plannings and their
interaction context as the basis for the behavior classification, so as to
adaptively learn multiple diverse behavior styles through these channels. Then,
we assume that the target agents may plan their future behaviors according to
each of these categorized styles, thus utilizing different style channels to
make predictions with significant style differences in parallel. Experiments
show that the proposed MSN outperforms current state-of-the-art methods up to
10% quantitatively on two widely used datasets, and presents better multi-style
characteristics qualitatively.
Related papers
- Learning signatures of decision making from many individuals playing the
same game [54.33783158658077]
We design a predictive framework that learns representations to encode an individual's 'behavioral style'
We apply our method to a large-scale behavioral dataset from 1,000 humans playing a 3-armed bandit task.
arXiv Detail & Related papers (2023-02-21T21:41:53Z) - Deep Interactive Motion Prediction and Planning: Playing Games with
Motion Prediction Models [162.21629604674388]
This work presents a game-theoretic Model Predictive Controller (MPC) that uses a novel interactive multi-agent neural network policy as part of its predictive model.
Fundamental to the success of our method is the design of a novel multi-agent policy network that can steer a vehicle given the state of the surrounding agents and the map information.
arXiv Detail & Related papers (2022-04-05T17:58:18Z) - View Vertically: A Hierarchical Network for Trajectory Prediction via
Fourier Spectrums [8.065451321690011]
Learning to understand and predict future motions or behaviors for agents like humans and robots are critical to various autonomous platforms.
We propose the Transformer-based V model, which predicts agents' trajectories with spectrums in the keypoints and interactions levels respectively.
Experimental results show that V outperforms most of current state-of-the-art methods on ETH-UCY and SDD trajectories dataset.
arXiv Detail & Related papers (2021-10-14T11:48:31Z) - You Mostly Walk Alone: Analyzing Feature Attribution in Trajectory
Prediction [52.442129609979794]
Recent deep learning approaches for trajectory prediction show promising performance.
It remains unclear which features such black-box models actually learn to use for making predictions.
This paper proposes a procedure that quantifies the contributions of different cues to model performance.
arXiv Detail & Related papers (2021-10-11T14:24:15Z) - Online Multi-Agent Forecasting with Interpretable Collaborative Graph
Neural Network [65.11999700562869]
We propose a novel collaborative prediction unit (CoPU), which aggregates predictions from multiple collaborative predictors according to a collaborative graph.
Our methods outperform state-of-the-art works on the three tasks by 28.6%, 17.4% and 21.0% on average.
arXiv Detail & Related papers (2021-07-02T08:20:06Z) - Scene Transformer: A unified multi-task model for behavior prediction
and planning [42.758178896204036]
We formulate a model for predicting the behavior of all agents jointly in real-world driving environments.
Inspired by recent language modeling approaches, we use a masking strategy as the query to our model.
We evaluate our approach on autonomous driving datasets for behavior prediction, and achieve state-of-the-art performance.
arXiv Detail & Related papers (2021-06-15T20:20:44Z) - Divide-and-Conquer for Lane-Aware Diverse Trajectory Prediction [71.97877759413272]
Trajectory prediction is a safety-critical tool for autonomous vehicles to plan and execute actions.
Recent methods have achieved strong performances using Multi-Choice Learning objectives like winner-takes-all (WTA) or best-of-many.
Our work addresses two key challenges in trajectory prediction, learning outputs, and better predictions by imposing constraints using driving knowledge.
arXiv Detail & Related papers (2021-04-16T17:58:56Z) - LaPred: Lane-Aware Prediction of Multi-Modal Future Trajectories of
Dynamic Agents [10.869902339190949]
We propose a novel prediction model, referred to as the lane-aware prediction (LaPred) network.
LaPred uses the instance-level lane entities extracted from a semantic map to predict the multi-modal future trajectories.
The experiments conducted on the public nuScenes and Argoverse dataset demonstrate that the proposed LaPred method significantly outperforms the existing prediction models.
arXiv Detail & Related papers (2021-04-01T04:33:36Z) - Pedestrian Behavior Prediction via Multitask Learning and Categorical
Interaction Modeling [13.936894582450734]
We propose a multitask learning framework that simultaneously predicts trajectories and actions of pedestrians by relying on multimodal data.
We show that our model achieves state-of-the-art performance and improves trajectory and action prediction by up to 22% and 6% respectively.
arXiv Detail & Related papers (2020-12-06T15:57:11Z) - Multi-Modal Hybrid Architecture for Pedestrian Action Prediction [14.032334569498968]
We propose a novel multi-modal prediction algorithm that incorporates different sources of information captured from the environment to predict future crossing actions of pedestrians.
Using the existing 2D pedestrian behavior benchmarks and a newly annotated 3D driving dataset, we show that our proposed model achieves state-of-the-art performance in pedestrian crossing prediction.
arXiv Detail & Related papers (2020-11-16T15:17:58Z) - SMART: Simultaneous Multi-Agent Recurrent Trajectory Prediction [72.37440317774556]
We propose advances that address two key challenges in future trajectory prediction.
multimodality in both training data and predictions and constant time inference regardless of number of agents.
arXiv Detail & Related papers (2020-07-26T08:17:10Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.