MTR-A: 1st Place Solution for 2022 Waymo Open Dataset Challenge --
Motion Prediction
- URL: http://arxiv.org/abs/2209.10033v1
- Date: Tue, 20 Sep 2022 23:03:22 GMT
- Title: MTR-A: 1st Place Solution for 2022 Waymo Open Dataset Challenge --
Motion Prediction
- Authors: Shaoshuai Shi, Li Jiang, Dengxin Dai, Bernt Schiele
- Abstract summary: We propose a novel Motion Transformer framework for multimodal motion prediction, which introduces a small set of novel motion query pairs.
A simple model ensemble strategy with non-maximum-suppression is adopted to further boost the final performance.
Our approach achieves the 1st place on the motion prediction leaderboard of 2022 Open dataset Challenges, outperforming other methods with remarkable margins.
- Score: 103.75625476231401
- License: http://creativecommons.org/licenses/by-nc-sa/4.0/
- Abstract: In this report, we present the 1st place solution for motion prediction track
in 2022 Waymo Open Dataset Challenges. We propose a novel Motion Transformer
framework for multimodal motion prediction, which introduces a small set of
novel motion query pairs for generating better multimodal future trajectories
by jointly performing the intention localization and iterative motion
refinement. A simple model ensemble strategy with non-maximum-suppression is
adopted to further boost the final performance. Our approach achieves the 1st
place on the motion prediction leaderboard of 2022 Waymo Open Dataset
Challenges, outperforming other methods with remarkable margins. Code will be
available at https://github.com/sshaoshuai/MTR.
Related papers
- SMART: Scalable Multi-agent Real-time Motion Generation via Next-token Prediction [4.318757942343036]
We introduce a novel autonomous driving motion generation paradigm that models vectorized map and agent trajectory data into discrete sequence tokens.
These tokens are then processed through a decoder-only transformer architecture to train for the next token prediction task.
We have collected over 1 billion motion tokens from multiple datasets, validating the model's scalability.
arXiv Detail & Related papers (2024-05-24T16:17:35Z) - Motion Flow Matching for Human Motion Synthesis and Editing [75.13665467944314]
We propose emphMotion Flow Matching, a novel generative model for human motion generation featuring efficient sampling and effectiveness in motion editing applications.
Our method reduces the sampling complexity from thousand steps in previous diffusion models to just ten steps, while achieving comparable performance in text-to-motion and action-to-motion generation benchmarks.
arXiv Detail & Related papers (2023-12-14T12:57:35Z) - HumanMAC: Masked Motion Completion for Human Motion Prediction [62.279925754717674]
Human motion prediction is a classical problem in computer vision and computer graphics.
Previous effects achieve great empirical performance based on an encoding-decoding style.
In this paper, we propose a novel framework from a new perspective.
arXiv Detail & Related papers (2023-02-07T18:34:59Z) - Motion Transformer with Global Intention Localization and Local Movement
Refinement [103.75625476231401]
Motion TRansformer (MTR) models motion prediction as the joint optimization of global intention localization and local movement refinement.
MTR achieves state-of-the-art performance on both the marginal and joint motion prediction challenges.
arXiv Detail & Related papers (2022-09-27T16:23:14Z) - Golfer: Trajectory Prediction with Masked Goal Conditioning MnM Network [16.393675040056397]
We propose a general Transformer-like architectural module MnM network equipped with novel masked goal conditioning training procedures for AV trajectory prediction.
The model, named golfer, achieves state-of-the-art performance, winning the 2nd place in the 2022 Open Motion Prediction Challenge and ranked 1st place according to minADE.
arXiv Detail & Related papers (2022-07-02T04:57:44Z) - MotionCNN: A Strong Baseline for Motion Prediction in Autonomous Driving [6.10183951877597]
We present a simple and yet very strong baseline for multimodal motion prediction based purely on Convolutional Neural Networks.
Our source code is publicly available at GitHub.
arXiv Detail & Related papers (2022-06-05T12:31:58Z) - Transformer based trajectory prediction [71.31516599226606]
We present a simple and yet strong baseline for uncertainty aware motion prediction based purely on transformer neural networks.
While being easy-to-implement, the proposed approach achieves competitive performance and ranks 1$st$ on the 2021 Shifts Vehicle Motion Prediction Competition.
arXiv Detail & Related papers (2021-12-08T16:00:14Z) - Multimodal Motion Prediction with Stacked Transformers [35.9674180611893]
We propose a novel transformer framework for multimodal motion prediction, termed as mmTransformer.
A novel network architecture based on stacked transformers is designed to model the multimodality at feature level with a set of fixed independent proposals.
A region-based training strategy is then developed to induce the multimodality of the generated proposals.
arXiv Detail & Related papers (2021-03-22T07:25:54Z) - TPNet: Trajectory Proposal Network for Motion Prediction [81.28716372763128]
Trajectory Proposal Network (TPNet) is a novel two-stage motion prediction framework.
TPNet first generates a candidate set of future trajectories as hypothesis proposals, then makes the final predictions by classifying and refining the proposals.
Experiments on four large-scale trajectory prediction datasets, show that TPNet achieves the state-of-the-art results both quantitatively and qualitatively.
arXiv Detail & Related papers (2020-04-26T00:01:49Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.