Grouptron: Dynamic Multi-Scale Graph Convolutional Networks for
Group-Aware Dense Crowd Trajectory Forecasting
- URL: http://arxiv.org/abs/2109.14128v1
- Date: Wed, 29 Sep 2021 01:22:25 GMT
- Title: Grouptron: Dynamic Multi-Scale Graph Convolutional Networks for
Group-Aware Dense Crowd Trajectory Forecasting
- Authors: Rui Zhou, Hongyu Zhou, Masayoshi Tomizuka, Jiachen Li, and Zhuo Xu
- Abstract summary: Grouptron is a multi-scale dynamic forecasting framework that leverages pedestrian group detection and representation.
Method achieves 9.3% decrease in final displacement error (FDE) compared with state-of-the-art methods on ETH/UCY benchmark datasets.
- Score: 30.885838880247263
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Accurate, long-term forecasting of human pedestrian trajectories in highly
dynamic and interactive scenes is a long-standing challenge. Recent advances in
using data-driven approaches have achieved significant improvements in terms of
prediction accuracy. However, the lack of group-aware analysis has limited the
performance of forecasting models. This is especially apparent in highly
populated scenes, where pedestrians are moving in groups and the interactions
between groups are extremely complex and dynamic. In this paper, we present
Grouptron, a multi-scale dynamic forecasting framework that leverages
pedestrian group detection and utilizes individual-level, group-level, and
scene-level information for better understanding and representation of the
scenes. Our approach employs spatio-temporal clustering algorithms to identify
pedestrian groups, creates spatio-temporal graphs at the individual, group, and
scene levels. It then uses graph neural networks to encode dynamics at
different scales and incorporates encoding across different scales for
trajectory prediction. We carried out extensive comparisons and ablation
experiments to demonstrate the effectiveness of our approach. Our method
achieves 9.3% decrease in final displacement error (FDE) compared with
state-of-the-art methods on ETH/UCY benchmark datasets, and 16.1% decrease in
FDE in more crowded scenes where extensive human group interactions are more
frequently present.
Related papers
- TimeGraphs: Graph-based Temporal Reasoning [64.18083371645956]
TimeGraphs is a novel approach that characterizes dynamic interactions as a hierarchical temporal graph.
Our approach models the interactions using a compact graph-based representation, enabling adaptive reasoning across diverse time scales.
We evaluate TimeGraphs on multiple datasets with complex, dynamic agent interactions, including a football simulator, the Resistance game, and the MOMA human activity dataset.
arXiv Detail & Related papers (2024-01-06T06:26:49Z) - Local-Global Information Interaction Debiasing for Dynamic Scene Graph
Generation [51.92419880088668]
We propose a novel DynSGG model based on multi-task learning, DynSGG-MTL, which introduces the local interaction information and global human-action interaction information.
Long-temporal human actions supervise the model to generate multiple scene graphs that conform to the global constraints and avoid the model being unable to learn the tail predicates.
arXiv Detail & Related papers (2023-08-10T01:24:25Z) - The MI-Motion Dataset and Benchmark for 3D Multi-Person Motion
Prediction [13.177817435234449]
3D multi-person motion prediction is a challenging task that involves modeling individual behaviors and interactions between people.
We introduce the Multi-Person Interaction Motion (MI-Motion) dataset, which includes skeleton sequences of multiple individuals collected by motion capture systems.
The dataset contains 167k frames of interacting people's skeleton poses and is categorized into 5 different activity scenes.
arXiv Detail & Related papers (2023-06-23T15:38:22Z) - Conditioned Human Trajectory Prediction using Iterative Attention Blocks [70.36888514074022]
We present a simple yet effective pedestrian trajectory prediction model aimed at pedestrians positions prediction in urban-like environments.
Our model is a neural-based architecture that can run several layers of attention blocks and transformers in an iterative sequential fashion.
We show that without explicit introduction of social masks, dynamical models, social pooling layers, or complicated graph-like structures, it is possible to produce on par results with SoTA models.
arXiv Detail & Related papers (2022-06-29T07:49:48Z) - Dynamic-Group-Aware Networks for Multi-Agent Trajectory Prediction with
Relational Reasoning [29.294244001911242]
We propose DynGroupNet, a dynamic-group-aware network, which can model time-varying interactions in highly dynamic scenes.
Based on DynGroupNet, we design a prediction system to forecast socially plausible trajectories with dynamic relational reasoning.
arXiv Detail & Related papers (2022-06-27T08:36:56Z) - HighlightMe: Detecting Highlights from Human-Centric Videos [62.265410865423]
We present a domain- and user-preference-agnostic approach to detect highlightable excerpts from human-centric videos.
We use an autoencoder network equipped with spatial-temporal graph convolutions to detect human activities and interactions.
We observe a 4-12% improvement in the mean average precision of matching the human-annotated highlights over state-of-the-art methods.
arXiv Detail & Related papers (2021-10-05T01:18:15Z) - SGCN:Sparse Graph Convolution Network for Pedestrian Trajectory
Prediction [64.16212996247943]
We present a Sparse Graph Convolution Network(SGCN) for pedestrian trajectory prediction.
Specifically, the SGCN explicitly models the sparse directed interaction with a sparse directed spatial graph to capture adaptive interaction pedestrians.
visualizations indicate that our method can capture adaptive interactions between pedestrians and their effective motion tendencies.
arXiv Detail & Related papers (2021-04-04T03:17:42Z) - HGCN-GJS: Hierarchical Graph Convolutional Network with Groupwise Joint
Sampling for Trajectory Prediction [14.57655217378212]
In this work, we propose a hierarchical graph convolutional network, HGCN-GJS, for trajectory prediction.
We introduce a novel joint sampling scheme for modeling the joint distribution of multiple pedestrians in the future trajectories.
We demonstrate the performance of our network on several trajectory prediction datasets, achieving state-of-the-art results on all datasets considered.
arXiv Detail & Related papers (2020-09-15T14:51:10Z) - Predicting Temporal Sets with Deep Neural Networks [50.53727580527024]
We propose an integrated solution based on the deep neural networks for temporal sets prediction.
A unique perspective is to learn element relationship by constructing set-level co-occurrence graph.
We design an attention-based module to adaptively learn the temporal dependency of elements and sets.
arXiv Detail & Related papers (2020-06-20T03:29:02Z) - CoMoGCN: Coherent Motion Aware Trajectory Prediction with Graph
Representation [12.580809204729583]
We propose a novel framework, coherent motion aware graph convolutional network (CoMoGCN), for trajectory prediction in crowded scenes with group constraints.
Our method achieves state-of-the-art performance on several different trajectory prediction benchmarks, and the best average performance among all benchmarks considered.
arXiv Detail & Related papers (2020-05-02T09:10:30Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.