Temporal Pyramid Network for Pedestrian Trajectory Prediction with
Multi-Supervision
- URL: http://arxiv.org/abs/2012.01884v2
- Date: Fri, 4 Dec 2020 02:11:49 GMT
- Title: Temporal Pyramid Network for Pedestrian Trajectory Prediction with
Multi-Supervision
- Authors: Rongqin Liang, Yuanman Li, Xia Li, yi tang, Jiantao Zhou, Wenbin Zou
- Abstract summary: We propose a temporal pyramid network for pedestrian trajectory prediction through a squeeze modulation and a dilation modulation.
Our hierarchical framework builds a feature pyramid with increasingly richer temporal information from top to bottom, which can better capture the motion behavior at various tempos.
By progressively merging the top coarse features of global context to the bottom fine features of rich local context, our method can fully exploit both the long-range and short-range information of the trajectory.
- Score: 27.468166556263256
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Predicting human motion behavior in a crowd is important for many
applications, ranging from the natural navigation of autonomous vehicles to
intelligent security systems of video surveillance. All the previous works
model and predict the trajectory with a single resolution, which is rather
inefficient and difficult to simultaneously exploit the long-range information
(e.g., the destination of the trajectory), and the short-range information
(e.g., the walking direction and speed at a certain time) of the motion
behavior. In this paper, we propose a temporal pyramid network for pedestrian
trajectory prediction through a squeeze modulation and a dilation modulation.
Our hierarchical framework builds a feature pyramid with increasingly richer
temporal information from top to bottom, which can better capture the motion
behavior at various tempos. Furthermore, we propose a coarse-to-fine fusion
strategy with multi-supervision. By progressively merging the top coarse
features of global context to the bottom fine features of rich local context,
our method can fully exploit both the long-range and short-range information of
the trajectory. Experimental results on several benchmarks demonstrate the
superiority of our method.
Related papers
- Deep Learning-Based Robust Multi-Object Tracking via Fusion of mmWave Radar and Camera Sensors [6.166992288822812]
Multi-Object Tracking plays a critical role in ensuring safer and more efficient navigation through complex traffic scenarios.
This paper presents a novel deep learning-based method that integrates radar and camera data to enhance the accuracy and robustness of Multi-Object Tracking in autonomous driving systems.
arXiv Detail & Related papers (2024-07-10T21:09:09Z) - Layout Sequence Prediction From Noisy Mobile Modality [53.49649231056857]
Trajectory prediction plays a vital role in understanding pedestrian movement for applications such as autonomous driving and robotics.
Current trajectory prediction models depend on long, complete, and accurately observed sequences from visual modalities.
We propose LTrajDiff, a novel approach that treats objects obstructed or out of sight as equally important as those with fully visible trajectories.
arXiv Detail & Related papers (2023-10-09T20:32:49Z) - MotionTrack: Learning Motion Predictor for Multiple Object Tracking [68.68339102749358]
We introduce a novel motion-based tracker, MotionTrack, centered around a learnable motion predictor.
Our experimental results demonstrate that MotionTrack yields state-of-the-art performance on datasets such as Dancetrack and SportsMOT.
arXiv Detail & Related papers (2023-06-05T04:24:11Z) - MSTFormer: Motion Inspired Spatial-temporal Transformer with
Dynamic-aware Attention for long-term Vessel Trajectory Prediction [0.6451914896767135]
MSTFormer is a motion inspired vessel trajectory prediction method based on Transformer.
We propose a data augmentation method to describe the spatial features and motion features of the trajectory.
Second, we propose a Multi-headed Dynamic-aware Self-attention mechanism to focus on trajectory points with frequent motion transformations.
Third, we construct a knowledge-inspired loss function to further boost the performance of the model.
arXiv Detail & Related papers (2023-03-21T02:11:37Z) - MotionTrack: Learning Robust Short-term and Long-term Motions for
Multi-Object Tracking [56.92165669843006]
We propose MotionTrack, which learns robust short-term and long-term motions in a unified framework to associate trajectories from a short to long range.
For dense crowds, we design a novel Interaction Module to learn interaction-aware motions from short-term trajectories, which can estimate the complex movement of each target.
For extreme occlusions, we build a novel Refind Module to learn reliable long-term motions from the target's history trajectory, which can link the interrupted trajectory with its corresponding detection.
arXiv Detail & Related papers (2023-03-18T12:38:33Z) - PDFormer: Propagation Delay-Aware Dynamic Long-Range Transformer for
Traffic Flow Prediction [78.05103666987655]
spatial-temporal Graph Neural Network (GNN) models have emerged as one of the most promising methods to solve this problem.
We propose a novel propagation delay-aware dynamic long-range transFormer, namely PDFormer, for accurate traffic flow prediction.
Our method can not only achieve state-of-the-art performance but also exhibit competitive computational efficiency.
arXiv Detail & Related papers (2023-01-19T08:42:40Z) - Motion Transformer with Global Intention Localization and Local Movement
Refinement [103.75625476231401]
Motion TRansformer (MTR) models motion prediction as the joint optimization of global intention localization and local movement refinement.
MTR achieves state-of-the-art performance on both the marginal and joint motion prediction challenges.
arXiv Detail & Related papers (2022-09-27T16:23:14Z) - SDMTL: Semi-Decoupled Multi-grained Trajectory Learning for 3D human
motion prediction [5.581663772616127]
We propose a novel end-to-end network, Semi-Decoupled Multi-grained Trajectory Learning network, to predict future human motion.
Specifically, we capture the temporal dynamics of motion trajectory at multi-granularity, including fine granularity and coarse.
We learn multi-grained trajectory information using BSMEs hierarchically and further capture the information of temporal evolutional directions at each granularity.
arXiv Detail & Related papers (2020-10-11T01:29:21Z) - A Spatial-Temporal Attentive Network with Spatial Continuity for
Trajectory Prediction [74.00750936752418]
We propose a novel model named spatial-temporal attentive network with spatial continuity (STAN-SC)
First, spatial-temporal attention mechanism is presented to explore the most useful and important information.
Second, we conduct a joint feature sequence based on the sequence and instant state information to make the generative trajectories keep spatial continuity.
arXiv Detail & Related papers (2020-03-13T04:35:50Z) - Trajectron++: Dynamically-Feasible Trajectory Forecasting With
Heterogeneous Data [37.176411554794214]
Reasoning about human motion is an important prerequisite to safe and socially-aware robotic navigation.
We present Trajectron++, a modular, graph-structured recurrent model that forecasts the trajectories of a general number of diverse agents.
We demonstrate its performance on several challenging real-world trajectory forecasting datasets.
arXiv Detail & Related papers (2020-01-09T16:47:17Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.