Social LSTM with Dynamic Occupancy Modeling for Realistic Pedestrian Trajectory Prediction
- URL: http://arxiv.org/abs/2511.09735v1
- Date: Fri, 14 Nov 2025 01:06:53 GMT
- Title: Social LSTM with Dynamic Occupancy Modeling for Realistic Pedestrian Trajectory Prediction
- Authors: Ahmed Alia, Mohcine Chraibi, Armin Seyfried,
- Abstract summary: This paper proposes a novel deep learning model that enhances the Social LSTM with a new Dynamic Occupied Space loss function.<n>This loss function guides Social LSTM in learning to avoid realistic collisions without increasing displacement error across different crowd densities.<n>The proposed model consistently outperforms several state-of-the-art deep learning models across most test sets.
- Score: 2.1711205684359247
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: In dynamic and crowded environments, realistic pedestrian trajectory prediction remains a challenging task due to the complex nature of human motion and the mutual influences among individuals. Deep learning models have recently achieved promising results by implicitly learning such patterns from 2D trajectory data. However, most approaches treat pedestrians as point entities, ignoring the physical space that each person occupies. To address these limitations, this paper proposes a novel deep learning model that enhances the Social LSTM with a new Dynamic Occupied Space loss function. This loss function guides Social LSTM in learning to avoid realistic collisions without increasing displacement error across different crowd densities, ranging from low to high, in both homogeneous and heterogeneous density settings. Such a function achieves this by combining the average displacement error with a new collision penalty that is sensitive to scene density and individual spatial occupancy. For efficient training and evaluation, five datasets were generated from real pedestrian trajectories recorded during the Festival of Lights in Lyon 2022. Four datasets represent homogeneous crowd conditions -- low, medium, high, and very high density -- while the fifth corresponds to a heterogeneous density distribution. The experimental findings indicate that the proposed model not only lowers collision rates but also enhances displacement prediction accuracy in each dataset. Specifically, the model achieves up to a 31% reduction in the collision rate and reduces the average displacement error and the final displacement error by 5% and 6%, respectively, on average across all datasets compared to the baseline. Moreover, the proposed model consistently outperforms several state-of-the-art deep learning models across most test sets.
Related papers
- UniSH: Unifying Scene and Human Reconstruction in a Feed-Forward Pass [83.7071371474926]
UniSH is a unified, feed-forward framework for joint metric-scale 3D scene and human reconstruction.<n>Our framework bridges strong, disparate priors from scene reconstruction and HMR.<n>Our model achieves state-of-the-art performance on human-centric scene reconstruction.
arXiv Detail & Related papers (2026-01-03T16:06:27Z) - Grasp-HGN: Grasping the Unexpected [1.2174549154334797]
Current grasp models perform poorly on unseen objects, negatively affecting users' independence and quality of life.<n>We propose Grasp-LLaVA, a Grasp Vision Language Model enabling human-like reasoning to infer the suitable grasp type estimate.<n>Finally, we propose Hybrid Grasp Network (HGN), an edge-cloud deployment infrastructure enabling fast grasp estimation on edge.
arXiv Detail & Related papers (2025-08-11T05:58:28Z) - OmniTraj: Pre-Training on Heterogeneous Data for Adaptive and Zero-Shot Human Trajectory Prediction [62.385417528148224]
We present OmniTraj, a Transformer-based model pre-trained on a large-scale, heterogeneous dataset.<n>Experiments show that explicitly conditioning on the frame rate enables OmniTraj to achieve state-of-the-art zero-shot transfer performance.
arXiv Detail & Related papers (2025-07-31T15:37:09Z) - Efficient Federated Learning with Heterogeneous Data and Adaptive Dropout [62.73150122809138]
Federated Learning (FL) is a promising distributed machine learning approach that enables collaborative training of a global model using multiple edge devices.<n>We propose the FedDHAD FL framework, which comes with two novel methods: Dynamic Heterogeneous model aggregation (FedDH) and Adaptive Dropout (FedAD)<n>The combination of these two methods makes FedDHAD significantly outperform state-of-the-art solutions in terms of accuracy (up to 6.7% higher), efficiency (up to 2.02 times faster), and cost (up to 15.0% smaller)
arXiv Detail & Related papers (2025-07-14T16:19:00Z) - 4D-VLA: Spatiotemporal Vision-Language-Action Pretraining with Cross-Scene Calibration [31.111439909825627]
Existing methods typically model the dataset's action distribution using simple observations as inputs.<n>We propose 4D-VLA, a novel approach that effectively integrates 4D information into the input to these sources of chaos.<n>Our model consistently outperforms existing methods, demonstrating stronger spatial understanding and adaptability.
arXiv Detail & Related papers (2025-06-27T14:09:29Z) - Enhancing Monocular Height Estimation via Sparse LiDAR-Guided Correction [25.268217296287688]
We investigate a state-of-the-art MHE model trained purely on synthetic data.<n>We find that the model relies heavily on shadow cues, a factor that can lead to overestimation or underestimation of heights.<n>We propose a novel correction pipeline that integrates sparse, imperfect global LiDAR measurements.
arXiv Detail & Related papers (2025-05-11T08:54:09Z) - SSTP: Efficient Sample Selection for Trajectory Prediction [38.92588125424176]
Training advanced trajectory prediction models on existing large-scale datasets is time-consuming and computationally expensive.<n>We propose the SSTP framework, which constructs a compact yet density-balanced dataset tailored to trajectory prediction.<n> Experiments show that SSTP achieves comparable performance to full-dataset training using only half the data.
arXiv Detail & Related papers (2024-09-25T22:00:11Z) - Towards Understanding and Mitigating Dimensional Collapse in Heterogeneous Federated Learning [112.69497636932955]
Federated learning aims to train models across different clients without the sharing of data for privacy considerations.
We study how data heterogeneity affects the representations of the globally aggregated models.
We propose sc FedDecorr, a novel method that can effectively mitigate dimensional collapse in federated learning.
arXiv Detail & Related papers (2022-10-01T09:04:17Z) - Uncertainty Quantification Techniques for Space Weather Modeling:
Thermospheric Density Application [0.0]
We propose two techniques to develop nonlinear ML models to predict thermospheric density.
We show the performance for models trained on local and global datasets.
We achieve errors of 11% on independent test data with well-calibrated uncertainty estimates.
arXiv Detail & Related papers (2022-01-06T14:17:50Z) - X-model: Improving Data Efficiency in Deep Learning with A Minimax Model [78.55482897452417]
We aim at improving data efficiency for both classification and regression setups in deep learning.
To take the power of both worlds, we propose a novel X-model.
X-model plays a minimax game between the feature extractor and task-specific heads.
arXiv Detail & Related papers (2021-10-09T13:56:48Z) - SGCN:Sparse Graph Convolution Network for Pedestrian Trajectory
Prediction [64.16212996247943]
We present a Sparse Graph Convolution Network(SGCN) for pedestrian trajectory prediction.
Specifically, the SGCN explicitly models the sparse directed interaction with a sparse directed spatial graph to capture adaptive interaction pedestrians.
visualizations indicate that our method can capture adaptive interactions between pedestrians and their effective motion tendencies.
arXiv Detail & Related papers (2021-04-04T03:17:42Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.