Fugu-MT 論文翻訳(概要): Training Machine Learning Models on Human Spatio-temporal Mobility Data: An Experimental Study [Experiment Paper]

論文の概要: Training Machine Learning Models on Human Spatio-temporal Mobility Data: An Experimental Study [Experiment Paper]

arxiv url: http://arxiv.org/abs/2508.13135v2
Date: Tue, 19 Aug 2025 01:52:50 GMT
ステータス: 翻訳完了
システム内更新日: 2025-08-20 13:30:22.903733
Title: Training Machine Learning Models on Human Spatio-temporal Mobility Data: An Experimental Study [Experiment Paper]
Title（参考訳）: 時空間移動データに基づく機械学習モデルの学習:実験的検討 [実験論文]
Authors: Yueyang Liu, Lance Kennedy, Ruochen Kong, Joon-Seok Kim, Andreas Züfle,
Abstract要約: 個人レベルの人体移動予測は、感染症のモニタリング、子供、高齢者のケアへの応用に関する重要な研究のトピックとして浮上している。機械学習モデルをトレーニングするためのベストプラクティスを決定すること。意味情報を明示的に含めることで、モデルが生活の個々のパターンをよりよく理解し、予測を改善することができることを示す。
参考スコア（独自算出の注目度）: 0.5382679710017696
License: http://creativecommons.org/licenses/by-sa/4.0/
Abstract: Individual-level human mobility prediction has emerged as a significant topic of research with applications in infectious disease monitoring, child, and elderly care. Existing studies predominantly focus on the microscopic aspects of human trajectories: such as predicting short-term trajectories or the next location visited, while offering limited attention to macro-level mobility patterns and the corresponding life routines. In this paper, we focus on an underexplored problem in human mobility prediction: determining the best practices to train a machine learning model using historical data to forecast an individuals complete trajectory over the next days and weeks. In this experiment paper, we undertake a comprehensive experimental analysis of diverse models, parameter configurations, and training strategies, accompanied by an in-depth examination of the statistical distribution inherent in human mobility patterns. Our empirical evaluations encompass both Long Short-Term Memory and Transformer-based architectures, and further investigate how incorporating individual life patterns can enhance the effectiveness of the prediction. We show that explicitly including semantic information such as day-of-the-week and user-specific historical information can help the model better understand individual patterns of life and improve predictions. Moreover, since the absence of explicit user information is often missing due to user privacy, we show that the sampling of users may exacerbate data skewness and result in a substantial loss in predictive accuracy. To mitigate data imbalance and preserve diversity, we apply user semantic clustering with stratified sampling to ensure that the sampled dataset remains representative. Our results further show that small-batch stochastic gradient optimization improves model performance, especially when human mobility training data is limited.
Abstract（参考訳）: 個人レベルの人体移動予測は、感染症のモニタリング、子供、高齢者のケアへの応用に関する重要な研究のトピックとして浮上している。既存の研究は主に人間の軌道の微視的な側面に焦点を当てており、例えば短期的な軌跡の予測や、来訪した場所の予測などであり、マクロレベルの移動パターンやそれに対応する生活習慣に限定的な注意を払っている。本稿では、過去のデータを用いて機械学習モデルを訓練し、次の日と数週間にわたって個人が完全な軌道を予測するためのベストプラクティスを決定するという、人間の移動予測における未探索の問題に焦点を当てる。本研究では, 多様なモデル, パラメータ構成, トレーニング戦略の総合的な実験分析を行い, 人間の移動パターンに固有の統計分布の詳細な検討を行った。我々の経験的評価は、長期記憶とトランスフォーマーに基づくアーキテクチャの両方を包含し、個別の生活パターンを取り入れることで、予測の有効性を高めることができる。日報やユーザ固有の歴史的情報などの意味情報を明示的に含めることで、モデルが生活の個々のパターンをよりよく理解し、予測を改善することができることを示す。さらに、ユーザのプライバシーが原因で明示的なユーザ情報が欠落することがしばしばあることから、ユーザのサンプリングによってデータの歪が悪化し、予測精度が著しく低下する可能性があることを示す。データの不均衡を緩和し,多様性を保ちつつ,ユーザセマンティッククラスタリングと階層化サンプリングを適用し,サンプルデータセットが代表的であることを保証する。以上の結果から,小型確率勾配最適化は,特に人体移動訓練データに制限がある場合,モデル性能を向上させることが示唆された。

論文の概要: Training Machine Learning Models on Human Spatio-temporal Mobility Data: An Experimental Study [Experiment Paper]

関連論文リスト