Fugu-MT 論文翻訳(概要): ESIA: An Energy-Based Spatiotemporal Interaction-Aware Framework for Pedestrian Intention Prediction

論文の概要: ESIA: An Energy-Based Spatiotemporal Interaction-Aware Framework for Pedestrian Intention Prediction

arxiv url: http://arxiv.org/abs/2604.23728v1
Date: Sun, 26 Apr 2026 14:13:58 GMT
ステータス: 翻訳完了
システム内更新日: 2026-04-28 17:12:07.523963
Title: ESIA: An Energy-Based Spatiotemporal Interaction-Aware Framework for Pedestrian Intention Prediction
Title（参考訳）: ESIA: 歩行者の意図予測のための時空間相互作用認識フレームワーク
Authors: Yanping Wu, Meiting Dang, Lin Wu, Edmond S. L. Ho, Zhenghua Chen, Chongfeng Wei,
Abstract要約: 自動運転の最近の進歩は、歩行者の意図予測の研究を動機付けている。既存の研究は、単純化されたマルチエージェント相互作用パターン、不透明な推論論理、グローバルな行動予測の欠如によって制約されている。本研究では,歩行者と環境をノードとして扱うための条件ランダムフィールド(CRF)に基づくパラダイムを提案する。
参考スコア（独自算出の注目度）: 24.45524909253344
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Recent advances in autonomous driving have motivated research on pedestrian intention prediction, which aims to infer future crossing decisions and actions by modeling temporal dynamics, social interactions, and environmental context. However, existing studies remain constrained by oversimplified multi-agent interaction patterns, opaque reasoning logic, and a lack of global consistency in behavioral predictions, which compromise both robustness and interpretability. In this work, we propose ESIA (Energy-based Spatiotemporal Interaction-Aware framework), a novel Conditional Random Field (CRF)-based paradigm. We cast the intention prediction task as a structured prediction problem over a unified graph-based representation, treating pedestrians and the environment as spatiotemporal nodes. To characterize their distinct roles, we assign unary potentials to nodes to capture individual intentions, and pairwise potentials to edges to encode social and environmental interactions. These potentials are integrated into a unified global energy function to ensure scene-level consistency across behavioral predictions. To further constrain inference without ground-truth supervision, we introduce structural consistency terms to penalize logical contradictions. This optimization is efficiently solved via a novel Unary-Seeded Simulated Annealing (U-SSA) algorithm, which leverages high-confidence unary priors to rapidly converge to a high-quality solution. Extensive experiments on standard benchmarks demonstrate that ESIA achieves state-of-the-art performance with improved interpretability over existing methods.
Abstract（参考訳）: 自律運転の最近の進歩は、時間的ダイナミクス、社会的相互作用、環境コンテキストをモデル化することによって、将来の横断決定や行動を予測する歩行者意図予測の研究を動機付けている。しかし、既存の研究は、過度に単純化されたマルチエージェントの相互作用パターン、不透明な推論論理、そして頑健さと解釈可能性の両方を損なう行動予測におけるグローバルな一貫性の欠如によって制約されている。本研究では,新しい条件付きランダムフィールド(CRF)のパラダイムであるESIA(Energy-based Spatiotemporal Interaction-Aware framework)を提案する。本研究では,歩行者と環境を時空間ノードとして扱うため,意図予測タスクをグラフベースの統一表現上の構造化予測問題として用いた。それぞれの役割を特徴付けるために、ノードに単一ポテンシャルを割り当て、個々の意図を捉え、エッジにペアにポテンシャルを割り当て、社会的および環境的相互作用を符号化する。これらのポテンシャルは、行動予測におけるシーンレベルの一貫性を確保するため、統一されたグローバルエネルギー関数に統合される。地道な監督なしに推論をさらに制約するために、論理的矛盾を罰する構造的一貫性項を導入する。この最適化は、高品質な解に迅速に収束するために、高信頼な単項前処理を利用する新しいUnary-Seeded Simulated Annealing (U-SSA)アルゴリズムによって効率よく解決される。標準ベンチマークの大規模な実験は、ESIAが既存の手法よりも高い解釈性で最先端のパフォーマンスを達成することを実証している。

論文の概要: ESIA: An Energy-Based Spatiotemporal Interaction-Aware Framework for Pedestrian Intention Prediction

関連論文リスト