Fugu-MT Paper Translation (Abstract): TrajFusionNet: Pedestrian Crossing Intention Prediction via Fusion of Sequential and Visual Trajectory Representations

Paper overview: TrajFusionNet: Pedestrian Crossing Intention Prediction via Fusion of Sequential and Visual Trajectory Representations

arxiv url: http://arxiv.org/abs/2508.19866v1
Date: Wed, 27 Aug 2025 13:29:15 GMT
Status: Translation complete
Last updated in system: 2025-08-28 19:07:41.642183
Title: TrajFusionNet: Pedestrian Crossing Intention Prediction via Fusion of Sequential and Visual Trajectory Representations
Title (reference translation): TrajFusionNet: Pedestrian Crossing Intention Prediction via Fusion of Sequential and Visual Trajectory Representations
Authors: François G. Landry, Moulay A. Akhloufi
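Abstract summary: TrajFusionNet is a transformer-based model that predicts pedestrian crossing intention. It learns from a sequential representation of the observed and predicted pedestrian trajectory and vehicle speed. It achieves state-of-the-art results on the three most commonly used datasets for pedestrian crossing intention prediction.
Reference score (attention score computed independently by this site): 0.0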
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: With the introduction of vehicles with autonomous capabilities on public roads, predicting pedestrian crossing intention has emerged as an active area of research. The task of predicting pedestrian crossing intention involves determining whether pedestrians in the scene are likely to cross the road or not. In this work, we propose TrajFusionNet, a novel transformer-based model that combines future pedestrian trajectory and vehicle speed predictions as priors for predicting crossing intention. TrajFusionNet comprises two branches: a Sequence Attention Module (SAM) and a Visual Attention Module (VAM). The SAM branch learns from a sequential representation of the observed and predicted pedestrian trajectory and vehicle speed. Complementarily, the VAM branch enables learning from a visual representation of the predicted pedestrian trajectory by overlaying predicted pedestrian bounding boxes onto scene images. By utilizing a small number of lightweight modalities, TrajFusionNet achieves the lowest total inference time (including model runtime and data preprocessing) among current state-of-the-art approaches. In terms of performance, it achieves state-of-the-art results across the three most commonly used datasets for pedestrian crossing intention prediction.
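Abstract (reference translation): With the introduction of autonomous vehicles on public roads, predicting pedestrian crossing intention has emerged as an active area of research. The task of predicting pedestrian crossing intention is to determine whether pedestrians in the scene are likely to cross the road. In this work, we propose TrajFusionNet, a transformer-based model that combines future pedestrian trajectory and vehicle speed predictions as priors for predicting crossing intention. TrajFusionNet consists of two branches: a Sequence Attention Module (SAM) and a Visual Attention Module (VAM). The SAM branch learns from a sequential representation of the observed and predicted pedestrian trajectory and vehicle speed. Complementarily, the VAM branch learns from a visual representation of the predicted pedestrian trajectory, obtained by overlaying predicted pedestrian bounding boxes onto scene images. By using a small number of lightweight modalities, TrajFusionNet achieves the lowest total inference time (including model runtime and data preprocessing) among current state-of-the-art approaches. In terms of performance, it achieves state-of-the-art results on the three most commonly used datasets for pedestrian crossing intention prediction.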
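The abstract describes two kinds of input: a sequential representation of the observed and predicted trajectory plus vehicle speed (for SAM), and a scene image with predicted bounding boxes overlaid (for VAM). The following is a minimal sketch of how such inputs could be assembled, not the authors' implementation. The function names `build_sequence` and `overlay_boxes`, the `(x1, y1, x2, y2)` box layout, the 5-value feature vector, and the use of a grayscale image stored as nested lists are all assumptions made for illustration.

```python
from typing import List, Tuple

# Hypothetical box layout: (x1, y1, x2, y2) in pixel coordinates.
Box = Tuple[int, int, int, int]


def build_sequence(
    observed: List[Box],
    predicted: List[Box],
    observed_speed: List[float],
    predicted_speed: List[float],
) -> List[List[float]]:
    """Concatenate observed and predicted steps into one sequence.

    Each time step becomes [x1, y1, x2, y2, speed] (an assumed layout).
    """
    boxes = observed + predicted
    speeds = observed_speed + predicted_speed
    if len(boxes) != len(speeds):
        raise ValueError("each box needs exactly one speed value")
    return [[float(v) for v in box] + [float(s)] for box, s in zip(boxes, speeds)]


def overlay_boxes(
    image: List[List[int]], boxes: List[Box], value: int = 255
) -> List[List[int]]:
    """Return a copy of a grayscale image with each box outline drawn on it."""
    height, width = len(image), len(image[0])
    out = [row[:] for row in image]  # leave the input image untouched
    for x1, y1, x2, y2 in boxes:
        # Clamp to the image; a box lying fully outside is skipped.
        x1, x2 = max(0, x1), min(width - 1, x2)
        y1, y2 = max(0, y1), min(height - 1, y2)
        if x1 > x2 or y1 > y2:
            continue
        for x in range(x1, x2 + 1):  # top and bottom edges
            out[y1][x] = value
            out[y2][x] = value
        for y in range(y1, y2 + 1):  # left and right edges
            out[y][x1] = value
            out[y][x2] = value
    return out


if __name__ == "__main__":
    seq = build_sequence([(0, 0, 2, 2)], [(1, 1, 3, 3)], [5.0], [4.5])
    print(len(seq), len(seq[0]))
    scene = [[0] * 8 for _ in range(6)]
    for row in overlay_boxes(scene, [(1, 1, 3, 4)]):
        print(row)
```

In this sketch the predicted boxes are drawn as outlines only, so the scene content inside each box stays visible. How the actual model renders the overlay is not stated in the abstract.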

Related papers

Multi-Vehicle Trajectory Prediction at Intersections using State and Intention Information [50.40632021583213]
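Traditional approaches to predicting the future trajectories of road agents rely on knowing their past trajectories. This work instead relies on knowing the current state and intended direction to make predictions for multiple vehicles at intersections. Messages carrying this information between vehicles give each one a more comprehensive overview of its environment.
Paper reference translation (metadata) (2023-01-06T15:13:23Z)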
Pedestrian Stop and Go Forecasting with Hybrid Feature Fusion [87.77727495366702]
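We introduce the new task of pedestrian stop-and-go forecasting. We release Trans, a benchmark for explicitly studying the stop-and-go behavior of pedestrians in urban traffic. It is built from several existing datasets annotated with pedestrian walking motions, so that it covers a variety of scenarios and behaviors.
Paper reference translation (metadata) (2022-03-04T18:39:31Z)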
PePScenes: A Novel Dataset and Baseline for Pedestrian Action Prediction in 3D [10.580548257913843]
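We propose a new pedestrian action prediction dataset created by adding per-frame 2D/3D bounding box and behavior annotations to nuScenes. We also propose a hybrid neural network architecture that incorporates various data modalities for predicting pedestrian crossing action.
Paper reference translation (metadata) (2020-12-14T18:13:44Z)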
Pedestrian Intention Prediction: A Multi-task Perspective [83.7135926821794]
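To be deployed globally, autonomous vehicles must guarantee pedestrian safety. This work attempts to address the problem by jointly predicting pedestrian intention and visual state. The method is a recurrent neural network used in a multi-task learning approach.
Paper reference translation (metadata) (2020-10-20T13:42:31Z)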
Vehicle Trajectory Prediction in Crowded Highway Scenarios Using Bird Eye View Representations and CNNs [0.0]
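This paper describes a novel approach to vehicle trajectory prediction using graphical representations. The problem is framed as an image regression problem in which a network is trained to learn the underlying relationships between traffic participants. The model has been tested in highway scenarios with more than 30 vehicles simultaneously in two opposite traffic flows.
Paper reference translation (metadata) (2020-08-26T11:15:49Z)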
TNT: Target-driveN Trajectory Prediction [76.21200047185494]
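We develop a target-driven trajectory prediction framework for moving agents. We benchmark trajectory prediction for vehicles and pedestrians. We achieve state-of-the-art results on Argoverse Forecasting, InterAction, Stanford Drone, and an in-house Pedestrian-at-Intersection dataset.
Paper reference translation (metadata) (2020-08-19T06:52:46Z)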
Probabilistic Crowd GAN: Multimodal Pedestrian Trajectory Prediction using a Graph Vehicle-Pedestrian Attention Network [12.070251470948772]
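We show how Probabilistic Crowd GAN can generate probabilistic multimodal predictions. We also propose a Graph Vehicle-Pedestrian Attention Network (GVAT) that models social interactions. We demonstrate improvements over the existing state of the art in trajectory prediction and describe how to directly model the true multimodality and uncertainty of crowd interactions.
Paper reference translation (metadata) (2020-06-23T11:25:16Z)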
TPNet: Trajectory Proposal Network for Motion Prediction [81.28716372763128]
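Trajectory Proposal Network (TPNet) is a novel two-stage motion prediction framework. TPNet first generates a candidate set of future trajectories as hypothesis proposals, then makes the final prediction by classifying and refining those proposals. Experiments on four large-scale trajectory prediction datasets show that TPNet achieves state-of-the-art results both quantitatively and qualitatively.
Paper reference translation (metadata) (2020-04-26T00:01:49Z)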
Spatiotemporal Relationship Reasoning for Pedestrian Intent Prediction [57.56466850377598]
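Reasoning over visual data is a desirable capability for robotics and vision-based applications. In this paper, we propose a graph-based framework that uncovers relationships between different objects in the scene in order to reason about pedestrian intent. Pedestrian intent is defined as the future action of crossing or not crossing the street, which is very important information for autonomous vehicles.
Paper reference translation (metadata) (2020-02-20T18:50:44Z)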

The related papers list is generated automatically from the titles and abstracts of papers on this site.
