Fugu-MT 論文翻訳(概要): Behavior-Grounded Lane Representation Learning for Multi-Task Traffic Digital Twins

論文の概要: Behavior-Grounded Lane Representation Learning for Multi-Task Traffic Digital Twins

arxiv url: http://arxiv.org/abs/2605.01901v1
Date: Sun, 03 May 2026 14:31:41 GMT
ステータス: 翻訳完了
システム内更新日: 2026-05-05 20:33:49.986093
Title: Behavior-Grounded Lane Representation Learning for Multi-Task Traffic Digital Twins
Title（参考訳）: マルチタスクトラヒックディジタル双生児のための行動学習型レーン表現学習
Authors: Rei Tamaru, Pei Li, Bin Ran,
Abstract要約: 交通デジタル双生児のための行動基底車線表現学習フレームワークGeoLaneRepを紹介する。 GeoLaneRepは、静的レーン幾何学、観測された車両軌跡、および運用ディスクリプタを共有カメラ間のセマンティック埋め込みに共同でエンコードする。同一の挙動埋め込みが拡散型ジェネレータを条件として,目的とする運用仕様を満たすレーンジオメトリを合成可能であることを示す。
参考スコア（独自算出の注目度）: 22.723849089108977
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Traffic digital twins are powerful tools for advanced traffic management, and most systems are built on static geometric representations. However, these representations fail to capture the dynamic functional semantics required for behavior-aware reasoning, such as how a lane operates under complex traffic conditions. To address this gap, we introduce GeoLaneRep, a behavior-grounded lane representation learning framework for traffic digital twins. GeoLaneRep jointly encodes static lane geometry, observed vehicle trajectories, and operational descriptors into a shared, cross-camera semantic embedding. The encoder is trained with a joint objective combining contrastive cross-camera alignment, auxiliary role supervision, and temporal anomaly detection. Across 16 roadside cameras and 132 lanes, the learned embeddings achieve a $0.004$ lateral-rank error and an edge-role F1 of $1.000$ in zero-shot cross-camera matching, and an AUROC of $0.991$ for window-level anomaly detection. We further show that the same behavioral embeddings can condition a diffusion-based generator to synthesize lane geometries that satisfy targeted operational specifications, with $87.9\%$ overall specification accuracy across 38 lane groups. GeoLaneRep thus provides a semantic interface between roadside observations and downstream digital twin tasks, supporting cross-camera transfer, behavior-aware monitoring, and goal-directed lane synthesis. The framework is openly available at https://github.com/raynbowy23/GeoLaneRep.
Abstract（参考訳）: トラフィックデジタルツインは高度なトラフィック管理のための強力なツールであり、ほとんどのシステムは静的な幾何学的表現に基づいて構築されている。しかしながら、これらの表現は、複雑な交通条件下で車線がどのように機能するかなど、行動認識推論に必要な動的な機能的意味を捉えることができない。このギャップに対処するために,交通デジタル双生児のための行動基底車線表現学習フレームワークGeoLaneRepを紹介する。 GeoLaneRepは、静的レーン幾何学、観測された車両軌跡、および運用ディスクリプタを共有カメラ間のセマンティック埋め込みに共同でエンコードする。エンコーダは、コントラスト的なクロスカメラアライメント、補助的役割監督、時間的異常検出を組み合わせた共同目標で訓練される。 16台の路面カメラと132車線にまたがって、学習された埋め込みは横方向の誤差0.004ドル、エッジロールF1はゼロショットのクロスカメラマッチングで1,000ドル、AUROCは0.991ドルでウィンドウレベルの異常検出を行う。さらに、同じ挙動埋め込みが拡散型ジェネレータを条件として、対象とする運用仕様を満たすレーンジオメトリーを合成し、38車線群で全体の仕様精度を87.9 %$に設定できることを示す。これにより、GeoLaneRepは、路面観察と下流デジタルツインタスク間のセマンティックインターフェースを提供し、クロスカメラ転送、行動認識監視、ゴール指向レーン合成をサポートする。フレームワークはhttps://github.com/raynbowy23/GeoLaneRepで公開されている。

論文の概要: Behavior-Grounded Lane Representation Learning for Multi-Task Traffic Digital Twins

関連論文リスト