Fugu-MT 論文翻訳(概要): One World, Dual Timeline: Decoupled Spatio-Temporal Gaussian Scene Graph for 4D Cooperative Driving Reconstruction

論文の概要: One World, Dual Timeline: Decoupled Spatio-Temporal Gaussian Scene Graph for 4D Cooperative Driving Reconstruction

arxiv url: http://arxiv.org/abs/2605.07910v1
Date: Fri, 08 May 2026 15:48:57 GMT
ステータス: 翻訳完了
システム内更新日: 2026-05-11 19:43:39.176563
Title: One World, Dual Timeline: Decoupled Spatio-Temporal Gaussian Scene Graph for 4D Cooperative Driving Reconstruction
Title（参考訳）: 4次元協調運転再建のための時空間ガウスシーングラフを分離した2次元タイムライン
Authors: Yulong Chen, Xiaoyun Dong, Haoyu Zhang, Zongxian Yang, Lewei Xie, Xinke Li, Yifan Zhang, Kai Wang, Jianping Wang,
Abstract要約: 4次元協調運転再建のためのDust (Deco Upled Spatio-Temporal) Gaussian Scene Graphを提案する。 DUSTは最先端の性能を達成し、最強ベースラインよりも3.2dBのダイナミック領域PSNRを改善する。 DUSTはまた、フレシェビデオ距離を37.7%減らし、時間同期を大きく抑えている。
参考スコア（独自算出の注目度）: 24.030187603322076
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Reconstructing dynamic scenes from Vehicle-to-Infrastructure Cooperative Autonomous Driving (VICAD) data is fundamentally complicated by temporal asynchrony: vehicle and infrastructure cameras operate on independent clocks, capturing the same dynamic agent such as cars and pedestrians at different physical times. Existing Gaussian Scene Graph methods implicitly assume synchronized observations and assign a single pose per agent per frame, which is an assumption that breaks in cooperative settings, where the resulting gradient conflicts cause severe ghosting on dynamic agents. We identify this as a representation-level failure, not an optimization artifact: we prove that any single-timeline formulation incurs an irreducible photometric loss scaling quadratically with agent velocity and cross-source time offset. To resolve this, we propose Dust (DecoUpled Spatio-Temporal) Gaussian Scene Graph for 4D Cooperative Driving Reconstruction. DUST Gaussian Scene Graph shares a canonical Gaussian set per agent for appearance consistency, while maintaining decouple pose trajectories aligned to each source's true capture timestamps. We prove that this decoupling enables the pose-gradient kernel block-diagonal, eliminating cross-source interference entirely. To make Dust practical, we further introduce a static anchor-based pose correction pipeline that corrects spatio misalignment between vehicle and infrastructure annotations, and a pose-regularized joint optimization scheme that prevents trajectory jitter and drift during early training. On 26 sequences from V2X-Seq, DUST achieves state-of-the-art performance, improving dynamic-area PSNR by 3.2 dB over the strongest baseline and reducing Fréchet Video Distance by 37.7%, with keeping robustness under larger temporal asynchrony. Code is available at https://anonymous.4open.science/r/DUST-6A55.
Abstract（参考訳）: 車両とインフラカメラは独立した時計で動作し、車や歩行者などと同じ動的エージェントを物理的に捉えている。既存のガウスのシーングラフ法では、同期された観察を暗黙的に仮定し、フレームごとに1つのポーズを割り当てている。単一時間線の定式化は、エージェント速度とクロスソースタイムオフセットとを2次的に比較して、既約光度損失のスケーリングを引き起こすことを証明します。そこで我々は,Dust (Deco Upled Spatio-Temporal) Gaussian Scene Graph for 4D Cooperative Driving Reconstructionを提案する。 DUST Gaussian Scene Graphは、各ソースの真のキャプチャタイムスタンプに整合した2重ポーズトラジェクトリを維持しながら、外観整合性のためのエージェント毎の標準ガウスセットを共有する。この分離により、ポーズ段階のカーネルブロック対角線が実現し、ソース間の干渉を完全に排除できることを示す。 Dustを実用化するために、車両とインフラのアノテーション間の空間的不一致を補正する静的アンカーベースのポーズ補正パイプラインや、軌道ジッタやドリフトを早期訓練中に防止するポーズ規則化された共同最適化スキームも導入する。 V2X-Seqの26のシーケンスにおいて、DUSTは最先端の性能を達成し、最強のベースライン上で3.2dBのダイナミック領域PSNRを改善し、フレシェ・ビデオ距離を37.7%削減し、時間的同期の下で堅牢性を維持する。コードはhttps://anonymous.4open.science/r/DUST-6A55で公開されている。

論文の概要: One World, Dual Timeline: Decoupled Spatio-Temporal Gaussian Scene Graph for 4D Cooperative Driving Reconstruction

関連論文リスト