Fugu-MT 論文翻訳(概要): Temporal Sampling Frequency Matters: A Capacity-Aware Study of End-to-End Driving Trajectory Prediction

論文の概要: Temporal Sampling Frequency Matters: A Capacity-Aware Study of End-to-End Driving Trajectory Prediction

arxiv url: http://arxiv.org/abs/2605.10388v1
Date: Mon, 11 May 2026 11:34:42 GMT
ステータス: 翻訳完了
システム内更新日: 2026-05-12 23:28:50.769107
Title: Temporal Sampling Frequency Matters: A Capacity-Aware Study of End-to-End Driving Trajectory Prediction
Title（参考訳）: 時間サンプリング周波数問題:エンド・ツー・エンド駆動軌道予測のキャパシティ・アウェアによる検討
Authors: Yumao Liu, Tao Liu, Xiangyu Li, Jiaxiang Li, Ke Ma,
Abstract要約: エンド・ツー・エンド(E2E)の自律走行軌道予測は、しばしば最高時間周波数でサンプリングされたカメラフレームで訓練される。時間サンプリング周波数を明示的なトレーニングセット設計変数として扱うことにより、この仮定を疑問視する。各モデルデータセットペアに対して、固定されたプロトコルの下で同じモデルをトレーニングし、評価するので、周波数応答はサンプリング周波数による予測性能の変化を反映する。
参考スコア（独自算出の注目度）: 7.358157927566997
License: http://creativecommons.org/licenses/by/4.0/
Abstract: End to end (E2E) autonomous driving trajectory prediction is often trained with camera frames sampled at the highest available temporal frequency, assuming that denser sampling improves performance. We question this assumption by treating temporal sampling frequency as an explicit training set design variable. Starting from high frequency E2E driving datasets, we construct frequency sweep training sets by temporally subsampling camera frames along each trajectory. For each model dataset pair, we train and evaluate the same model under a fixed protocol, so the frequency response reflects how prediction performance changes with sampling frequency. We analyze this response from a capacity aware perspective. Sparse sampling may miss driving relevant cues, while dense sampling may add redundant visual content and off manifold noise. For finite capacity models, this can create a driving irrelevant capacity burden. We evaluate three smaller E2E models and a larger VLA style AutoVLA model on Waymo, nuScenes, and PAVE. Results show model and dataset dependent frequency responses. Smaller E2E models often show non monotonic or near plateau trends and achieve their best 3 second ADE at lower or intermediate frequencies. In contrast, AutoVLA achieves its best 3 second ADE and FDE at the highest evaluated frequency on all three datasets. Iteration matched controls suggest that the advantage of lower or intermediate frequencies for smaller models is not explained only by unequal training update counts. These findings show that temporal sampling frequency should be reported and tuned, rather than fixed to the highest available value.
Abstract（参考訳）: エンド・ツー・エンド(E2E)の自律走行軌道予測は、高密度サンプリングにより性能が向上すると仮定して、最高時間周波数でサンプリングされたカメラフレームでしばしば訓練される。時間サンプリング周波数を明示的なトレーニングセット設計変数として扱うことにより、この仮定を疑問視する。高周波E2E駆動データセットから、各軌道に沿ってカメラフレームを時間的にサブサンプリングすることで、周波数スイープ訓練セットを構築する。各モデルデータセットペアに対して、固定されたプロトコルの下で同じモデルをトレーニングし、評価するので、周波数応答はサンプリング周波数による予測性能の変化を反映する。我々はこの応答をキャパシティ・アウェアネスの観点から分析する。スパースサンプリングは関連する手がかりを見逃しかねないが、高密度サンプリングは冗長な視覚的内容とオフ多様体ノイズを付加する可能性がある。有限容量モデルの場合、これは無関係なキャパシティ負荷を引き起こす可能性がある。 Waymo, nuScenes, PAVEの3つの小型E2Eモデルと大型VLAスタイルのAutoVLAモデルを評価した。結果はモデルとデータセット依存周波数応答を示す。より小さなE2Eモデルは、非単調または近高原の傾向を示し、低い周波数または中間周波数で最高の3秒ADEを達成する。対照的にAutoVLAは、3つのデータセットで最高評価周波数で3秒のADEとFDEを達成している。反復整合制御は、より小さなモデルの低周波または中間周波の利点が不平等なトレーニング更新数によってのみ説明されないことを示唆している。これらの結果から,時間的サンプリング頻度は,最も高い値に固定されるのではなく,報告・調整されるべきであることが示唆された。

論文の概要: Temporal Sampling Frequency Matters: A Capacity-Aware Study of End-to-End Driving Trajectory Prediction

関連論文リスト