Fugu-MT 論文翻訳(概要): GaussianCaR: Gaussian Splatting for Efficient Camera-Radar Fusion

論文の概要: GaussianCaR: Gaussian Splatting for Efficient Camera-Radar Fusion

arxiv url: http://arxiv.org/abs/2602.08784v1
Date: Mon, 09 Feb 2026 15:25:19 GMT
ステータス: 翻訳完了
システム内更新日: 2026-03-23 08:17:41.290926
Title: GaussianCaR: Gaussian Splatting for Efficient Camera-Radar Fusion
Title（参考訳）: GaussianCaR: 効率的なカメラレーダ融合のためのガウススプラッティング
Authors: Santiago Montiel-Marín, Miguel Antunes-García, Fabio Sánchez-García, Angel Llamazares, Holger Caesar, Luis M. Bergasa,
Abstract要約: 実験の結果,本手法はBEVセグメンテーションタスクにおける技術状況に匹敵する,あるいは超えた性能を達成できることが示された。私たちの主な貢献は、BEVセグメンテーションのためのエンドツーエンドネットワークであるGaussianCaRです。
参考スコア（独自算出の注目度）: 8.829313789934693
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Robust and accurate perception of dynamic objects and map elements is crucial for autonomous vehicles performing safe navigation in complex traffic scenarios. While vision-only methods have become the de facto standard due to their technical advances, they can benefit from effective and cost-efficient fusion with radar measurements. In this work, we advance fusion methods by repurposing Gaussian Splatting as an efficient universal view transformer that bridges the view disparity gap, mapping both image pixels and radar points into a common Bird's-Eye View (BEV) representation. Our main contribution is GaussianCaR, an end-to-end network for BEV segmentation that, unlike prior BEV fusion methods, leverages Gaussian Splatting to map raw sensor information into latent features for efficient camera-radar fusion. Our architecture combines multi-scale fusion with a transformer decoder to efficiently extract BEV features. Experimental results demonstrate that our approach achieves performance on par with, or even surpassing, the state of the art on BEV segmentation tasks (57.3%, 82.9%, and 50.1% IoU for vehicles, roads, and lane dividers) on the nuScenes dataset, while maintaining a 3.2x faster inference runtime. Code and project page are available online.
Abstract（参考訳）: 複雑な交通シナリオにおいて安全なナビゲーションを行う自動運転車にとって、動的オブジェクトとマップ要素のロバストで正確な認識が不可欠である。視覚のみの手法は技術的進歩によりデファクトスタンダードとなっているが、レーダー計測による効果的で費用効率のよい融合の恩恵を受けることができる。本研究では,画像画素とレーダポイントの両方を共通のバードアイビュー(BEV)表現にマッピングし,ビューの格差を埋める効率的なユニバーサルビュートランスフォーマーとしてガウススプラッティングを再利用することで,融合法を推し進める。我々の主な貢献は、BEVセグメンテーションのためのエンドツーエンドネットワークであるGaussianCaRである。これは、従来のBEV融合法とは異なり、Gaussian Splattingを利用して、生センサ情報を潜在機能にマッピングし、効率的なカメラレーダ融合を実現する。我々のアーキテクチャはマルチスケール融合と変圧器デコーダを組み合わせて効率よくBEV特徴を抽出する。実験結果から,本手法は,車両,道路,レーンディバイザにおけるBEVセグメンテーションタスク(57.3%,82.9%,50.1%IoU)の精度を3.2倍高速な推論ランタイムを維持しつつ,その性能を達成できた。コードとプロジェクトページはオンラインで公開されている。

論文の概要: GaussianCaR: Gaussian Splatting for Efficient Camera-Radar Fusion

関連論文リスト