Fugu-MT 論文翻訳(概要): NAP3D: NeRF Assisted 3D-3D Pose Alignment for Autonomous Vehicles

論文の概要: NAP3D: NeRF Assisted 3D-3D Pose Alignment for Autonomous Vehicles

arxiv url: http://arxiv.org/abs/2512.15080v1
Date: Wed, 17 Dec 2025 04:56:38 GMT
ステータス: 翻訳完了
システム内更新日: 2026-03-23 08:17:40.388336
Title: NAP3D: NeRF Assisted 3D-3D Pose Alignment for Autonomous Vehicles
Title（参考訳）: NAP3D:NeRFによる自動運転車用3D-3Dポッドアライメント
Authors: Gaurav Bansal,
Abstract要約: この研究は、エージェントの現在の深度画像と事前訓練されたニューラルレージアンスフィールド(NeRF)の3D-3D対応を利用する補完的アプローチである、NeRF-Assisted 3D Pose Alignment (NAP3D)を導入している。 NAP3Dは、これまで観測された場所を再考することに頼ることなく、新しい視点からでも推定されたポーズを洗練させる。実験により、NAP3Dはカスタムデータセット上で5cm以内のカメラポーズ補正を達成し、2D-3D Perspective-N-Pointベースラインを頑健に上回っている。
参考スコア（独自算出の注目度）: 1.1168121941015012
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Accurate localization is essential for autonomous vehicles, yet sensor noise and drift over time can lead to significant pose estimation errors, particularly in long-horizon environments. A common strategy for correcting accumulated error is visual loop closure in SLAM, which adjusts the pose graph when the agent revisits previously mapped locations. These techniques typically rely on identifying visual mappings between the current view and previously observed scenes and often require fusing data from multiple sensors. In contrast, this work introduces NeRF-Assisted 3D-3D Pose Alignment (NAP3D), a complementary approach that leverages 3D-3D correspondences between the agent's current depth image and a pre-trained Neural Radiance Field (NeRF). By directly aligning 3D points from the observed scene with synthesized points from the NeRF, NAP3D refines the estimated pose even from novel viewpoints, without relying on revisiting previously observed locations. This robust 3D-3D formulation provides advantages over conventional 2D-3D localization methods while remaining comparable in accuracy and applicability. Experiments demonstrate that NAP3D achieves camera pose correction within 5 cm on a custom dataset, robustly outperforming a 2D-3D Perspective-N-Point baseline. On TUM RGB-D, NAP3D consistently improves 3D alignment RMSE by approximately 6 cm compared to this baseline given varying noise, despite PnP achieving lower raw rotation and translation parameter error in some regimes, highlighting NAP3D's improved geometric consistency in 3D space. By providing a lightweight, dataset-agnostic tool, NAP3D complements existing SLAM and localization pipelines when traditional loop closure is unavailable.
Abstract（参考訳）: 正確な位置決めは自動運転車にとって不可欠であるが、センサーノイズと時間とともにドリフトすることで、特にロングホライゾン環境では、重要なポーズ推定エラーにつながる可能性がある。累積誤差を修正するための一般的な戦略はSLAMの視覚ループクロージャであり、エージェントが以前マップされた場所を再訪したときにポーズグラフを調整する。これらの技術は、通常、現在のビューと以前観察されたシーンの間の視覚的マッピングを識別することに依存しており、しばしば複数のセンサーからデータを取り出す必要がある。対照的に、この研究は、エージェントの現在の深度画像と事前訓練されたニューラルレージアンスフィールド(NeRF)の間の3D-3D対応を利用する補完的アプローチである、NeRF-Assisted 3D Pose Alignment (NAP3D)を導入している。 NAP3Dは、観測現場の3D点をNeRFから合成された点と直列させることで、従来観測されていた位置を再考することなく、新しい視点からでも推定されたポーズを洗練する。このロバストな3D-3D定式化は、従来の2D-3Dローカライゼーション法よりも優れているが、精度と適用性は同等である。実験により、NAP3Dはカスタムデータセット上で5cm以内のカメラポーズ補正を達成し、2D-3D Perspective-N-Pointベースラインを頑健に上回っている。 TUM RGB-Dでは、PnPが低い生回転と翻訳パラメータの誤差を達成しているにも関わらず、NAP3Dは3D空間における幾何整合性を向上している。軽量でデータセットに依存しないツールを提供することで、NAP3Dは従来のループクロージャが利用できない場合に、既存のSLAMとローカライゼーションパイプラインを補完する。

論文の概要: NAP3D: NeRF Assisted 3D-3D Pose Alignment for Autonomous Vehicles

関連論文リスト