Fugu-MT 論文翻訳(概要): SG2Loc: Sequential Visual Localization on 3D Scene Graphs

論文の概要: SG2Loc: Sequential Visual Localization on 3D Scene Graphs

arxiv url: http://arxiv.org/abs/2606.11880v1
Date: Wed, 10 Jun 2026 10:03:21 GMT
ステータス: 翻訳完了
システム内更新日: 2026-06-11 16:42:38.403056
Title: SG2Loc: Sequential Visual Localization on 3D Scene Graphs
Title（参考訳）: SG2Loc: 3次元シーングラフにおける連続的な視覚的位置決め
Authors: Nicole Damblon, Olga Vysotska, Federico Tombari, Marc Pollefeys, Daniel Barath,
Abstract要約: 本稿では,3次元シーングラフを用いたシーケンシャルな視覚的ローカライゼーションのための,新しい軽量なアプローチを提案する。コンパクトなシーングラフと効率的なセマンティックマッチングを活用することで,実世界のデータセットの性能を維持しながら,ストレージを大幅に削減する。
参考スコア（独自算出の注目度）: 108.42363585468458
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Visual localization in complex indoor environments remains a critical challenge for robotics and AR applications. Sequential localization, where pose estimates are refined over time, is important for autonomous agents. However, traditional methods often require storing extensive image databases or point clouds, leading to significant overhead. This paper introduces a novel, lightweight approach to sequential visual localization using 3D scene graphs. Our method represents the environment with a compact scene graph, where nodes represent objects (with coarse meshes) and edges encode spatial relationships. For each image in the localization phase, we extract per-patch semantic features, predicting object identities. Localization is performed within a particle filter framework. Each particle, representing a camera pose, projects the coarse object meshes from the scene graph into the image, assigning object identities to patches based on visibility. The similarity of the per-patch features, in the input image, and object features from the scene graph determines the weight of a particle. Subsequent images are incorporated sequentially, refining the pose estimate. By leveraging a compact scene graph and efficient semantic matching, our method significantly reduces storage while maintaining performance on real-world datasets. The code will be available at https://github.com/DmblnNicole/sg2loc.
Abstract（参考訳）: 複雑な屋内環境における視覚的ローカライゼーションは、ロボティクスやARアプリケーションにとって重要な課題である。ポーズ推定が時間とともに洗練される連続的なローカライゼーションは、自律エージェントにとって重要である。しかしながら、従来の手法では、大規模なイメージデータベースやポイントクラウドを格納する必要があることが多いため、大きなオーバーヘッドが生じる。本稿では,3次元シーングラフを用いたシーケンシャルな視覚的ローカライゼーションのための,新しい軽量なアプローチを提案する。提案手法は,ノードがオブジェクト(粗いメッシュ)を表現し,エッジが空間関係を符号化する,コンパクトなシーングラフを用いて環境を表現する。ローカライゼーションフェーズの各画像に対して、オブジェクトの同一性を予測するパッチごとのセマンティック特徴を抽出する。局所化は粒子フィルタフレームワーク内で実行される。カメラのポーズを表す各粒子は、シーングラフから粗いオブジェクトメッシュをイメージに投影し、可視性に基づいてオブジェクトのアイデンティティをパッチに割り当てる。シーングラフからのオブジェクト特徴と入力画像におけるパッチごとの特徴の類似性は、粒子の重量を決定する。その後の画像を順次組み込んで、ポーズ推定を精算する。コンパクトなシーングラフと効率的なセマンティックマッチングを活用することで,実世界のデータセットの性能を維持しながら,ストレージを大幅に削減する。コードはhttps://github.com/DmblnNicole/sg2loc.comから入手できる。

論文の概要: SG2Loc: Sequential Visual Localization on 3D Scene Graphs

関連論文リスト