Fugu-MT 論文翻訳(概要): C-GenReg: Training-Free 3D Point Cloud Registration by Multi-View-Consistent Geometry-to-Image Generation with Probabilistic Modalities Fusion

論文の概要: C-GenReg: Training-Free 3D Point Cloud Registration by Multi-View-Consistent Geometry-to-Image Generation with Probabilistic Modalities Fusion

arxiv url: http://arxiv.org/abs/2604.16680v1
Date: Fri, 17 Apr 2026 20:29:32 GMT
ステータス: 翻訳完了
システム内更新日: 2026-04-21 21:52:52.126425
Title: C-GenReg: Training-Free 3D Point Cloud Registration by Multi-View-Consistent Geometry-to-Image Generation with Probabilistic Modalities Fusion
Title（参考訳）: C-GenReg: 確率的モーダリティ融合による多視点幾何画像生成による訓練不要3Dポイントクラウド登録
Authors: Yuval Haitman, Amit Efraim, Joseph M. Francos,
Abstract要約: C-GenRegは3Dポイントクラウド登録のためのトレーニング不要のフレームワークである。これは、世界規模の生成先行と登録指向のビジョンファウンデーションモデルの相補的な強みを活用する。実屋外のLiDARデータ上で動作可能な生成登録フレームワークを初めて紹介する。
参考スコア（独自算出の注目度）: 14.324844136229649
License: http://creativecommons.org/licenses/by/4.0/
Abstract: We introduce C-GenReg, a training-free framework for 3D point cloud registration that leverages the complementary strengths of world-scale generative priors and registration-oriented Vision Foundation Models (VFMs). Current learning-based 3D point cloud registration methods struggle to generalize across sensing modalities, sampling differences, and environments. Hence, C-GenReg augments the geometric point cloud registration branch by transferring the matching problem into an auxiliary image domain, where VFMs excel, using a World Foundation Model to synthesize multi-view-consistent RGB representations from the input geometry. This generative transfer, preserves spatial coherence across source and target views without any fine-tuning. From these generated views, a VFM pretrained for finding dense correspondences extracts matches. The resulting pixel correspondences are lifted back to 3D via the original depth maps. To further enhance robustness, we introduce a "Match-then-Fuse" probabilistic cold-fusion scheme that combines two independent correspondence posteriors, that of the generated-RGB branch with that of the raw geometric branch. This principled fusion preserves each modality inductive bias and provides calibrated confidence without any additional learning. C-GenReg is zero-shot and plug-and-play: all modules are pretrained and operate without fine-tuning. Extensive experiments on indoor (3DMatch, ScanNet) and outdoor (Waymo) benchmarks demonstrate strong zero-shot performance and superior cross-domain generalization. For the first time, we demonstrate a generative registration framework that operates successfully on real outdoor LiDAR data, where no imagery data is available.
Abstract（参考訳）: C-GenRegは3Dポイントクラウド登録のためのトレーニング不要のフレームワークで、世界規模の生成前と登録指向のビジョンファウンデーションモデル(VFM)の相補的な強みを利用する。現在の学習ベースの3Dポイントクラウド登録手法は、センシングのモダリティ、違いのサンプリング、環境の一般化に苦慮している。したがって、C-GenRegは、マッチング問題を補助画像領域に転送することで、幾何学点雲登録枝を拡張し、VFMは、ワールドファンデーションモデルを用いて入力幾何学から多視点のRGB表現を合成する。この生成転送は、微調整なしでソースとターゲットビュー間の空間的コヒーレンスを保存する。これらのビューから、密度の高い対応関係の抽出マッチを見つけるために予め訓練されたVFMが生成される。結果のピクセル対応は、元の深度マップを介して3Dに持ち上げられる。さらにロバスト性を高めるために,生成したRGBブランチと生の幾何学的ブランチの2つの独立対応部分を組み合わせた,"Match-then-Fuse"確率的コールドフュージョン方式を導入する。この原理的融合は、各モード誘導バイアスを保存し、追加の学習なしに校正された信頼を提供する。 C-GenRegはゼロショットでプラグイン&プレイで、すべてのモジュールは微調整なしで事前トレーニングされ、動作します。屋内 (3DMatch, ScanNet) および屋外 (Waymo) ベンチマークの大規模な実験は、強いゼロショット性能と優れたクロスドメイン一般化を示す。画像データが得られない実屋外LiDARデータ上で, 実効動作する生成登録フレームワークを初めて実演する。

論文の概要: C-GenReg: Training-Free 3D Point Cloud Registration by Multi-View-Consistent Geometry-to-Image Generation with Probabilistic Modalities Fusion

関連論文リスト