Fugu-MT 論文翻訳(概要): RegFormer++: An Efficient Large-Scale 3D LiDAR Point Registration Network with Projection-Aware 2D Transformer

論文の概要: RegFormer++: An Efficient Large-Scale 3D LiDAR Point Registration Network with Projection-Aware 2D Transformer

arxiv url: http://arxiv.org/abs/2603.14290v1
Date: Sun, 15 Mar 2026 09:03:10 GMT
ステータス: 翻訳完了
システム内更新日: 2026-03-17 16:19:35.726401
Title: RegFormer++: An Efficient Large-Scale 3D LiDAR Point Registration Network with Projection-Aware 2D Transformer
Title（参考訳）: RegFormer++:プロジェクション対応2Dトランスを用いた大規模LiDARポイントレジストレーションネットワーク
Authors: Jiuming Liu, Guangming Wang, Zhe Liu, Chaokang Jiang, Haoang Li, Mengmeng Liu, Tianchen Deng, Marc Pollefeys, Michael Ying Yang, Hesheng Wang,
Abstract要約: 本稿では,大規模クラウドアライメントを実現するために,RegFormer++と呼ばれる新しいエンド・ツー・エンド差動トランスフォーマネットワークを提案する。元の3D座標を2次元投影位置に充填するため、設計したトランスフォーマーは2次元処理における高効率と3次元幾何情報からの精度の両面から恩恵を受けることができる。 KITTI、NuScenes、Argoverseのデータセットを用いた実験により、我々のモデルは精度と効率の両面で最先端のパフォーマンスを達成することが示された。
参考スコア（独自算出の注目度）: 71.59462491691819
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Although point cloud registration has achieved remarkable advances in object-level and indoor scenes, large-scale LiDAR registration methods has been rarely explored before. Challenges mainly arise from the huge point scale, complex point distribution, and numerous outliers within outdoor LiDAR scans. In addition, most existing registration works generally adopt a two-stage paradigm: They first find correspondences by extracting discriminative local descriptors and then leverage robust estimators (e.g. RANSAC) to filter outliers, which are highly dependent on well-designed descriptors and post-processing choices. To address these problems, we propose a novel end-to-end differential transformer network, termed RegFormer++, for large-scale point cloud alignment without requiring any further post-processing. Specifically, a hierarchical projection-aware 2D transformer with linear complexity is proposed to project raw LiDAR points onto a cylindrical surface and extract global point features, which can improve resilience to outliers due to long-range dependencies. Because we fill original 3D coordinates into 2D projected positions, our designed transformer can benefit from both high efficiency in 2D processing and accuracy from 3D geometric information. Furthermore, to effectively reduce wrong point matching, a Bijective Association Transformer (BAT) is designed, combining both cross attention and all-to-all point gathering. To improve training stability and robustness, a feature-transformed optimal transport module is also designed for regressing the final pose transformation. Extensive experiments on KITTI, NuScenes, and Argoverse datasets demonstrate that our model achieves state-of-the-art performance in terms of both accuracy and efficiency.
Abstract（参考訳）: 点雲の登録は、オブジェクトレベルや屋内のシーンにおいて顕著な進歩を遂げているが、大規模なLiDAR登録法は、これまで研究されることはめったにない。課題は主に、屋外のLiDARスキャンにおける巨大な点スケール、複雑な点分布、および多数のアウトリーチから生じる。識別可能なローカル記述子を抽出し、次に堅牢な推定子(例えばRANSAC)を活用して、よく設計された記述子と後処理の選択に大きく依存するアウターをフィルタリングする。これらの問題に対処するために、我々はRegFormer++と呼ばれる新しいエンド・ツー・エンドのディファレンシャル・トランスフォーマーネットワークを提案し、さらなる後処理を必要とせずに大規模ポイントクラウドアライメントを実現する。具体的には,線形複雑度を有する階層型射影型2次元変圧器を提案し,LDARの生点を円筒面に投影し,大域点特徴を抽出し,長距離依存による外れ値のレジリエンスを向上させる。元の3D座標を2次元投影位置に充填するため、設計したトランスフォーマーは2次元処理における高効率と3次元幾何情報からの精度の両面から恩恵を受けることができる。さらに, 誤り点マッチングを効果的に低減するために, クロスアテンションとオール・ツー・オールポイント・アソシエーション・アソシエーション・トランスフォーマー(BAT)を併用して設計する。トレーニング安定性とロバスト性を改善するため、最終的なポーズ変換を遅らせるために、特徴変換された最適輸送モジュールも設計されている。 KITTI、NuScenes、Argoverseのデータセットに対する大規模な実験により、我々のモデルは精度と効率の両面で最先端のパフォーマンスを達成することを示した。

論文の概要: RegFormer++: An Efficient Large-Scale 3D LiDAR Point Registration Network with Projection-Aware 2D Transformer

関連論文リスト