Fugu-MT 論文翻訳(概要): MV-RoMa: From Pairwise Matching into Multi-View Track Reconstruction

論文の概要: MV-RoMa: From Pairwise Matching into Multi-View Track Reconstruction

arxiv url: http://arxiv.org/abs/2603.27542v1
Date: Sun, 29 Mar 2026 06:50:21 GMT
ステータス: 翻訳完了
システム内更新日: 2026-03-31 23:18:45.016738
Title: MV-RoMa: From Pairwise Matching into Multi-View Track Reconstruction
Title（参考訳）: MV-RoMa:Pairwise MatchingからMulti-View Track Restructionへ
Authors: Jongmin Lee, Seungyeop Kang, Sungjoo Yoo,
Abstract要約: MV-RoMaは、ソース画像から複数の可視目標への密対応を推定する多視点密マッチングモデルである。我々は、モデルが一貫したマルチビュー対応を、構造移動のための高品質トラックとして統合する後処理戦略を提案する(SfM)。 MV-RoMaは既存のスパース法や密マッチング法よりも信頼性が高く、かなり密集した3次元再構成を行う。
参考スコア（独自算出の注目度）: 14.717756921141364
License: http://creativecommons.org/publicdomain/zero/1.0/
Abstract: Establishing consistent correspondences across images is essential for 3D vision tasks such as structure-from-motion (SfM), yet most existing matchers operate in a pairwise manner, often producing fragmented and geometrically inconsistent tracks when their predictions are chained across views. We propose MV-RoMa, a multi-view dense matching model that jointly estimates dense correspondences from a source image to multiple co-visible targets. Specifically, we design an efficient model architecture which avoids high computational cost of full cross-attention for multi-view feature interaction: (i) multi-view encoder that leverages pair-wise matching results as a geometric prior, and (ii) multi-view matching refiner that refines correspondences using pixel-wise attention. Additionally, we propose a post-processing strategy that integrates our model's consistent multi-view correspondences as high-quality tracks for SfM. Across diverse and challenging benchmarks, MV-RoMa produces more reliable correspondences and substantially denser, more accurate 3D reconstructions than existing sparse and dense matching methods. Project page: https://icetea-cv.github.io/mv-roma/.
Abstract（参考訳）: 画像間で一貫した対応を確立することは、構造移動(SfM)のような3次元視覚タスクには不可欠であるが、既存のほとんどのマッカーはペア方式で動作し、予測がビューにチェーンされているときにしばしば断片的かつ幾何学的に一貫性のないトラックを生成する。本稿では,ソース画像から複数の同一視対象への濃密対応を同時推定する多視点密マッチングモデルMV-RoMaを提案する。具体的には,マルチビュー機能間相互作用のためのクロスアテンションの計算コストが高いことを回避した,効率的なモデルアーキテクチャを設計する。 (i)幾何先行としてペアワイドマッチング結果を利用するマルチビューエンコーダ (II)画素ワイドアテンションを用いて対応を洗練するマルチビューマッチングリファインダ。さらに、SfMの高品質トラックとして、モデルの一貫性のあるマルチビュー対応を統合する後処理戦略を提案する。 MV-RoMaは、多種多様で挑戦的なベンチマークで、既存のスパース法や密マッチング法よりも信頼性が高く、かなり高密度で高精度な3D再構成を生成する。プロジェクトページ:https://icetea-cv.github.io/mv-roma/。

論文の概要: MV-RoMa: From Pairwise Matching into Multi-View Track Reconstruction

関連論文リスト