Fugu-MT 論文翻訳(概要): Free Geometry: Refining 3D Reconstruction from Longer Versions of Itself

論文の概要: Free Geometry: Refining 3D Reconstruction from Longer Versions of Itself

arxiv url: http://arxiv.org/abs/2604.14048v1
Date: Wed, 15 Apr 2026 16:24:03 GMT
ステータス: 翻訳完了
システム内更新日: 2026-04-16 20:38:32.635157
Title: Free Geometry: Refining 3D Reconstruction from Longer Versions of Itself
Title（参考訳）: フリージオメトリ:より長いバージョンから3Dレコンストラクションを仕上げる
Authors: Yuhang Dai, Xingyi Yang,
Abstract要約: Free Geometryは、フィードフォワードの3D再構成モデルが3D基底の真理なしにテスト時に自己進化することを可能にするフレームワークである。当社のアプローチは、4つのベンチマークデータセットにまたがって、Depth Anything 3やVGGTといった最先端の基礎モデルを継続的に改善しています。
参考スコア（独自算出の注目度）: 38.95274911975506
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Feed-forward 3D reconstruction models are efficient but rigid: once trained, they perform inference in a zero-shot manner and cannot adapt to the test scene. As a result, visually plausible reconstructions often contain errors, particularly under occlusions, specularities, and ambiguous cues. To address this, we introduce Free Geometry, a framework that enables feed-forward 3D reconstruction models to self-evolve at test time without any 3D ground truth. Our key insight is that, when the model receives more views, it produces more reliable and view-consistent reconstructions. Leveraging this property, given a testing sequence, we mask a subset of frames to construct a self-supervised task. Free Geometry enforces cross-view feature consistency between representations from full and partial observations, while maintaining the pairwise relations implied by the held-out frames. This self-supervision allows for fast recalibration via lightweight LoRA updates, taking less than 2 minutes per dataset on a single GPU. Our approach consistently improves state-of-the-art foundation models, including Depth Anything 3 and VGGT, across 4 benchmark datasets, yielding an average improvement of 3.73% in camera pose accuracy and 2.88% in point map prediction. Code is available at https://github.com/hiteacherIamhumble/Free-Geometry .
Abstract（参考訳）: フィードフォワード3D再構成モデルは効率的だが剛性があり、訓練されるとゼロショットで推論を行い、テストシーンに適応できない。結果として、視覚的にもっともらしい再建は、特に閉塞、特異性、曖昧な手がかりの下でエラーを含むことが多い。これを解決するために、我々は、フィードフォワード3次元再構成モデルを3次元基底真理なしにテスト時に自己展開できるフレームワークであるFree Geometryを紹介した。私たちの重要な洞察は、モデルがより多くのビューを受け取ると、より信頼性が高く、ビューに一貫性のある再構築が得られます。この特性を活用して、テストシーケンスが与えられたら、フレームのサブセットを隠蔽して自己教師型タスクを構築する。 Free Geometryは、完全な観察から部分的な観察までの表現間の横断的な特徴整合を強制する一方で、保持されたフレームによって暗示されるペアワイズ関係を維持している。このセルフスーパービジョンは、軽量なLoRAアップデートによる高速なリカバリを可能にし、単一のGPU上でデータセット毎に2分未満で済む。我々のアプローチは、4つのベンチマークデータセットでDepth Anything 3とVGGTを含む最先端の基礎モデルを継続的に改善し、カメラポーズの精度が3.73%、ポイントマップの予測が2.88%向上した。コードはhttps://github.com/hiteacherIamhumble/Free-Geometryで入手できる。

論文の概要: Free Geometry: Refining 3D Reconstruction from Longer Versions of Itself

関連論文リスト