Fugu-MT 論文翻訳(概要): GeM-NR: Geometry-Aware Multi-View Editing for Nonrigid Scene Changes

論文の概要: GeM-NR: Geometry-Aware Multi-View Editing for Nonrigid Scene Changes

arxiv url: http://arxiv.org/abs/2606.05142v1
Date: Wed, 03 Jun 2026 17:49:48 GMT
ステータス: 翻訳完了
システム内更新日: 2026-06-04 20:44:18.939443
Title: GeM-NR: Geometry-Aware Multi-View Editing for Nonrigid Scene Changes
Title（参考訳）: GeM-NR:非厳密なシーン変化のための幾何学的マルチビュー編集
Authors: Josef Bengtson, Yaroslava Lochman, Fredrik Kahl,
Abstract要約: 一般的なマルチビュー一貫した画像編集のためのトレーニング不要なアプローチであるGeM-NRを提案する。アンカー画像とクエリ未編集画像が与えられた場合、GeM-NRはアンカー編集と一貫してクエリ画像を編集する。本手法は,編集シーンの3次元表現生成を含む,多種多様な編集タスクの一貫性を向上させる。
参考スコア（独自算出の注目度）: 18.344326720479824
License: http://creativecommons.org/licenses/by-nc-sa/4.0/
Abstract: Recent developments in multi-view image editing with generative models have brought us a step closer toward general 3D content generation and customization. Most existing works focus on rigid or appearance-only edits by utilizing the geometry of the unedited scene. This naturally limits these methods to edits that preserve the underlying scene structure. Other approaches are trained for specific image editing tasks, such as object removal and addition. Despite this progress, general nonrigid edits, i.e., edits that substantially change the scene geometry, remain challenging for existing methods. We propose GeM-NR, a fast and flexible training-free approach for general multi-view consistent image editing, including edits that drastically change the geometry and appearance of the scene. Given an anchor image edited with a chosen backbone editor (such as FLUX, Qwen, BrushNet) and a query unedited image, GeM-NR edits the query image consistently with the anchor edit. The method incorporates multiple stages: (i) depth map estimation, where we propose a strategy to maximize the alignment between the 3D point clouds of the edited and unedited scenes, (ii) projection onto a query viewpoint, and (iii) refinement of the obtained image conditioned on the unedited query. The conditioning-based formulation scales well from two to many views of an object. We demonstrate the ability of our method to handle edits with significant changes in geometry and appearance, something that existing methods struggle with. We perform an extensive evaluation showing that our method improves consistency for a wide variety of edit tasks, including generating 3D representations of the edited scene. Both quantitative and qualitative results indicate the state-of-the-art performance of our method in terms of edit quality as well as geometric and photometric consistency across multiple views.
Abstract（参考訳）: 生成モデルを用いた多視点画像編集の最近の進歩は、一般的な3Dコンテンツ生成とカスタマイズに向けて一歩近づいた。現存する作品の多くは、未編集のシーンの幾何学を利用して、厳密または外観のみの編集に焦点を当てている。これにより、これらのメソッドは、下層のシーン構造を保存する編集に制限される。他のアプローチは、オブジェクトの削除や追加など、特定の画像編集タスクのために訓練されている。この進歩にもかかわらず、一般的な非厳密な編集、すなわちシーンの幾何学を著しく変える編集は、既存の方法では難しいままである。本稿では,多視点一貫した画像編集のための高速でフレキシブルなトレーニングフリーアプローチであるGeM-NRを提案する。選択したバックボーンエディタ(FLUX, Qwen, BrushNetなど)で編集されたアンカーイメージとクエリ未編集のイメージが与えられた場合、GeM-NRはアンカー編集で一貫してクエリ画像を編集する。方法は複数の段階を含む。 (i)深度マップ推定では,編集されたシーンと未編集シーンの3次元点雲のアライメントを最大化するための戦略を提案する。 (ii)クエリ視点へのプロジェクション、および (iii)未編集のクエリで条件付けされた画像の精細化。条件付けに基づく定式化は、オブジェクトの2つのビューから多くのビューにうまくスケールする。我々は,既存の手法が抱える幾何や外観に大きな変化を伴って編集を処理できることを実証する。本手法は,編集シーンの3次元表現生成を含む多種多様な編集タスクの整合性を向上することを示す広範囲な評価を行う。定量的および定性的な結果は、編集品質および幾何的および測光的整合性の観点から、我々の手法の最先端性能を示している。

論文の概要: GeM-NR: Geometry-Aware Multi-View Editing for Nonrigid Scene Changes

関連論文リスト