Fugu-MT 論文翻訳(概要): Dragging with Geometry: From Pixels to Geometry-Guided Image Editing

論文の概要: Dragging with Geometry: From Pixels to Geometry-Guided Image Editing

arxiv url: http://arxiv.org/abs/2509.25740v1
Date: Tue, 30 Sep 2025 03:53:11 GMT
ステータス: 翻訳完了
システム内更新日: 2025-10-01 14:45:00.008025
Title: Dragging with Geometry: From Pixels to Geometry-Guided Image Editing
Title（参考訳）: 幾何学によるドラッグ: 画像から幾何学誘導画像編集へ
Authors: Xinyu Pu, Hongsong Wang, Jie Gui, Pan Zhou,
Abstract要約: 幾何学誘導型ドラッグベース画像編集手法であるGeoDragを提案する。ジオドラグは3次元幾何学と2次元空間先行を共同で符号化する統一された変位場に基づいており、コヒーレントで高忠実で構造に一貫性のある編集を可能にしている。
参考スコア（独自算出の注目度）: 42.176957681367185
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Interactive point-based image editing serves as a controllable editor, enabling precise and flexible manipulation of image content. However, most drag-based methods operate primarily on the 2D pixel plane with limited use of 3D cues. As a result, they often produce imprecise and inconsistent edits, particularly in geometry-intensive scenarios such as rotations and perspective transformations. To address these limitations, we propose a novel geometry-guided drag-based image editing method - GeoDrag, which addresses three key challenges: 1) incorporating 3D geometric cues into pixel-level editing, 2) mitigating discontinuities caused by geometry-only guidance, and 3) resolving conflicts arising from multi-point dragging. Built upon a unified displacement field that jointly encodes 3D geometry and 2D spatial priors, GeoDrag enables coherent, high-fidelity, and structure-consistent editing in a single forward pass. In addition, a conflict-free partitioning strategy is introduced to isolate editing regions, effectively preventing interference and ensuring consistency. Extensive experiments across various editing scenarios validate the effectiveness of our method, showing superior precision, structural consistency, and reliable multi-point editability. The code will be available on https://github.com/xinyu-pu/GeoDrag .
Abstract（参考訳）: インタラクティブなポイントベースの画像編集は、制御可能なエディタとして機能し、画像コンテンツの正確かつ柔軟な操作を可能にする。しかしながら、ほとんどのドラッグベースの手法は、主に3Dキューを限定的に使用した2Dピクセル平面で動作する。結果として、特に回転や視点変換のような幾何学的集約的なシナリオにおいて、不正確で一貫性のない編集をしばしば生み出す。これらの制約に対処するために,ジオドラッグという3つの重要な課題に対処する幾何学誘導型ドラッグベース画像編集手法を提案する。 1)3次元幾何学的手がかりをピクセルレベルの編集に取り入れる。 2 幾何学のみによる不連続の緩和及び 3)多点引きずりによる紛争の解決。ジオドラグは3次元幾何と2次元空間先行を共同で符号化する統一された変位場に基づいており、コヒーレントで高忠実で構造に一貫性のある編集を1つの前方通過で実現している。さらに、編集領域を分離し、干渉を効果的に防止し、一貫性を確保するために、競合のない分割戦略を導入する。本手法の有効性を検証し, 精度, 構造整合性, 信頼性の高い多点編集性を示す。コードはhttps://github.com/xinyu-pu/GeoDragで入手できる。

論文の概要: Dragging with Geometry: From Pixels to Geometry-Guided Image Editing

関連論文リスト