Fugu-MT 論文翻訳(概要): SAM-Flow: Source-Anchored Masked Flow for Training-Free Image Editing

論文の概要: SAM-Flow: Source-Anchored Masked Flow for Training-Free Image Editing

arxiv url: http://arxiv.org/abs/2606.06228v1
Date: Thu, 04 Jun 2026 14:36:19 GMT
ステータス: 翻訳完了
システム内更新日: 2026-06-05 22:39:44.860621
Title: SAM-Flow: Source-Anchored Masked Flow for Training-Free Image Editing
Title（参考訳）: SAM-Flow:学習不要な画像編集のためのソースアンコール型マスケッドフロー
Authors: Haowang Cui, Rui Chen, Tao Luo, Tao Guo, Zheng Qin, Jiaze Wang,
Abstract要約: SAM-Flowは、ローカライズされたトレーニングフリーの画像編集のための、ソースアンコールされたマスキングフローフレームワークである。正確なセマンティック編集を実現し、背景保存を大幅に改善する。スタブルディフュージョン3やFLUXのようなメインストリームのフローマッチングバックボーンと、微調整なしで統合できる。
参考スコア（独自算出の注目度）: 11.868477331606911
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Training-free image editing has recently attracted increasing attention due to its ability to modify real images using powerful pre-trained diffusion and flow-matching models without additional training. However, existing inversion-based and differential-flow-based methods usually perform global latent transport, which inevitably propagates editing effects to non-target regions and leads to background leakage. To address this problem, we propose SAM-Flow, a source-anchored masked flow framework for localized training-free image editing. Instead of updating the whole latent representation, SAM-Flow first uses a scout image and token-grounded attention maps to localize the editable semantic regions. It then applies differential velocity updates only within these regions, while anchoring the remaining areas to the source-image latent trajectory. To further improve spatial stability and boundary naturalness, we introduce a time-varying source-anchored projection mechanism with dynamic soft masks, transition regions, and temporal mask accumulation. The proposed method is plug-and-play and can be integrated with mainstream flow-matching backbones such as Stable Diffusion 3 and FLUX without any fine-tuning. Extensive qualitative and quantitative experiments demonstrate that SAM-Flow achieves accurate semantic editing while significantly improving background preservation, providing a simple and general localized editing paradigm for training-free image editing. Code is available at: https://github.com/chwbob/Sam-Flow.
Abstract（参考訳）: トレーニング不要の画像編集は、トレーニングを必要とせず、強力な事前学習拡散モデルとフローマッチングモデルを用いて実際の画像を修正できることから、近年注目を集めている。しかし、既存のインバージョンベースおよび差分フローベース手法は、通常グローバルな遅延輸送を行い、必然的に非ターゲット領域に編集効果を伝播させ、バックグラウンドリークを引き起こす。そこで本研究では,学習自由な画像編集のためのソースアンコール型マスキングフローフレームワークであるSAM-Flowを提案する。 SAM-Flowは、潜在表現全体を更新する代わりに、最初にスカウトイメージとトークングラウンドのアテンションマップを使用して、編集可能なセマンティック領域をローカライズする。次に、これらの領域内でのみ差速度更新を適用し、残りの領域をソースイメージの潜在軌道に固定する。空間的安定性と境界自然性をさらに向上するため,ダイナミックなソフトマスク,遷移領域,時間的マスク蓄積を備えた時間変化ソースアンコレッドプロジェクション機構を導入する。提案手法はプラグアンドプレイであり,スタブルディフュージョン3やFLUXなどのメインストリームのフローマッチングバックボーンと一体化可能である。大規模定性的かつ定量的な実験によりSAM-Flowは背景保存を著しく改善し、訓練不要な画像編集のためのシンプルで汎用的な編集パラダイムを提供するとともに、正確な意味編集を実現することが示されている。コードは、https://github.com/chwbob/Sam-Flow.comで入手できる。

論文の概要: SAM-Flow: Source-Anchored Masked Flow for Training-Free Image Editing

関連論文リスト