Fugu-MT 論文翻訳(概要): Training-Free Image Editing with Visual Context Integration and Concept Alignment

論文の概要: Training-Free Image Editing with Visual Context Integration and Concept Alignment

arxiv url: http://arxiv.org/abs/2604.04487v1
Date: Mon, 06 Apr 2026 07:26:18 GMT
ステータス: 翻訳完了
システム内更新日: 2026-04-07 15:49:19.131153
Title: Training-Free Image Editing with Visual Context Integration and Concept Alignment
Title（参考訳）: ビジュアルコンテキスト統合とコンセプトアライメントによる学習自由な画像編集
Authors: Rui Song, Guo-Hua Wang, Qing-Guo Chen, Weihua Luo, Tongda Xu, Zhening Liu, Yan Wang, Zehong Lin, Jun Zhang,
Abstract要約: VicoEditは、事前訓練されたテキストプロンプト編集モデルに視覚的コンテキストを注入する、トレーニングフリーでインバージョンフリーな方法である。本手法は,最先端のトレーニングベースモデルよりも優れた編集性能が得られることを示す。
参考スコア（独自算出の注目度）: 45.18177214621628
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: In image editing, it is essential to incorporate a context image to convey the user's precise requirements, such as subject appearance or image style. Existing training-based visual context-aware editing methods incur data collection effort and training cost. On the other hand, the training-free alternatives are typically established on diffusion inversion, which struggles with consistency and flexibility. In this work, we propose VicoEdit, a training-free and inversion-free method to inject the visual context into the pretrained text-prompted editing model. More specifically, VicoEdit directly transforms the source image into the target one based on the visual context, thereby eliminating the need for inversion that can lead to deviated trajectories. Moreover, we design a posterior sampling approach guided by concept alignment to enhance the editing consistency. Empirical results demonstrate that our training-free method achieves even better editing performance than the state-of-the-art training-based models.
Abstract（参考訳）: 画像編集では、ユーザの正確な要求、例えば被写体やイメージスタイルを伝えるために、コンテキストイメージを組み込むことが不可欠である。既存のトレーニングベースのビジュアルコンテキスト対応編集手法は、データ収集の労力とトレーニングコストを発生させる。一方、トレーニングフリーの代替手段は一般的に、一貫性と柔軟性に苦しむ拡散反転に基づいて確立される。本研究では、事前訓練されたテキストプロンプト編集モデルに視覚的コンテキストを注入するトレーニングフリーでインバージョンフリーなVicoEditを提案する。より具体的には、VicoEditは視覚的コンテキストに基づいてソースイメージを直接ターゲットイメージに変換する。さらに、編集一貫性を高めるために、概念アライメントによってガイドされた後部サンプリングアプローチを設計する。実験結果から,本手法は最先端のトレーニングベースモデルよりも優れた編集性能が得られることが示された。

関連論文リスト

Training-Free Multi-Concept Image Editing [14.75123947134721]
Concept Distillation Sampling (CDS)は、ターゲットレスマルチコンセプト画像編集のためのトレーニング不要のフレームワークである。これは、高度に安定な蒸留バックボーンを統合することによって、従来の手法の言語的ボトルネックを克服する。本手法は,所望の編集の参照サンプルを必要とせずに,インスタンスの忠実度を保存する。
論文参考訳（メタデータ） (2026-02-24T12:27:51Z)
SPIE: Semantic and Structural Post-Training of Image Editing Diffusion Models with AI feedback [28.807572302899004]
SPIEは、命令ベースの画像編集拡散モデルの意味的および構造的後学習のための新しいアプローチである。広範な人間のアノテーションに頼ることなく、拡散モデルと人間の嗜好を一致させるオンライン強化学習フレームワークを導入する。実験結果から,SPIEは複雑なシーンで,わずか10ステップで複雑な編集を行うことができることがわかった。
論文参考訳（メタデータ） (2025-04-17T10:46:39Z)
Concept Lancet: Image Editing with Compositional Representation Transplant [58.9421919837084]
Concept Lancetは、画像編集における原則的表現操作のためのゼロショットプラグイン・アンド・プレイフレームワークである。本稿では,抽出された視覚概念の表現の疎線型結合として,潜時(テキスト埋め込み,拡散スコア)空間におけるソース入力を分解する。我々は、対応する編集方向を強制するために、カスタマイズされた概念移植プロセスを実行する。
論文参考訳（メタデータ） (2025-04-03T17:59:58Z)
Edicho: Consistent Image Editing in the Wild [90.42395533938915]
エディコは拡散モデルに基づく訓練なしの解を踏む。これは、明示的な画像対応を使って直接編集する基本的な設計原理を特徴としている。
論文参考訳（メタデータ） (2024-12-30T16:56:44Z)
UIP2P: Unsupervised Instruction-based Image Editing via Edit Reversibility Constraint [87.20985852686785]
本研究では,教師なし指導に基づく画像編集手法を提案する。本稿では,編集可逆性制約 (ERC) と呼ばれる新しい編集機構を導入することで,これらの課題に対処する。これにより、地道に編集されたイメージの必要性を回避し、実際のイメージキャプチャペアまたはイメージキャプチャインストラクショントリプルからなるデータセットで、初めてトレーニングをアンロックすることが可能になります。
論文参考訳（メタデータ） (2024-12-19T18:59:58Z)
AdapEdit: Spatio-Temporal Guided Adaptive Editing Algorithm for Text-Based Continuity-Sensitive Image Editing [24.9487669818162]
本稿では,適応的な画像編集を実現するための時間的ガイド付き適応編集アルゴリズムAdapEditを提案する。我々のアプローチは、モデルの事前保存において大きな利点があり、モデルトレーニング、微調整された追加データ、最適化を必要としない。提案手法は,様々な原画像や編集命令を対象とし,競争性能を実証し,従来の手法よりも優れていたことを示す。
論文参考訳（メタデータ） (2023-12-13T09:45:58Z)

関連論文リストは本サイト内にある論文のタイトル・アブストラクトから自動的に作成しています。

指定された論文の情報です。
本サイトの運営者は本サイト（すべての情報・翻訳含む）の品質を保証せず、本サイト（すべての情報・翻訳含む）を使用して発生したあらゆる結果について一切の責任を負いません。