Fugu-MT Paper Translation (Abstract): RegionE: Adaptive Region-Aware Generation for Efficient Image Editing

Paper abstract: RegionE: Adaptive Region-Aware Generation for Efficient Image Editing

arxiv url: http://arxiv.org/abs/2510.25590v1
Date: Wed, 29 Oct 2025 14:58:37 GMT
Status: Translation complete
Last updated in system: 2025-10-30 15:50:45.713165
Title: RegionE: Adaptive Region-Aware Generation for Efficient Image Editing
Title (reference translation): RegionE: Adaptive Region-Aware Generation for Efficient Image Editing
Authors: Pengtao Chen, Xianfang Zeng, Maosen Zhao, Mingzhu Shen, Peng Ye, Bangyin Xiang, Zhibo Wang, Wei Cheng, Gang Yu, Tao Chen
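Abstract summary: RegionE is an adaptive, region-aware generation framework that accelerates instruction-based image editing (IIE) tasks without additional training. The framework consists of three main components: 1) Adaptive Region Partition, 2) Region-Aware Generation, and 3) Adaptive Velocity Decay Cache. We applied RegionE to state-of-the-art IIE base models, including Step1X-Edit, FLUX.1 Kontext, and Qwen-Image-Edit.
Reference score (independently computed attention score): 28.945176886517448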
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Recently, instruction-based image editing (IIE) has received widespread attention. In practice, IIE often modifies only specific regions of an image, while the remaining areas largely remain unchanged. Although these two types of regions differ significantly in generation difficulty and computational redundancy, existing IIE models do not account for this distinction, instead applying a uniform generation process across the entire image. This motivates us to propose RegionE, an adaptive, region-aware generation framework that accelerates IIE tasks without additional training. Specifically, the RegionE framework consists of three main components: 1) Adaptive Region Partition. We observed that the trajectory of unedited regions is straight, allowing for multi-step denoised predictions to be inferred in a single step. Therefore, in the early denoising stages, we partition the image into edited and unedited regions based on the difference between the final estimated result and the reference image. 2) Region-Aware Generation. After distinguishing the regions, we replace multi-step denoising with one-step prediction for unedited areas. For edited regions, the trajectory is curved, requiring local iterative denoising. To improve the efficiency and quality of local iterative generation, we propose the Region-Instruction KV Cache, which reduces computational cost while incorporating global information. 3) Adaptive Velocity Decay Cache. Observing that adjacent timesteps in edited regions exhibit strong velocity similarity, we further propose an adaptive velocity decay cache to accelerate the local denoising process. We applied RegionE to state-of-the-art IIE base models, including Step1X-Edit, FLUX.1 Kontext, and Qwen-Image-Edit. RegionE achieved acceleration factors of 2.57, 2.41, and 2.06. Evaluations by GPT-4o confirmed that semantic and perceptual fidelity were well preserved.
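Abstract (reference translation): Instruction-based image editing (IIE) has recently attracted wide attention. In practice, IIE usually changes only specific regions of an image, while the remaining regions stay largely unchanged. These two kinds of regions differ markedly in generation difficulty and computational redundancy, yet existing IIE models ignore the distinction and apply a uniform generation process to the entire image. This motivates RegionE, an adaptive, region-aware generation framework that accelerates IIE tasks without additional training. Specifically, the RegionE framework consists of three main components. 1) Adaptive Region Partition. The trajectory of unedited regions was observed to be straight, so multi-step denoised predictions can be inferred in a single step. In the early denoising stages, the image is therefore partitioned into edited and unedited regions based on the difference between the final estimated result and the reference image. 2) Region-Aware Generation. Once the regions are identified, multi-step denoising is replaced with one-step prediction for unedited regions. In edited regions the trajectory is curved and requires local iterative denoising. To improve the efficiency and quality of this local iterative generation, the authors propose the Region-Instruction KV Cache, which reduces computational cost while incorporating global information. 3) Adaptive Velocity Decay Cache. Observing that adjacent timesteps in edited regions exhibit strong velocity similarity, the authors further propose an adaptive velocity decay cache to accelerate the local denoising process. RegionE was applied to state-of-the-art IIE base models, including Step1X-Edit, FLUX.1 Kontext, and Qwen-Image-Edit, and achieved acceleration factors of 2.57, 2.41, and 2.06. Evaluations by GPT-4o confirmed that semantic and perceptual fidelity were well preserved.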
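
The first two components described in the abstract can be illustrated with a minimal NumPy sketch. It is not the authors' implementation. It assumes a rectified-flow style parameterization, x_t = (1 - t) * x_0 + t * noise with velocity v = noise - x_0, under which a straight trajectory can be extrapolated to t = 0 in one step. The function names, the patch size, and the threshold are hypothetical choices made for illustration only.

```python
import numpy as np


def one_step_estimate(x_t, v_t, t):
    """Extrapolate a straight trajectory to t = 0 in a single step.

    Assumption (not from the paper): x_t = (1 - t) * x_0 + t * noise and
    v_t = noise - x_0, so x_0 = x_t - t * v_t.
    """
    return x_t - t * v_t


def partition_regions(x0_hat, reference, patch=2, threshold=0.1):
    """Mark each patch as edited (True) or unedited (False).

    A patch counts as edited when the mean absolute difference between the
    estimated final result and the reference image exceeds `threshold`.
    Both `patch` and `threshold` are illustrative values.
    """
    diff = np.abs(x0_hat - reference)
    if diff.ndim == 3:  # average over channels if present
        diff = diff.mean(axis=-1)
    gh, gw = diff.shape[0] // patch, diff.shape[1] // patch
    # Crop to a whole number of patches, then average inside each patch.
    diff = diff[: gh * patch, : gw * patch]
    diff = diff.reshape(gh, patch, gw, patch).mean(axis=(1, 3))
    return diff > threshold


# Toy example: a 4x4 "image" whose top-left 2x2 patch is edited.
reference = np.zeros((4, 4))
target = reference.copy()
target[:2, :2] = 1.0

noise = np.random.default_rng(0).normal(size=(4, 4))
t = 0.8  # an early (noisy) denoising step
x_t = (1 - t) * target + t * noise
v_t = noise - target  # exact velocity of the straight path

x0_hat = one_step_estimate(x_t, v_t, t)
edited_mask = partition_regions(x0_hat, reference)
print(edited_mask)
```

In this toy setting the velocity is exact everywhere, so the one-step estimate recovers the target and only the top-left patch is flagged as edited. In a real model the velocity comes from the network, and per the abstract only the unedited regions follow a straight enough trajectory for the one-step jump, while the edited regions continue with iterative denoising.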
