Fugu-MT 論文翻訳(概要): Extreme Blind Image Restoration via Prompt-Conditioned Information Bottleneck

論文の概要: Extreme Blind Image Restoration via Prompt-Conditioned Information Bottleneck

arxiv url: http://arxiv.org/abs/2510.00728v1
Date: Wed, 01 Oct 2025 10:13:27 GMT
ステータス: 翻訳完了
システム内更新日: 2025-10-03 16:59:20.505554
Title: Extreme Blind Image Restoration via Prompt-Conditioned Information Bottleneck
Title（参考訳）: Prompt-Conditioned Information Bottleneckによる極端ブラインド画像復元
Authors: Hongeun Kim, Bryan Sangwoo Kim, Jong Chul Ye,
Abstract要約: ブラインド画像復元法(BIR)は目覚ましい成果を上げたが, EBIR(Extreme Blind Image Restoration)に直面するとフェールした。本稿では,難解なELQ-to-HQ復元プロセスを分解する新しいフレームワークを提案する。
参考スコア（独自算出の注目度）: 47.027290803102666
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Blind Image Restoration (BIR) methods have achieved remarkable success but falter when faced with Extreme Blind Image Restoration (EBIR), where inputs suffer from severe, compounded degradations beyond their training scope. Directly learning a mapping from extremely low-quality (ELQ) to high-quality (HQ) images is challenging due to the massive domain gap, often leading to unnatural artifacts and loss of detail. To address this, we propose a novel framework that decomposes the intractable ELQ-to-HQ restoration process. We first learn a projector that maps an ELQ image onto an intermediate, less-degraded LQ manifold. This intermediate image is then restored to HQ using a frozen, off-the-shelf BIR model. Our approach is grounded in information theory; we provide a novel perspective of image restoration as an Information Bottleneck problem and derive a theoretically-driven objective to train our projector. This loss function effectively stabilizes training by balancing a low-quality reconstruction term with a high-quality prior-matching term. Our framework enables Look Forward Once (LFO) for inference-time prompt refinement, and supports plug-and-play strengthening of existing image restoration models without need for finetuning. Extensive experiments under severe degradation regimes provide a thorough analysis of the effectiveness of our work.
Abstract（参考訳）: ブラインド画像復元法(Blind Image Restoration, BIR)は, トレーニング範囲を超えて重度の複合劣化に苦しむEBIR(Extreme Blind Image Restoration, EBIR)に直面すると, 目覚ましい成功を収めた。極めて低品質(ELQ)から高品質(HQ)イメージへのマッピングを直接学習することは、ドメインのギャップが大きいため、しばしば不自然なアーティファクトや詳細の喪失につながるため、難しい。そこで本研究では,難解なELQ-to-HQ復元プロセスを分解する新しいフレームワークを提案する。まず、ELQ画像を中間劣化の少ないLQ多様体にマッピングするプロジェクタを学習する。この中間画像は、凍結した市販のBIRモデルを使用して本社に復元される。提案手法は情報理論に基づいており,情報ボトルネック問題としての画像復元の新たな視点を提供し,プロジェクタを訓練する理論的目的を導出する。この損失関数は、高品質の事前マッチング項と低品質の再構成項のバランスをとることにより、トレーニングを効果的に安定化させる。我々のフレームワークは、推測時即時改善のためのLook Forward Once(LFO)を可能にし、微調整を必要とせずに既存の画像復元モデルのプラグアンドプレイ強化をサポートする。過酷な劣化体制下での大規模な実験は、我々の研究の有効性を徹底的に分析する。

論文の概要: Extreme Blind Image Restoration via Prompt-Conditioned Information Bottleneck

関連論文リスト