Fugu-MT Paper Translation (Summary): Faithful Extreme Image Rescaling with Learnable Reversible Transformation and Semantic Priors

Paper overview: Faithful Extreme Image Rescaling with Learnable Reversible Transformation and Semantic Priors

arxiv url: http://arxiv.org/abs/2605.00605v1
Date: Fri, 01 May 2026 12:19:40 GMT
Status: Translation complete
Last updated in system: 2026-05-04 17:43:28.94445
Title: Faithful Extreme Image Rescaling with Learnable Reversible Transformation and Semantic Priors
Authors: Hao Wei, Yanhui Zhou, Chenyang Ge, Saeed Anwar, Ajmal Mian
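Abstract summary: FaithEIR is a diffusion-based framework for extreme image rescaling. Inspired by singular value decomposition, the authors develop a learnable reversible transformation. To compensate for information loss due to quantization, they propose an adaptive detail prior.
Reference score (independently computed attention score): 46.54433210034761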
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Most recent extreme rescaling methods struggle to preserve semantically consistent structures and produce realistic details, due to the severely ill-posed nature of low- to high-resolution mapping under scaling factors of $16\times$ or higher. To alleviate these problems, we propose FaithEIR, a diffusion-based framework for extreme image rescaling. Inspired by singular value decomposition, we develop a learnable reversible transformation that enables invertible downscaling and upscaling in the latent space. To compensate for information loss due to quantization, we propose an adaptive detail prior, a high-frequency dictionary that captures the empirical average of commonly occurring structures in the training data. Finally, we design a lightweight pixel semantic embedder to provide semantic conditioning for the pretrained diffusion model. We present extensive experimental results demonstrating that FaithEIR consistently outperforms state-of-the-art methods, achieving superior reconstruction fidelity and perceptual quality. Our code, model weights, and detailed results are released at https://github.com/cshw2021/FaithEIR.
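Illustration (not from the paper): the abstract's idea of an SVD-inspired invertible down/upscaling step, followed by lossy quantization, can be pictured with a small NumPy toy. This is only a sketch under loud assumptions: it uses a fixed SVD basis computed from the input itself rather than a learned transformation, it works on a raw matrix rather than a latent space, and the function names `downscale`, `upscale`, and `quantize` are hypothetical and do not come from the FaithEIR code. It shows that an orthonormal projection can be undone exactly when the input has low rank, and that quantizing the compact code introduces an error that some additional prior would have to compensate.

```python
import numpy as np


def downscale(x: np.ndarray, k: int):
    """Project an (m, n) matrix onto its top-k left singular vectors."""
    u, _, _ = np.linalg.svd(x, full_matrices=False)
    basis = u[:, :k]           # (m, k), orthonormal columns
    return basis.T @ x, basis  # (k, n) compact code plus the basis


def upscale(code: np.ndarray, basis: np.ndarray) -> np.ndarray:
    """Map the compact code back to the original (m, n) shape."""
    return basis @ code


def quantize(code: np.ndarray, levels: int = 256) -> np.ndarray:
    """Uniformly quantize to `levels` values, then dequantize."""
    lo, hi = float(code.min()), float(code.max())
    step = (hi - lo) / (levels - 1) if hi > lo else 1.0
    return np.round((code - lo) / step) * step + lo


rng = np.random.default_rng(0)
# Rank-4 toy "image" (assumption: real images are not exactly low rank).
x = rng.standard_normal((64, 4)) @ rng.standard_normal((4, 64))

code, basis = downscale(x, k=4)         # 64x64 -> 4x64, i.e. 16x fewer rows
exact = upscale(code, basis)            # lossless for this rank-4 input
lossy = upscale(quantize(code), basis)  # quantization introduces error

print("max error without quantization:", np.abs(exact - x).max())
print("max error with quantization:   ", np.abs(lossy - x).max())
```

Because the basis has orthonormal columns, the reconstruction error after quantization has the same Frobenius norm as the quantization error in the compact code, so in this toy all of the loss comes from the quantization step.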
