Fugu-MT 論文翻訳(概要): TOC-SR: Task-Optimal Compact diffusion for Image Super Resolution

論文の概要: TOC-SR: Task-Optimal Compact diffusion for Image Super Resolution

arxiv url: http://arxiv.org/abs/2605.02767v1
Date: Mon, 04 May 2026 16:06:24 GMT
ステータス: 翻訳完了
システム内更新日: 2026-05-05 20:33:50.397627
Title: TOC-SR: Task-Optimal Compact diffusion for Image Super Resolution
Title（参考訳）: TOC-SR:画像超解像のためのタスク最適コンパクト拡散
Authors: Sowmya Vajrala, Akshay Bankar, Manjunath Arveti, Shreyas Pandith, Sravanth Kodavanti, Subhajit Sanyal, Amit Unde, Srinivas Soumitri Miriyala,
Abstract要約: 本稿では,コンパクトな拡散バックボーンを初めて発見し,効率的な1ステップ超解像モデルを構築するための枠組みを提案する。その結果、コンパクトな拡散バックボーンは、拡張拡散モデルと比較してパラメータの6.6倍、GMACの2.8倍の減少を達成する。提案手法は, 高い復元品質を維持しつつ, 効率的な超解像を実現することを実証した。
参考スコア（独自算出の注目度）: 4.395931521489804
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Diffusion models have recently demonstrated strong performance for image restoration tasks, including super-resolution. However, their large model size and iterative sampling procedures make them computationally expensive for practical deployment. In this work, we present TOC-SR, a framework for building efficient one-step super-resolution models by first discovering a compact diffusion backbone. Starting from a sixteen-channel latent diffusion model, we construct parameter-efficient surrogate blocks using feature-wise generative distillation and perform architecture discovery using epsilon-constrained Bayesian Optimization to minimize model complexity while preserving generative fidelity. The resulting compact diffusion backbone achieves a 6.6x reduction in parameters and a 2.8x reduction in GMACs compared to the expanded diffusion model. We then adapt this backbone for super-resolution and distill the diffusion process into a single-step generator. Experiments demonstrate that the proposed approach enables efficient super-resolution while maintaining strong reconstruction quality.
Abstract（参考訳）: 拡散モデルは最近、超解像を含む画像復元タスクに強い性能を示した。しかし、その大きなモデルサイズと反復的なサンプリング手順により、実用的な展開には計算コストがかかる。本研究では,コンパクトな拡散バックボーンを初めて発見し,効率的な1ステップ超解像モデルを構築するためのフレームワークTOC-SRを提案する。 16チャンネルの潜伏拡散モデルから, 特徴量生成蒸留を用いてパラメータ効率のよいサロゲートブロックを構築し, エプシロン制約ベイズ最適化を用いて構造探索を行い, モデル複雑性を最小化し, 生成忠実性を維持した。その結果、コンパクトな拡散バックボーンは、拡張拡散モデルと比較してパラメータの6.6倍、GMACの2.8倍の減少を達成する。次に、このバックボーンを超高分解能に適応させ、拡散過程を単段発生器に蒸留する。提案手法は, 高い復元品質を維持しつつ, 効率的な超解像を実現することを実証した。

論文の概要: TOC-SR: Task-Optimal Compact diffusion for Image Super Resolution

関連論文リスト