Fugu-MT 論文翻訳(概要): teasr: training-efficient any-step diffusion transformer for real-world image super-resolution

論文の概要: teasr: training-efficient any-step diffusion transformer for real-world image super-resolution

arxiv url: http://arxiv.org/abs/2606.16188v1
Date: Mon, 15 Jun 2026 04:02:37 GMT
ステータス: 翻訳完了
システム内更新日: 2026-06-16 16:21:34.069821
Title: teasr: training-efficient any-step diffusion transformer for real-world image super-resolution
Title（参考訳）: ティーザー:現実世界の超高解像度画像のためのトレーニング効率の非ステップ拡散トランスフォーマー
Authors: Xiang Gao, Chenxin Zhu, Yushun Fang, Qiang Hu, Xiaoyun Zhang,
Abstract要約: TEASRはReal-ISRのためのトレーニング効率の良い任意のステップ拡散フレームワークである。我々のキーとなる考え方は、単一拡散モデル内で自己逆蒸留を行うことである。ノイズレベルの一段階生成を安定化する時間ステップ対応補正戦略を提案する。
参考スコア（独自算出の注目度）: 10.733502031936958
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Diffusion models excel in Real-World Image Super-Resolution (Real-ISR) due to their powerful generative priors but suffer from slow iterative sampling. Although existing one-step distillation methods accelerate inference, they typically require auxiliary teacher models that inflate training memory and restrict scalability to large-scale architectures. Furthermore, these fixed-step models lack the flexibility to trade off speed for quality. In this paper, we propose TEASR, a training-efficient any-step diffusion framework for Real-ISR that enables both one-step and multi-step restoration within a unified model. Our key idea is to perform self-adversarial distillation within a single diffusion model, eliminating the need for auxiliary teachers or discriminators. Specifically, we propose a timestep-aware rectification strategy that stabilizes one-step generation across noise levels. These two designs further enables the distillation of 20B-parameter diffusion models on a single GPU, significantly improving training efficiency. Moreover, we introduce a dual-branch diffusion transformer with decoupled timestep condition to separate the current noise state and the denoising target to enhance sampling quality. Extensive experiments demonstrate that TEASR supports seamless any-step sampling and consistently outperforms state-of-the-art methods across multiple datasets.
Abstract（参考訳）: 拡散モデルは、実世界の超解像(Real-ISR)において、強力な生成前駆体により優れているが、反復サンプリングが遅い。既存の一段階蒸留法は推論を加速するが、訓練用メモリを膨張させ、大規模アーキテクチャに拡張性を制限する補助的な教師モデルを必要とするのが一般的である。さらに、これらの固定ステップモデルは、品質のためにスピードをトレードオフする柔軟性に欠けています。本稿では,Real-ISRのためのトレーニング効率の高い任意のステップ拡散フレームワークであるTEASRを提案する。我々の鍵となる考え方は、単一拡散モデル内で自己共分散蒸留を行うことであり、補助教師や差別者の必要性をなくすことである。具体的には、ノイズレベルの一段階生成を安定化する時間ステップ対応補正戦略を提案する。これらの2つの設計により、1つのGPU上で20Bパラメータ拡散モデルの蒸留が可能となり、トレーニング効率が大幅に向上した。さらに,分離した時間ステップ条件のデュアルブランチ拡散変圧器を導入し,現在のノイズ状態とデノナイジングターゲットを分離し,サンプリング品質を向上させる。大規模な実験により、TEASRはシームレスな任意のステップサンプリングをサポートし、複数のデータセットにわたる最先端メソッドを一貫して上回っていることが示されている。

論文の概要: teasr: training-efficient any-step diffusion transformer for real-world image super-resolution

関連論文リスト