Fugu-MT 論文翻訳(概要): SeedVR2: One-Step Video Restoration via Diffusion Adversarial Post-Training

論文の概要: SeedVR2: One-Step Video Restoration via Diffusion Adversarial Post-Training

arxiv url: http://arxiv.org/abs/2506.05301v1
Date: Thu, 05 Jun 2025 17:51:05 GMT
ステータス: 翻訳完了
システム内更新日: 2025-06-06 21:53:49.872548
Title: SeedVR2: One-Step Video Restoration via Diffusion Adversarial Post-Training
Title（参考訳）: SeedVR2:拡散逆行によるワンステップビデオ再生
Authors: Jianyi Wang, Shanchuan Lin, Zhijie Lin, Yuxi Ren, Meng Wei, Zongsheng Yue, Shangchen Zhou, Hao Chen, Yang Zhao, Ceyuan Yang, Xuefeng Xiao, Chen Change Loy, Lu Jiang,
Abstract要約: 実データに対する対角的VRトレーニングを行うセドVR2と呼ばれる一段階拡散型VRモデルを提案する。単一ステップで高精細度VRを扱うために、モデルアーキテクチャとトレーニング手順の両方にいくつかの拡張を導入する。
参考スコア（独自算出の注目度）: 82.68200031146299
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Recent advances in diffusion-based video restoration (VR) demonstrate significant improvement in visual quality, yet yield a prohibitive computational cost during inference. While several distillation-based approaches have exhibited the potential of one-step image restoration, extending existing approaches to VR remains challenging and underexplored, particularly when dealing with high-resolution video in real-world settings. In this work, we propose a one-step diffusion-based VR model, termed as SeedVR2, which performs adversarial VR training against real data. To handle the challenging high-resolution VR within a single step, we introduce several enhancements to both model architecture and training procedures. Specifically, an adaptive window attention mechanism is proposed, where the window size is dynamically adjusted to fit the output resolutions, avoiding window inconsistency observed under high-resolution VR using window attention with a predefined window size. To stabilize and improve the adversarial post-training towards VR, we further verify the effectiveness of a series of losses, including a proposed feature matching loss without significantly sacrificing training efficiency. Extensive experiments show that SeedVR2 can achieve comparable or even better performance compared with existing VR approaches in a single step.
Abstract（参考訳）: 近年の拡散型ビデオ再生(VR)の進歩は視覚的品質を著しく向上させたが、推論の際には計算コストが禁じられている。蒸留法に基づくいくつかのアプローチは、一段階のイメージ復元の可能性を示しているが、既存のVRへのアプローチは、特に現実世界で高解像度のビデオを扱う際には、困難かつ未探索のままである。本研究では,実データに対する対角的VRトレーニングを行うセドVR2と呼ばれる一段階拡散型VRモデルを提案する。単一ステップで高精細度VRを扱うために、モデルアーキテクチャとトレーニング手順の両方にいくつかの拡張を導入する。具体的には、ウィンドウサイズを動的に調整して出力解像度に適合させる適応型ウィンドウアテンション機構を提案する。また,VRに対する対戦後学習の安定化と改善を図るため,提案した特徴マッチング損失を含む一連の損失の有効性を,トレーニング効率を著しく損なうことなく検証する。大規模な実験により、SeedVR2は1ステップで既存のVRアプローチと同等またはそれ以上のパフォーマンスを達成できることが示された。

論文の概要: SeedVR2: One-Step Video Restoration via Diffusion Adversarial Post-Training

関連論文リスト