Fugu-MT 論文翻訳(概要): DVFace: Spatio-Temporal Dual-Prior Diffusion for Video Face Restoration

論文の概要: DVFace: Spatio-Temporal Dual-Prior Diffusion for Video Face Restoration

arxiv url: http://arxiv.org/abs/2604.14560v1
Date: Thu, 16 Apr 2026 02:42:31 GMT
ステータス: 翻訳完了
システム内更新日: 2026-04-17 21:29:31.690998
Title: DVFace: Spatio-Temporal Dual-Prior Diffusion for Video Face Restoration
Title（参考訳）: DVFace:ビデオ顔復元のための時空間二重拡散
Authors: Zheng Chen, Bowen Chai, Rongjun Gao, Mingtao Nie, Xi Li, Bingnan Duan, Jianping Fang, Xiaohong Liu, Linghe Kong, Yulun Zhang,
Abstract要約: 顔の復元は、劣化した顔のビデオを高品質な細部、安定したアイデンティティ、時間的コヒーレンスに拡張することを目的としている。近年の拡散法は、再生に強い先行性をもたらし、より現実的な合成を可能にしている。実世界の顔復元のための一段階拡散フレームワークであるDVFaceを提案する。
参考スコア（独自算出の注目度）: 38.685358794153736
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Video face restoration aims to enhance degraded face videos into high-quality results with realistic facial details, stable identity, and temporal coherence. Recent diffusion-based methods have brought strong generative priors to restoration and enabled more realistic detail synthesis. However, existing approaches for face videos still rely heavily on generic diffusion priors and multi-step sampling, which limit both facial adaptation and inference efficiency. These limitations motivate the use of one-step diffusion for video face restoration, yet achieving faithful facial recovery alongside temporally stable outputs remains challenging. In this paper, we propose, DVFace, a one-step diffusion framework for real-world video face restoration. Specifically, we introduce a spatio-temporal dual-codebook design to extract complementary spatial and temporal facial priors from degraded videos. We further propose an asymmetric spatio-temporal fusion module to inject these priors into the diffusion backbone according to their distinct roles. Evaluation on various benchmarks shows that DVFace delivers superior restoration quality, temporal consistency, and identity preservation compared to recent methods. Code: https://github.com/zhengchen1999/DVFace.
Abstract（参考訳）: ビデオの顔の復元は、劣化した顔のビデオを、現実的な顔の詳細、安定したアイデンティティ、時間的コヒーレンスで高品質な結果にすることを目的としている。近年の拡散法は、再生に強い先行性をもたらし、より現実的な詳細合成を可能にしている。しかし、既存の顔ビデオのアプローチは、顔適応と推論効率の両方を制限する一般的な拡散先行と多段階サンプリングに大きく依存している。これらの制限は、ビデオ顔の復元にワンステップ拡散を利用する動機となっているが、時間的に安定した出力とともに忠実な顔回復を実現することは、依然として困難である。本稿では,実世界の顔復元のための一段階拡散フレームワークであるDVFaceを提案する。具体的には、劣化したビデオから相補的な空間的・時間的顔の先行情報を抽出する時空間二重コードブックの設計を提案する。さらに、これらの前駆体を異なる役割に応じて拡散バックボーンに注入する非対称時空間融合モジュールを提案する。各種ベンチマークによる評価から,DVFaceは最近の手法と比較して,回復性,時間的整合性,アイデンティティの保存性に優れていた。コード:https://github.com/zhengchen 1999/DVFace

論文の概要: DVFace: Spatio-Temporal Dual-Prior Diffusion for Video Face Restoration

関連論文リスト