Fugu-MT 論文翻訳(概要): RAVEN: Real-time Autoregressive Video Extrapolation with Consistency-model GRPO

論文の概要: RAVEN: Real-time Autoregressive Video Extrapolation with Consistency-model GRPO

arxiv url: http://arxiv.org/abs/2605.15190v1
Date: Thu, 14 May 2026 17:59:30 GMT
ステータス: 翻訳完了
システム内更新日: 2026-05-15 21:45:35.016805
Title: RAVEN: Real-time Autoregressive Video Extrapolation with Consistency-model GRPO
Title（参考訳）: RAVEN: 一貫性モデルGRPOを用いたリアルタイム自動回帰ビデオ外挿
Authors: Yanzuo Lu, Ronglai Zuo, Jiankang Deng,
Abstract要約: 因果自己回帰ビデオ拡散モデルは、以前生成されたコンテンツから将来のチャンクを外挿することでリアルタイムストリーミング生成をサポートする。本稿では,リアルタイム自動回帰ビデオ補間ネットワーク(RAVEN)を紹介した。これは,各自己ロールアウトを,クリーンな歴史的エンドポイントのインターリーブシーケンスに再パッケージするトレーニングタイムテストフレームワークである。
参考スコア（独自算出の注目度）: 53.38929612273108
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Causal autoregressive video diffusion models support real-time streaming generation by extrapolating future chunks from previously generated content. Distilling such generators from high-fidelity bidirectional teachers yields competitive few-step models, yet a persistent gap between the history distributions encountered during training and those arising at inference constrains generation quality over long horizons. We introduce the Real-time Autoregressive Video Extrapolation Network (RAVEN), a training-time test framework that repacks each self rollout into an interleaved sequence of clean historical endpoints and noisy denoising states. This formulation aligns training attention with inference-time extrapolation and allows downstream chunk losses to supervise the history representations on which future predictions depend. We further propose Consistency-model Group Relative Policy Optimization (CM-GRPO), which reformulates a consistency sampling step as a conditional Gaussian transition and applies online Reinforcement Learning (RL) directly to this kernel, avoiding the Euler-Maruyama auxiliary process adopted in prior flow-model RL formulations. Experiments demonstrate that RAVEN surpasses recent causal video distillation baselines across quality, semantic, and dynamic degree evaluations, and that CM-GRPO provides further gains when combined with RAVEN.
Abstract（参考訳）: 因果自己回帰ビデオ拡散モデルは、以前生成されたコンテンツから将来のチャンクを外挿することでリアルタイムストリーミング生成をサポートする。高忠実度双方向教師からこれらのジェネレータを蒸留すると、競争力のある数ステップモデルが得られるが、トレーニング中に遭遇した履歴分布と推論制約で発生するものとは、長い地平線上での世代品質の持続的なギャップがある。本稿では,実時間自動回帰ビデオ補間ネットワーク(RAVEN)を紹介した。これは,各自己ロールアウトを,クリーンな歴史的エンドポイントとノイズの多い騒音のある状態のインターリーブシーケンスに再パッケージする訓練時テストフレームワークである。この定式化は、トレーニングの注意を推論時外挿と整合させ、下流のチャンクの損失を、将来の予測が依存する履歴表現を監督することを可能にする。さらに、コンシステンシーモデルグループ相対政策最適化(CM-GRPO)を提案し、コンシステンシーサンプリングステップを条件付きガウス遷移として再構成し、オンライン強化学習(RL)をこのカーネルに直接適用し、以前のフローモデルRLの定式化で採用されるオイラー・丸山補助プロセスを回避する。実験により、RAVENは、品質、意味、動的度の評価において、最近の因果ビデオ蒸留ベースラインを超越し、CM-GRPOはRAVENと組み合わせることでさらなる利益をもたらすことが示された。

論文の概要: RAVEN: Real-time Autoregressive Video Extrapolation with Consistency-model GRPO

関連論文リスト