Fugu-MT 論文翻訳(概要): On-Policy Adversarial Flow Distillation for Autoregressive Video Generation

論文の概要: On-Policy Adversarial Flow Distillation for Autoregressive Video Generation

arxiv url: http://arxiv.org/abs/2605.26105v1
Date: Mon, 25 May 2026 17:58:12 GMT
ステータス: 翻訳完了
システム内更新日: 2026-05-26 19:50:20.654459
Title: On-Policy Adversarial Flow Distillation for Autoregressive Video Generation
Title（参考訳）: 自己回帰ビデオ生成のためのオンライン逆流蒸留
Authors: Yang Luo, Shengju Qian, Xiaohang Tang, Zirui Zhu, Yong Liu, Xin Wang, Yang You,
Abstract要約: 逆流蒸留(英: Adversarial Flow Distillation、AFD)は、異種ブラックボックスビデオ蒸留のためのオンラインフレームワークである。 AFDは教師に問い合わせ、同じプロンプトで現在の生徒をロールアウトする。 AFDは、教師のスコア、潜伏者、軌道の騒音、ステップアライメント、逆鎖強化学習を必要とせず、高密度な速度場監視を提供する。
参考スコア（独自算出の注目度）: 34.70887671905755
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Autoregressive video generators are attractive for streaming, long-horizon, and interactive applications, but distilling strong black-box teachers into causal students remains difficult. The student must learn under its own rollout distribution, whereas practical teachers may expose only prompt-conditioned completed videos and may differ in architecture, capacity, temporal design, and sampling schedule. This interface makes supervised fine-tuning off-policy, score-based distillation inapplicable, and direct adversarial imitation too sparse for denoising-time credit assignment. We propose Adversarial Flow Distillation (AFD), an on-policy framework for heterogeneous black-box video distillation. AFD queries the teacher and rolls out the current student on the same prompts, trains a prompt-paired Bradley-Terry discriminator to estimate clean-sample teacher-student discrepancy, and converts the resulting on-policy advantage into forward-process flow-matching updates on the student's own noised states. Thus, AFD provides dense velocity-field supervision while requiring no teacher scores, latents, denoising trajectories, step alignment, or reverse-chain reinforcement learning. Experiments across two causal AR student families show that AFD consistently improves motion- and physics-sensitive generation while preserving general video quality, and ablations validate the importance of adaptive on-policy feedback and forward-process credit assignment. The method requires only clean teacher videos and student rollouts, providing a practical route for distilling proprietary or heterogeneous video generators into efficient autoregressive students.
Abstract（参考訳）: 自動回帰ビデオジェネレータは、ストリーミング、ロングホライゾン、インタラクティブなアプリケーションには魅力的なものだが、強力なブラックボックス教師を因果学生に蒸留することは依然として困難である。実践的な教師は、即席で完成したビデオのみを公開することができ、建築、能力、時間的デザイン、サンプリングスケジュールが異なる可能性がある。このインターフェースは、監督された微調整のオフポリシー、スコアベースの蒸留の適用不可能、直接の敵の模倣を、デノナイジングタイムのクレジット代入には小さすぎる。不均一なブラックボックスビデオ蒸留のためのオンラインフレームワークであるAdversarial Flow Distillation (AFD)を提案する。 AFDは教師に問い合わせて、現在の生徒を同じプロンプトでロールアウトさせ、即興のBradley-Terry判別器を訓練し、クリーンサンプルの教師/学生間の不一致を推定し、その結果得られる政治上の優位性を、生徒自身のノイズのある状態における前処理のフローマッチング更新に変換する。このように、AFDは教師のスコア、潜伏者、軌道の騒音、ステップアライメント、逆鎖強化学習を必要とせず、高密度な速度場監視を提供する。 2つの因果的AR学生を対象とした実験では、AFDは一般的なビデオ品質を維持しながら、動きや物理に敏感な生成を一貫して改善し、アブリケーションは、適応的なオン・ポリティクスフィードバックとフォワード・プロセス・クレジット・アサインの重要性を検証している。この方法は、クリーンな教師のビデオと学生のロールアウトだけを必要とし、プロプライエタリまたは異種のビデオジェネレータを効率よく自己回帰的な学生に蒸留する実用的な方法を提供する。

論文の概要: On-Policy Adversarial Flow Distillation for Autoregressive Video Generation

関連論文リスト