Fugu-MT 論文翻訳(概要): Normalizing Trajectory Models

論文の概要: Normalizing Trajectory Models

arxiv url: http://arxiv.org/abs/2605.08078v2
Date: Tue, 12 May 2026 22:42:27 GMT
ステータス: 翻訳完了
システム内更新日: 2026-05-14 17:13:58.799089
Title: Normalizing Trajectory Models
Title（参考訳）: 軌道模型の正規化
Authors: Jiatao Gu, Tianrong Chen, Ying Shen, David Berthelot, Shuangfei Zhai, Josh Susskind,
Abstract要約: 正規化軌道モデル(NTM)を導入する NTMは、それぞれの逆ステップを、正確な確度トレーニングを伴う表現的条件正規化フローとしてモデル化する。テキスト・ツー・イメージのベンチマークでは、NTMは4つのサンプリングステップで強力な画像生成ベースラインにマッチする。
参考スコア（独自算出の注目度）: 36.48526680200944
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Diffusion-based models decompose sampling into many small Gaussian denoising steps -- an assumption that breaks down when generation is compressed to a few coarse transitions. Existing few-step methods address this through distillation, consistency training, or adversarial objectives, but sacrifice the likelihood framework in the process. We introduce Normalizing Trajectory Models (NTM), which models each reverse step as an expressive conditional normalizing flow with exact likelihood training. Architecturally, NTM combines shallow invertible blocks within each step with a deep parallel predictor across the trajectory, forming an end-to-end network trainable from scratch or initializable from pretrained flow-matching models. Its exact trajectory likelihood further enables self-distillation: a lightweight denoiser trained on the model's own score produces high-quality samples in four steps. On text-to-image benchmarks, NTM matches or outperforms strong image generation baselines in just four sampling steps while uniquely retaining exact likelihood over the generative trajectory.
Abstract（参考訳）: 拡散に基づくモデルは、サンプリングを多くの小さなガウスの分解ステップに分解する -- 生成がいくつかの粗い遷移に圧縮されたときに分解される仮定。既存の数段階の方法は、蒸留、一貫性のトレーニング、あるいは敵の目標を通じてこの問題に対処するが、プロセスにおける可能性の枠組みを犠牲にする。そこで我々は,各ステップの逆ステップを,正確な精度のトレーニングで表現的条件付き正規化フローとしてモデル化する正規化軌道モデル(NTM)を提案する。アーキテクチャ的には、NTMは各ステップ内の浅い非可逆ブロックと軌道を横断する深い並列予測器を結合し、スクラッチからトレーニング可能なエンドツーエンドネットワークや、事前訓練されたフローマッチングモデルから初期化可能なネットワークを形成する。モデル自身のスコアに基づいて訓練された軽量デノイザーは、4ステップで高品質なサンプルを生成する。テキスト・ツー・イメージのベンチマークでは、NTMは4つのサンプリングステップで強力な画像生成ベースラインにマッチまたは性能を向上すると同時に、生成軌跡に対する正確な確率を独自に保持する。

論文の概要: Normalizing Trajectory Models

関連論文リスト