Fugu-MT 論文翻訳(概要): Are We Really Learning the Score Function? Reinterpreting Diffusion Models Through Wasserstein Gradient Flow Matching

論文の概要: Are We Really Learning the Score Function? Reinterpreting Diffusion Models Through Wasserstein Gradient Flow Matching

arxiv url: http://arxiv.org/abs/2509.00336v1
Date: Sat, 30 Aug 2025 03:30:22 GMT
ステータス: 翻訳完了
システム内更新日: 2025-09-04 15:17:03.1841
Title: Are We Really Learning the Score Function? Reinterpreting Diffusion Models Through Wasserstein Gradient Flow Matching
Title（参考訳）: スコア関数は本当に学習されているか? Wasserstein Gradient Flow Matching による拡散モデルの再解釈
Authors: An B. Vuong, Michael T. McCann, Javier E. Santos, Yen Ting Lin,
Abstract要約: トレーニングされた拡散ネットワークが真のスコア関数に必要な積分的制約と微分的制約の両方に反することを示す。拡散学習は、WGF(Wasserstein Gradient Flow)の流速場に適合する流れとして理解されている。本研究は, 拡散生成モデルを理解するための原理的, エレガント, 理論的基礎的な枠組みとしてWGFの観点を採用することを提唱する。
参考スコア（独自算出の注目度）: 6.821102133726069
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Diffusion models are commonly interpreted as learning the score function, i.e., the gradient of the log-density of noisy data. However, this assumption implies that the target of learning is a conservative vector field, which is not enforced by the neural network architectures used in practice. We present numerical evidence that trained diffusion networks violate both integral and differential constraints required of true score functions, demonstrating that the learned vector fields are not conservative. Despite this, the models perform remarkably well as generative mechanisms. To explain this apparent paradox, we advocate a new theoretical perspective: diffusion training is better understood as flow matching to the velocity field of a Wasserstein Gradient Flow (WGF), rather than as score learning for a reverse-time stochastic differential equation. Under this view, the "probability flow" arises naturally from the WGF framework, eliminating the need to invoke reverse-time SDE theory and clarifying why generative sampling remains successful even when the neural vector field is not a true score. We further show that non-conservative errors from neural approximation do not necessarily harm density transport. Our results advocate for adopting the WGF perspective as a principled, elegant, and theoretically grounded framework for understanding diffusion generative models.
Abstract（参考訳）: 拡散モデルは一般に、スコア関数、すなわちノイズデータの対数密度の勾配を学ぶものとして解釈される。しかし、この仮定は、学習の対象が保守的なベクトル場であり、実際にはニューラルネットワークアーキテクチャによって強制されないことを意味している。本稿では、学習されたベクトル場が保守的でないことを示すため、訓練された拡散ネットワークが真のスコア関数に必要な積分的制約と微分的制約の両方に違反することを示す。それにもかかわらず、モデルは非常に優れた生成機構として機能する。拡散トレーニングは、逆時間確率微分方程式のスコア学習よりも、WGF(Wasserstein Gradient Flow)の速度場に一致する流れとして理解されている。この観点では、「確率フロー」はWGFフレームワークから自然に発生し、逆時間SDE理論を呼び出す必要性を排除し、ニューラルベクトル場が真のスコアではない場合でもなぜ生成的サンプリングが成功し続けるのかを明らかにする。さらに,神経近似による非保存誤差が必ずしも密度輸送を損なわないことを示す。本研究は, 拡散生成モデルを理解するための原理的, エレガント, 理論的基礎的な枠組みとしてWGFの観点を採用することを提唱する。

論文の概要: Are We Really Learning the Score Function? Reinterpreting Diffusion Models Through Wasserstein Gradient Flow Matching

関連論文リスト