Fugu-MT 論文翻訳(概要): Generalization Properties of Score-matching Diffusion Models for Intrinsically Low-dimensional Data

論文の概要: Generalization Properties of Score-matching Diffusion Models for Intrinsically Low-dimensional Data

arxiv url: http://arxiv.org/abs/2603.03700v1
Date: Wed, 04 Mar 2026 03:59:02 GMT
ステータス: 翻訳完了
システム内更新日: 2026-03-05 21:29:15.175035
Title: Generalization Properties of Score-matching Diffusion Models for Intrinsically Low-dimensional Data
Title（参考訳）: 固有低次元データに対するスコアマッチング拡散モデルの一般化特性
Authors: Saptarshi Chakraborty, Quentin Berthet, Peter L. Bartlett,
Abstract要約: 有限個のサンプルから未知分布の$$を学習するためのスコアベース拡散モデルの統計的収束について検討する。以上の結果から,拡散モデルがデータ固有の幾何学に自然に適応していることが示唆された。我々の理論は, 拡散モデルの解析を, GANと最適輸送で確立された急激なミニマックス速度で橋渡しするものである。
参考スコア（独自算出の注目度）: 32.72306410557258
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Despite the remarkable empirical success of score-based diffusion models, their statistical guarantees remain underdeveloped. Existing analyses often provide pessimistic convergence rates that do not reflect the intrinsic low-dimensional structure common in real data, such as that arising in natural images. In this work, we study the statistical convergence of score-based diffusion models for learning an unknown distribution $μ$ from finitely many samples. Under mild regularity conditions on the forward diffusion process and the data distribution, we derive finite-sample error bounds on the learned generative distribution, measured in the Wasserstein-$p$ distance. Unlike prior results, our guarantees hold for all $p \ge 1$ and require only a finite-moment assumption on $μ$, without compact-support, manifold, or smooth-density conditions. Specifically, given $n$ i.i.d.\ samples from $μ$ with finite $q$-th moment and appropriately chosen network architectures, hyperparameters, and discretization schemes, we show that the expected Wasserstein-$p$ error between the learned distribution $\hatμ$ and $μ$ scales as $\mathbb{E}\, \mathbb{W}_p(\hatμ,μ) = \widetilde{O}\!\left(n^{-1 / d^\ast_{p,q}(μ)}\right),$ where $d^\ast_{p,q}(μ)$ is the $(p,q)$-Wasserstein dimension of $μ$. Our results demonstrate that diffusion models naturally adapt to the intrinsic geometry of data and mitigate the curse of dimensionality, since the convergence rate depends on $d^\ast_{p,q}(μ)$ rather than the ambient dimension. Moreover, our theory conceptually bridges the analysis of diffusion models with that of GANs and the sharp minimax rates established in optimal transport. The proposed $(p,q)$-Wasserstein dimension also extends classical Wasserstein dimension notions to distributions with unbounded support, which may be of independent theoretical interest.
Abstract（参考訳）: スコアベース拡散モデルの顕著な経験的成功にもかかわらず、それらの統計的保証は未発達のままである。既存の分析は、自然画像のような実データに共通する本質的な低次元構造を反映しない悲観的収束率を与えることが多い。本研究では,有限個のサンプルから未知分布の$μ$を学習するためのスコアベース拡散モデルの統計的収束について検討する。前方拡散過程とデータ分布に関する穏やかな規則性条件の下では、ワッサーシュタイン-$p$距離で測定された学習された生成分布上の有限サンプル誤差境界を導出する。以前の結果とは異なり、我々の保証はすべての$p \ge 1$ を保ち、コンパクトな支持、多様体、あるいは滑らかな密度条件なしで、$μ$ 上の有限モーメントの仮定しか必要としない。具体的には、$n$ i.i.d.\サンプルが$μ$から有限$q$-秒のモーメントを持ち、適切に選択されたネットワークアーキテクチャ、ハイパーパラメータ、離散化スキームによって与えられると、学習した分布の$\hatμ$と$μ$スケールの間で期待されるWasserstein-$p$エラーが$\mathbb{E}\, \mathbb{W}_p(\hatμ,μ) = \widetilde{O}\! \left(n^{-1 / d^\ast_{p,q}(μ)}\right),$ ここで$d^\ast_{p,q}(μ)$は$(p,q)$-ワッサーシュタイン次元$μ$である。本結果は,拡散モデルがデータ固有の幾何学に自然に適応し,次元の呪いを軽減することを示し,収束速度は周囲次元よりも$d^\ast_{p,q}(μ)$に依存する。さらに, この理論は, 拡散モデルとGANの拡散モデルの解析と, 最適輸送で確立された急激なミニマックス速度を橋渡しするものである。提案された$(p,q)$-ワッサーシュタイン次元はまた、古典的なワッサーシュタイン次元の概念を非有界な分布に拡張する。

論文の概要: Generalization Properties of Score-matching Diffusion Models for Intrinsically Low-dimensional Data

関連論文リスト