Fugu-MT 論文翻訳(概要): The critical slowing down in diffusion models

論文の概要: The critical slowing down in diffusion models

arxiv url: http://arxiv.org/abs/2605.12597v2
Date: Wed, 20 May 2026 15:46:04 GMT
ステータス: 翻訳完了
システム内更新日: 2026-05-21 19:19:56.20039
Title: The critical slowing down in diffusion models
Title（参考訳）: 拡散モデルにおける臨界減速
Authors: Luca Maria Del Bono, Giulio Biroli, Patrick Charbonneau, Marylou Gabrié,
Abstract要約: パラメータ学習において,正確な解と一致する1層ネットワークアーキテクチャを用いてスコアモデルをトレーニングすると,パラメータ学習における臨界速度低下の一形態が示される。この速度低下は生成過程にも影響し、学習された生成モデルでさえ、臨界点近くをサンプリングすることのよく知られた困難さが持続することを示している。 2層アーキテクチャを使用することで、システムサイズを2次的にではなく、対数的にスケールするトレーニング時間によって、致命的な遅延を劇的に削減できることがわかった。
参考スコア（独自算出の注目度）: 8.207196072624464
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Computational sampling has been central to the sciences since the mid-20th century. While machine-learning-based approaches have recently enabled major advances, their behavior remains poorly understood, with limited theoretical control over when and why they succeed. Here we provide such insight for diffusion models-a class of generative schemes highly effective in practice-by analyzing their application to the $O(n)$ model of statistical field theory in the Gaussian limit $n \to \infty$. In this analytically tractable setting, we show that training a score model with a one-layer network architecture matching the exact solution exhibits a form of critical slowing down in parameter learning. This slowing down also impacts the generation process, indicating that the well-known difficulties of sampling near criticality persist even for learned generative models. To overcome this bottleneck, we demonstrate the power of combining architectural depth with physical locality. We find that using a two-layer architecture drastically reduces the critical slowing down, with the training time scaling logarithmically rather than quadratically with system size. By introducing a local score approximation we show that this acceleration in training time can be achieved without increasing the number of neural network parameters. Taken together, these results demonstrate that diffusion models can overcome the critical slowing down through appropriate architectural design, and establish a controlled framework for understanding and improving learned sampling methods in statistical physics and beyond.
Abstract（参考訳）: 計算サンプリングは20世紀中頃から科学の中心となっている。機械学習に基づくアプローチは、最近大きな進歩を可能にしたが、その振る舞いは理解されていないままであり、いつ、なぜ成功するかに関する理論的な制御は限られている。ガウス極限$n \to \infty$ における統計場理論の $O(n)$ モデルへの応用を分析することによって、そのような拡散モデルに対する洞察を与える。本稿では, パラメータ学習において, 正確な解と一致する1層ネットワークアーキテクチャを用いてスコアモデルのトレーニングを行うことにより, パラメータ学習における臨界速度低下の一形態が示されることを示す。この減速は生成過程にも影響し、学習された生成モデルでさえ、臨界点近くをサンプリングすることのよく知られた困難さが持続することを示している。このボトルネックを克服するために、アーキテクチャの深さと物理的局所性を組み合わせる力を示す。 2層アーキテクチャを使用することで、システムサイズを2次的にではなく、対数的にスケールするトレーニング時間によって、致命的な遅延を劇的に削減できることがわかった。局所的なスコア近似を導入することで、ニューラルネットワークパラメータの数を増やすことなく、トレーニング時間のこの加速度を達成できることが示される。これらの結果は、拡散モデルが適切なアーキテクチャ設計を通じて臨界減速を克服し、統計物理学等における学習されたサンプリング手法の理解と改善のための制御された枠組みを確立することを実証する。

論文の概要: The critical slowing down in diffusion models

関連論文リスト