Fugu-MT 論文翻訳(概要): Multilevel Diffusion: Infinite Dimensional Score-Based Diffusion Models for Image Generation

論文の概要: Multilevel Diffusion: Infinite Dimensional Score-Based Diffusion Models for Image Generation

arxiv url: http://arxiv.org/abs/2303.04772v4
Date: Sat, 19 Oct 2024 03:10:43 GMT
ステータス: 翻訳完了
システム内更新日: 2024-11-28 17:07:28.677751
Title: Multilevel Diffusion: Infinite Dimensional Score-Based Diffusion Models for Image Generation
Title（参考訳）: 多値拡散:画像生成のための無限次元スコアベース拡散モデル
Authors: Paul Hagemann, Sophie Mildenberger, Lars Ruthotto, Gabriele Steidl, Nicole Tianjiao Yang,
Abstract要約: スコアベース拡散モデル (SBDM) は画像生成のための最先端のアプローチとして登場した。本稿では, 無限次元のSBDM, すなわち, 矩形領域でサポートされている関数としてトレーニングデータをモデル化する。無限次元設定において、現在のSBDMアプローチの2つの欠点を克服する方法を実証する。
参考スコア（独自算出の注目度）: 2.5556910002263984
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Score-based diffusion models (SBDM) have recently emerged as state-of-the-art approaches for image generation. Existing SBDMs are typically formulated in a finite-dimensional setting, where images are considered as tensors of finite size. This paper develops SBDMs in the infinite-dimensional setting, that is, we model the training data as functions supported on a rectangular domain. In addition to the quest for generating images at ever-higher resolutions, our primary motivation is to create a well-posed infinite-dimensional learning problem that we can discretize consistently on multiple resolution levels. We thereby intend to obtain diffusion models that generalize across different resolution levels and improve the efficiency of the training process. We demonstrate how to overcome two shortcomings of current SBDM approaches in the infinite-dimensional setting. First, we modify the forward process using trace class operators to ensure that the latent distribution is well-defined in the infinite-dimensional setting and derive the reverse processes for finite-dimensional approximations. Second, we illustrate that approximating the score function with an operator network is beneficial for multilevel training. After deriving the convergence of the discretization and the approximation of multilevel training, we demonstrate some practical benefits of our infinite-dimensional SBDM approach on a synthetic Gaussian mixture example, the MNIST dataset, and a dataset generated from a nonlinear 2D reaction-diffusion equation.
Abstract（参考訳）: スコアベース拡散モデル(SBDM)は画像生成のための最先端のアプローチとして最近登場した。既存のSBDMは通常有限次元の設定で定式化され、画像は有限サイズのテンソルと見なされる。本稿では, 無限次元のSBDM, すなわち, 矩形領域でサポートされている関数としてトレーニングデータをモデル化する。常に高解像度で画像を生成することの探求に加えて、我々の主な動機は、複数の解像度レベルで一貫した離散化が可能な、よく考えられた無限次元の学習問題を作ることである。そこで我々は,様々な解像度レベルにまたがって一般化し,学習プロセスの効率化を図る拡散モデルを得る。無限次元設定において、現在のSBDMアプローチの2つの欠点を克服する方法を実証する。まず、トレースクラス演算子を用いて前方過程を変更し、無限次元の設定において潜伏分布が適切に定義されていることを保証し、有限次元近似の逆過程を導出する。第2に,演算子ネットワークによるスコア関数の近似は,多段階学習に有用であることを示す。離散化の収束とマルチレベルトレーニングの近似を導出した後、合成ガウス混合例、MNISTデータセット、非線形2次元反応拡散方程式から生成されたデータセットに対して、無限次元SBDMアプローチの実用的利点を示す。

論文の概要: Multilevel Diffusion: Infinite Dimensional Score-Based Diffusion Models for Image Generation

関連論文リスト