Fugu-MT Paper Translation (Abstract): Fractals made Practical: Denoising Diffusion as Partitioned Iterated Function Systems

Paper overview: Fractals made Practical: Denoising Diffusion as Partitioned Iterated Function Systems

arxiv url: http://arxiv.org/abs/2603.13069v1
Date: Fri, 13 Mar 2026 15:15:50 GMT
Status: Translation complete
Last updated in system: 2026-03-16 17:38:12.152031
Title: Fractals made Practical: Denoising Diffusion as Partitioned Iterated Function Systems
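Title (reference translation): Fractals Made Practical: Denoising Diffusion as Partitioned Iterated Function Systems
Authors: Ann Dooms
Abstract summary: We show that the DDIM reverse chain operates as a Partitioned Iterated Function System (PIFS). The PIFS framework serves as a unified design language for describing diffusion model schedules, architectures, and training objectives.
Reference score (site-computed attention score): 0.0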
License: http://creativecommons.org/licenses/by/4.0/
Abstract: What is a diffusion model actually doing when it turns noise into a photograph? We show that the deterministic DDIM reverse chain operates as a Partitioned Iterated Function System (PIFS) and that this framework serves as a unified design language for denoising diffusion model schedules, architectures, and training objectives. From the PIFS structure we derive three computable geometric quantities: a per-step contraction threshold $L^*_t$, a diagonal expansion function $f_t(λ)$ and a global expansion threshold $λ^{**}$. These quantities require no model evaluation and fully characterize the denoising dynamics. They structurally explain the two-regime behavior of diffusion models: global context assembly at high noise via diffuse cross-patch attention and fine-detail synthesis at low noise via patch-by-patch suppression release in strict variance order. Self-attention emerges as the natural primitive for PIFS contraction. The Kaplan-Yorke dimension of the PIFS attractor is determined analytically through a discrete Moran equation on the Lyapunov spectrum. Through the study of the fractal geometry of the PIFS, we derive three optimal design criteria and show that four prominent empirical design choices (the cosine schedule offset, resolution-dependent logSNR shift, Min-SNR loss weighting, and Align Your Steps sampling) each arise as approximate solutions to our explicit geometric optimization problems tuning theory into practice.
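Abstract (reference translation): What is a diffusion model actually doing when it turns noise into a photograph? We show that the deterministic DDIM reverse chain operates as a Partitioned Iterated Function System (PIFS), and that this framework serves as a unified design language for describing diffusion model schedules, architectures, and training objectives. From the PIFS structure we obtain three computable geometric quantities: a per-step contraction threshold $L^*_t$, a diagonal expansion function $f_t(λ)$, and a global expansion threshold $λ^{**}$. These quantities require no model evaluation and fully characterize the denoising dynamics. They structurally explain the two-regime behavior of diffusion models: global context assembly at high noise via diffuse cross-patch attention, and fine-detail synthesis at low noise via patch-by-patch suppression release in strict variance order. Self-attention emerges as the natural primitive for PIFS contraction. The Kaplan-Yorke dimension of the PIFS attractor is determined analytically through a discrete Moran equation on the Lyapunov spectrum. Through the study of the fractal geometry of the PIFS, we derive three optimal design criteria and show that four prominent empirical design choices (the cosine schedule offset, resolution-dependent logSNR shift, Min-SNR loss weighting, and Align Your Steps sampling) each arise as approximate solutions to our explicit geometric optimization problems.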
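The abstract refers to the Kaplan-Yorke dimension of an attractor computed from a Lyapunov spectrum. As a rough illustration only, the sketch below implements the standard textbook Kaplan-Yorke formula, D = k + (λ_1 + ... + λ_k) / |λ_{k+1}|, where k is the largest index for which the partial sum of the sorted exponents is non-negative. It is not the paper's discrete Moran equation, which this page does not reproduce. The function name `kaplan_yorke_dimension` and the example spectrum are assumptions made for illustration.

```python
from typing import Sequence


def kaplan_yorke_dimension(lyapunov_exponents: Sequence[float]) -> float:
    """Standard Kaplan-Yorke (Lyapunov) dimension of a spectrum.

    Hypothetical helper for illustration; not taken from the paper.
    """
    # Sort exponents from largest to smallest.
    exps = sorted(lyapunov_exponents, reverse=True)

    partial_sum = 0.0
    for k, lam in enumerate(exps):
        # Stop at the first exponent that would make the running sum negative.
        if partial_sum + lam < 0.0:
            # k exponents have a non-negative sum; interpolate with the next one.
            # lam is strictly negative here, so the division is safe.
            return k + partial_sum / abs(lam)
        partial_sum += lam

    # The sum never becomes negative: the dimension saturates at the full count.
    return float(len(exps))


# Made-up spectrum: one expanding, one neutral, one contracting direction.
print(kaplan_yorke_dimension([1.0, 0.0, -2.0]))  # → 2.5
```

A spectrum with only negative exponents gives 0.0 under this formula, and a spectrum whose partial sums never turn negative gives the number of exponents.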

Related papers

Breaking the Bottlenecks: Scalable Diffusion Models for 3D Molecular Generation [0.0]
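Diffusion models have emerged as a powerful class of generative models for molecular design. Their use is limited by long sampling trajectories, variance in the reverse process, and limited structural awareness in the denoising dynamics. Directly Denoising Diffusion Models mitigate these inefficiencies by replacing reverse MCMC updates with deterministic denoising steps.
Paper reference translation (metadata) (2026-01-13T20:09:44Z)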
SH-SAS: An Implicit Neural Representation for Complex Spherical-Harmonic Scattering Fields for 3D Synthetic Aperture Sonar [10.13553727839228]
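This paper introduces SH-SAS, an implicit neural representation that expresses complex acoustic scattering fields as a set of spherical-harmonic coefficients. The results show that SH-SAS outperforms previous methods in 3D reconstruction quality and geometric metrics.
Paper reference translation (metadata) (2025-09-14T04:29:28Z)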
Diffusion Models for Solving Inverse Problems via Posterior Sampling with Piecewise Guidance [52.705112811734566]
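A new diffusion-based framework for solving inverse problems with a piecewise guidance scheme is introduced. The proposed method is problem-agnostic and readily adaptable to a variety of inverse problems. The framework reduces inference time by 25% for inpainting with random masks, and by 23% and 24% for 4x and 8x super-resolution tasks, respectively.
Paper reference translation (metadata) (2025-07-22T19:35:14Z)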
Diffusion Models With Learned Adaptive Noise [12.530583016267768]
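This paper examines whether the diffusion process can be learned from data. A widely held assumption is that the ELBO is invariant to the noise process. We propose MULAN, a learned diffusion process that applies noise at different rates across an image.
Paper reference translation (metadata) (2023-12-20T18:00:16Z)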
SE(3) Diffusion Model-based Point Cloud Registration for Robust 6D Object Pose Estimation [66.16525145765604]
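We propose an SE(3) diffusion model-based point cloud registration framework for 6D object pose estimation in real-world scenarios. The approach formulates the 3D registration task as a denoising diffusion process that progressively refines the pose of the source point cloud. On the real-world TUD-L, LINEMOD, and Occluded-LINEMOD datasets, the diffusion registration framework shows remarkable pose estimation performance.
Paper reference translation (metadata) (2023-10-26T12:47:26Z)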
A Variational Perspective on Solving Inverse Problems with Diffusion Models [101.831766524264]
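Inverse tasks can be formulated as inferring a posterior distribution over data. This is challenging for diffusion models, however, because the nonlinear and iterative nature of the diffusion process makes the posterior intractable. We therefore propose an approach that approximates the true posterior distribution by design.
Paper reference translation (metadata) (2023-05-07T23:00:47Z)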
Score-based Diffusion Models in Function Space [137.70916238028306]
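Diffusion models have recently emerged as a powerful framework for generative modeling. This work introduces a mathematically rigorous framework called Denoising Diffusion Operators (DDO) for training diffusion models in function space. We show that the corresponding discretized algorithm generates accurate samples at a fixed cost that is independent of the data resolution.
Paper reference translation (metadata) (2023-02-14T23:50:53Z)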

The related papers list is generated automatically from the titles and abstracts of papers on this site.

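This page shows information for the specified paper.
The operator of this site does not guarantee the quality of this site (including all information and translations) and accepts no responsibility for any results arising from the use of this site (including all information and translations).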