Fugu-MT 論文翻訳(概要): Hierarchical Koopman Diffusion: Fast Generation with Interpretable Diffusion Trajectory

論文の概要: Hierarchical Koopman Diffusion: Fast Generation with Interpretable Diffusion Trajectory

arxiv url: http://arxiv.org/abs/2510.12220v1
Date: Tue, 14 Oct 2025 07:17:35 GMT
ステータス: 翻訳完了
システム内更新日: 2025-10-15 21:19:14.976446
Title: Hierarchical Koopman Diffusion: Fast Generation with Interpretable Diffusion Trajectory
Title（参考訳）: 階層的クープマン拡散:解釈的拡散軌道による高速発生
Authors: Hanru Bai, Weiyang Ding, Difan Zou,
Abstract要約: textbfHierarchical Koopman Diffusionは、一段階のサンプリングと解釈可能な生成軌道の両方を達成する新しいフレームワークである。我々のフレームワークは,拡散モデルにおける高速サンプリングと解釈可能性のギャップを埋め,生成モデルにおける説明可能な画像合成の道を開く。
参考スコア（独自算出の注目度）: 30.327899232038863
License: http://creativecommons.org/licenses/by-sa/4.0/
Abstract: Diffusion models have achieved impressive success in high-fidelity image generation but suffer from slow sampling due to their inherently iterative denoising process. While recent one-step methods accelerate inference by learning direct noise-to-image mappings, they sacrifice the interpretability and fine-grained control intrinsic to diffusion dynamics, key advantages that enable applications like editable generation. To resolve this dichotomy, we introduce \textbf{Hierarchical Koopman Diffusion}, a novel framework that achieves both one-step sampling and interpretable generative trajectories. Grounded in Koopman operator theory, our method lifts the nonlinear diffusion dynamics into a latent space where evolution is governed by globally linear operators, enabling closed-form trajectory solutions. This formulation not only eliminates iterative sampling but also provides full access to intermediate states, allowing manual intervention during generation. To model the multi-scale nature of images, we design a hierarchical architecture that disentangles generative dynamics across spatial resolutions via scale-specific Koopman subspaces, capturing coarse-to-fine details systematically. We empirically show that the Hierarchical Koopman Diffusion not only achieves competitive one-step generation performance but also provides a principled mechanism for interpreting and manipulating the generative process through spectral analysis. Our framework bridges the gap between fast sampling and interpretability in diffusion models, paving the way for explainable image synthesis in generative modeling.
Abstract（参考訳）: 拡散モデルは高忠実度画像生成において顕著に成功したが、本質的に反復的なデノナイジングプロセスのためにサンプリングが遅い。最近のワンステップ手法は直接ノイズ・ツー・イメージマッピングを学習することで推論を加速するが、編集可能な生成のようなアプリケーションを可能にする重要な利点である拡散力学に固有の解釈可能性ときめ細かい制御を犠牲にする。この二分法を解くために,一段階のサンプリングと解釈可能な生成軌道を両立させる新しいフレームワークである‘textbf{Hierarchical Koopman Diffusion} を導入する。クープマン作用素理論に基づいて、この手法は非線形拡散力学を、大域的線形作用素によって進化が支配される潜在空間に持ち上げ、閉形式の軌道解を可能にする。この定式化は反復サンプリングを除去するだけでなく、中間状態への完全なアクセスを提供し、生成時の手動介入を可能にする。画像のマルチスケールな性質をモデル化するために,空間分解における生成力学をスケール特異的なクープマン部分空間で切り離す階層的アーキテクチャを設計し,粗大から細小までを体系的に捉えた。階層的クープマン拡散は競合するワンステップ生成性能を達成するだけでなく、スペクトル解析によって生成過程を解釈・操作する原理的なメカニズムも提供することを実証的に示す。我々のフレームワークは,拡散モデルにおける高速サンプリングと解釈可能性のギャップを埋め,生成モデルにおける説明可能な画像合成の道を開く。

論文の概要: Hierarchical Koopman Diffusion: Fast Generation with Interpretable Diffusion Trajectory

関連論文リスト