Fugu-MT 論文翻訳(概要): Diffusion Bridge or Flow Matching? A Unifying Framework and Comparative Analysis

論文の概要: Diffusion Bridge or Flow Matching? A Unifying Framework and Comparative Analysis

arxiv url: http://arxiv.org/abs/2509.24531v1
Date: Mon, 29 Sep 2025 09:45:22 GMT
ステータス: 翻訳完了
システム内更新日: 2025-09-30 22:32:19.908108
Title: Diffusion Bridge or Flow Matching? A Unifying Framework and Comparative Analysis
Title（参考訳）: 拡散ブリッジとフローマッチング : 統一フレームワークと比較分析
Authors: Kaizhen Zhu, Mokai Pan, Zhechuan Yu, Jingya Wang, Jingyi Yu, Ye Shi,
Abstract要約: 拡散ブリッジとフローマッチングは、任意の分布間の変換において魅力的な経験的性能を示した。我々は,これらのフレームワークを最適制御のレンズを通して再キャストし,拡散橋のコスト関数が低いことを証明した。これらの理論的主張を裏付けるために,潜伏変圧器上に構築された拡散橋の新しい強力なアーキテクチャを提案する。
参考スコア（独自算出の注目度）: 57.614436689939986
License: http://creativecommons.org/licenses/by-nc-nd/4.0/
Abstract: Diffusion Bridge and Flow Matching have both demonstrated compelling empirical performance in transformation between arbitrary distributions. However, there remains confusion about which approach is generally preferable, and the substantial discrepancies in their modeling assumptions and practical implementations have hindered a unified theoretical account of their relative merits. We have, for the first time, provided a unified theoretical and experimental validation of these two models. We recast their frameworks through the lens of Stochastic Optimal Control and prove that the cost function of the Diffusion Bridge is lower, guiding the system toward more stable and natural trajectories. Simultaneously, from the perspective of Optimal Transport, interpolation coefficients $t$ and $1-t$ of Flow Matching become increasingly ineffective when the training data size is reduced. To corroborate these theoretical claims, we propose a novel, powerful architecture for Diffusion Bridge built on a latent Transformer, and implement a Flow Matching model with the same structure to enable a fair performance comparison in various experiments. Comprehensive experiments are conducted across Image Inpainting, Super-Resolution, Deblurring, Denoising, Translation, and Style Transfer tasks, systematically varying both the distributional discrepancy (different difficulty) and the training data size. Extensive empirical results align perfectly with our theoretical predictions and allow us to delineate the respective advantages and disadvantages of these two models. Our code is available at https://anonymous.4open.science/r/DBFM-3E8E/.
Abstract（参考訳）: 拡散ブリッジとフローマッチングは、任意の分布間の変換において魅力的な経験的性能を示した。しかし、どの手法が一般的に好まれるかについては混乱が残っており、モデリングの前提や実践的な実装における相当な相違により、相対的なメリットの統一的な理論的な説明が妨げられている。この2つのモデルの統一的な理論的および実験的検証を初めて提供しました。確率的最適制御(Stochastic Optimal Control)のレンズでそれらのフレームワークをリキャストし、拡散橋のコスト関数が低く、より安定で自然な軌道へと導くことを証明する。同時に、最適輸送の観点からは、トレーニングデータのサイズが小さくなると、補間係数$t$と1-t$のフローマッチングは効果が低下する。これらの理論的主張を裏付けるために,潜伏変圧器上に構築された拡散橋の新しい強力なアーキテクチャを提案し,様々な実験において公平な性能比較を可能にするために,同じ構造を持つフローマッチングモデルを実装した。 Image Inpainting、Super-Resolution、Deblurring、Denoising、Translation、Style Transferタスクにわたって総合的な実験を行い、分散不一致(差分困難)とトレーニングデータサイズの両方を体系的に変化させる。大規模な実験結果は、我々の理論的な予測と完全に一致し、これらの2つのモデルのそれぞれの利点と欠点を明確化することができる。私たちのコードはhttps://anonymous.4open.science/r/DBFM-3E8E/で利用可能です。

論文の概要: Diffusion Bridge or Flow Matching? A Unifying Framework and Comparative Analysis

関連論文リスト