Fugu-MT 論文翻訳(概要): Heterogeneous Decentralized Diffusion Models

論文の概要: Heterogeneous Decentralized Diffusion Models

arxiv url: http://arxiv.org/abs/2603.06741v1
Date: Fri, 06 Mar 2026 08:43:43 GMT
ステータス: 翻訳完了
システム内更新日: 2026-03-10 15:13:13.012769
Title: Heterogeneous Decentralized Diffusion Models
Title（参考訳）: 不均一分散拡散モデル
Authors: Zhiying Jiang, Raihan Seraj, Marcos Villagra, Bidhan Roy,
Abstract要約: フロンティアスケールの拡散モデルの訓練には、しばしば密結合クラスタに集中した相当な計算資源を必要とする。不均一なトレーニング目標をサポートしながら、リソース要求を削減できる効率的なフレームワークを提案する。
参考スコア（独自算出の注目度）: 11.120199309935435
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Training frontier-scale diffusion models often requires substantial computational resources concentrated in tightly coupled clusters, limiting participation to well-resourced institutions. While Decentralized Diffusion Models (DDM) enable training multiple experts in isolation, existing approaches require 1176 GPU-days and homogeneous training objectives across all experts. We present an efficient framework that reduces resource requirements while supporting heterogeneous training objectives. Our approach combines three contributions: (1) a heterogeneous decentralized training paradigm that allows experts to use different objectives (DDPM and Flow Matching), unified at inference time via a deterministic schedule-aware conversion into a common velocity space without retraining; (2) pretrained checkpoint conversion from ImageNet-DDPM to Flow Matching objectives, accelerating convergence and enabling initialization without objective-specific pretraining; and (3) PixArt-alpha's efficient AdaLN-Single architecture, reducing parameters while maintaining quality. Experiments on LAION-Aesthetics show that, relative to the training scale reported for prior DDM work, our approach reduces compute from 1176 to 72 GPU-days (16x) and data from 158M to 11M (14x). Under aligned inference settings, our heterogeneous 2DDPM:6FM configuration achieves better FID (11.88 vs. 12.45) and higher intra-prompt diversity (LPIPS 0.631 vs. 0.617) than the homogeneous 8FM baseline. By eliminating synchronization requirements and enabling mixed DDPM/FM objectives, our framework lowers infrastructure requirements for decentralized generative model training.
Abstract（参考訳）: フロンティアスケールの拡散モデルの訓練は、しばしば、密結合されたクラスタに集中し、十分なリソースを持つ機関への参加を制限する、実質的な計算資源を必要とする。 Decentralized Diffusion Models(DDM)は、複数の専門家を独立してトレーニング可能にする一方で、既存のアプローチでは、すべての専門家に対して1176のGPU日と均質なトレーニング目標が必要である。不均一なトレーニング目標をサポートしながら、リソース要求を削減できる効率的なフレームワークを提案する。提案手法は,(1)異なる目的 (DDPM と Flow Matching ) を専門家が利用し,決定論的スケジュールを意識した推論時間で再トレーニングせずに共通速度空間に統一するヘテロジニアスな分散トレーニングパラダイム,(2)ImageNet-DDPM から Flow Matching への事前学習チェックポイント変換,収束の促進と目標固有の事前トレーニングなしでの初期化を可能にすること,(3) PixArt-alpha の効率的な AdaLN-Single アーキテクチャ, 品質を維持しながらパラメーターを削減すること,の3つを組み合わせたものである。 LAION-Aestheticsの実験では、従来のDDM作業で報告されたトレーニングスケールと比較して、計算量は1176日から72GPU日(16x)に減少し、データは158Mから1M(14x)に減少した。不均一な2DDPM:6FM構成は、同質な8FMベースラインよりも優れたFID(11.88 vs. 12.45)と高いプロンプト内多様性(LPIPS 0.631 vs. 0.617)を達成する。本フレームワークは、同期要求を排除し、DDPM/FMの混在を可能とすることにより、分散生成モデルトレーニングのためのインフラ要件を低くする。

論文の概要: Heterogeneous Decentralized Diffusion Models

関連論文リスト