Fugu-MT 論文翻訳(概要): Improved Immiscible Diffusion: Accelerate Diffusion Training by Reducing Its Miscibility

論文の概要: Improved Immiscible Diffusion: Accelerate Diffusion Training by Reducing Its Miscibility

arxiv url: http://arxiv.org/abs/2505.18521v1
Date: Sat, 24 May 2025 05:38:35 GMT
ステータス: 翻訳完了
システム内更新日: 2025-05-27 16:58:42.474517
Title: Improved Immiscible Diffusion: Accelerate Diffusion Training by Reducing Its Miscibility
Title（参考訳）: 解離不能拡散の改善:解離性低下による解離促進訓練
Authors: Yiheng Li, Feng Liang, Dan Kondratyuk, Masayoshi Tomizuka, Kurt Keutzer, Chenfeng Xu,
Abstract要約: 不和がいかに軽視し、効率を向上するかを示す。そこで本稿では,K-nearest neighbor (KNN) ノイズ選択や画像スケーリングなどの実装のファミリを提案する。この研究は、今後の高効率拡散訓練研究の新たな方向性を確立する。
参考スコア（独自算出の注目度）: 62.272571285823595
License: http://creativecommons.org/licenses/by/4.0/
Abstract: The substantial training cost of diffusion models hinders their deployment. Immiscible Diffusion recently showed that reducing diffusion trajectory mixing in the noise space via linear assignment accelerates training by simplifying denoising. To extend immiscible diffusion beyond the inefficient linear assignment under high batch sizes and high dimensions, we refine this concept to a broader miscibility reduction at any layer and by any implementation. Specifically, we empirically demonstrate the bijective nature of the denoising process with respect to immiscible diffusion, ensuring its preservation of generative diversity. Moreover, we provide thorough analysis and show step-by-step how immiscibility eases denoising and improves efficiency. Extending beyond linear assignment, we propose a family of implementations including K-nearest neighbor (KNN) noise selection and image scaling to reduce miscibility, achieving up to >4x faster training across diverse models and tasks including unconditional/conditional generation, image editing, and robotics planning. Furthermore, our analysis of immiscibility offers a novel perspective on how optimal transport (OT) enhances diffusion training. By identifying trajectory miscibility as a fundamental bottleneck, we believe this work establishes a potentially new direction for future research into high-efficiency diffusion training. The code is available at https://github.com/yhli123/Immiscible-Diffusion.
Abstract（参考訳）: 拡散モデルのかなりの訓練コストは、その展開を妨げる。 Inmiscible Diffusionは、線形代入による雑音空間における拡散軌道混合の低減が、騒音の簡易化による訓練を加速することを示した。高いバッチサイズと高次元下での非効率線形代入を超えて非ミスシブル拡散を拡大するために、我々はこの概念を、どんな層でも、どんな実装でも、より広いミスシビリティ低減に洗練する。具体的には,不和性拡散に関する認知過程の客観的性質を実証的に実証し,生成多様性の維持を確実にする。さらに、我々は徹底的な分析を行い、不和の緩和と効率の向上をステップバイステップで示す。そこで本稿では,K-nearest neighbor(KNN)ノイズ選択や画像スケーリングなどの実装のファミリを提案し,非条件/条件生成,画像編集,ロボット計画など,さまざまなモデルやタスクに対する最大4倍高速なトレーニングを実現する。さらに, 最適輸送(OT)が拡散訓練をいかに促進するか, 新たな視点を提供する。軌跡ミスシビリティを基本的なボトルネックとすることで、この研究は、高効率拡散訓練の今後の研究の新たな方向性を確立する可能性があると信じている。コードはhttps://github.com/yhli123/Immiscible-Diffusionで公開されている。

論文の概要: Improved Immiscible Diffusion: Accelerate Diffusion Training by Reducing Its Miscibility

関連論文リスト