Fugu-MT 論文翻訳(概要): Navigating the Exploration-Exploitation Tradeoff in Inference-Time Scaling of Diffusion Models

論文の概要: Navigating the Exploration-Exploitation Tradeoff in Inference-Time Scaling of Diffusion Models

arxiv url: http://arxiv.org/abs/2508.12361v1
Date: Sun, 17 Aug 2025 13:35:38 GMT
ステータス: 翻訳完了
システム内更新日: 2025-08-19 14:49:10.704534
Title: Navigating the Exploration-Exploitation Tradeoff in Inference-Time Scaling of Diffusion Models
Title（参考訳）: 拡散モデルの推測時間スケーリングにおける探索・探索トレードオフのナビゲート
Authors: Xun Su, Jianming Huang, Yang Yusen, Zhongxi Fang, Hiroyuki Kasai,
Abstract要約: 推論時間スケーリングは言語モデルにおいて顕著に成功したが、拡散モデルへの適応は未解明のままである。スケジュールと適応温度の2つの戦略を提案する。提案手法は, 騒音評価の総数を増やすことなく, 試料品質を著しく向上させる。
参考スコア（独自算出の注目度）: 11.813933389519358
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Inference-time scaling has achieved remarkable success in language models, yet its adaptation to diffusion models remains underexplored. We observe that the efficacy of recent Sequential Monte Carlo (SMC)-based methods largely stems from globally fitting the The reward-tilted distribution, which inherently preserves diversity during multi-modal search. However, current applications of SMC to diffusion models face a fundamental dilemma: early-stage noise samples offer high potential for improvement but are difficult to evaluate accurately, whereas late-stage samples can be reliably assessed but are largely irreversible. To address this exploration-exploitation trade-off, we approach the problem from the perspective of the search algorithm and propose two strategies: Funnel Schedule and Adaptive Temperature. These simple yet effective methods are tailored to the unique generation dynamics and phase-transition behavior of diffusion models. By progressively reducing the number of maintained particles and down-weighting the influence of early-stage rewards, our methods significantly enhance sample quality without increasing the total number of Noise Function Evaluations. Experimental results on multiple benchmarks and state-of-the-art text-to-image diffusion models demonstrate that our approach outperforms previous baselines.
Abstract（参考訳）: 推論時間スケーリングは言語モデルにおいて顕著に成功したが、拡散モデルへの適応は未解明のままである。近年のSMC(Sequential Monte Carlo)に基づく手法の有効性は,マルチモーダル探索における多様性を本質的に保存する報奨型分布のグローバルな適合に大きく影響している。しかし、SMCの拡散モデルへの現在の応用は基本的なジレンマに直面している: 早期ノイズサンプルは改善の可能性が高いが、正確な評価は困難であるが、後期ノイズサンプルは確実に評価できるが、ほとんど不可逆である。この探索・探索トレードオフに対処するため,探索アルゴリズムの観点からこの問題にアプローチし,Funnel ScheduleとAdaptive Temperatureの2つの戦略を提案する。これらの単純で効果的な方法は拡散モデルのユニークな生成力学と相転移挙動に適合する。維持粒子数を段階的に減らし,早期報奨の影響を減らし,ノイズ関数評価の総数を増やすことなく試料品質を著しく向上させる。複数のベンチマークと最先端のテキスト・画像拡散モデルによる実験結果から,提案手法が従来のベースラインより優れていることが示された。

論文の概要: Navigating the Exploration-Exploitation Tradeoff in Inference-Time Scaling of Diffusion Models

関連論文リスト