Fugu-MT 論文翻訳(概要): FVD: Inference-Time Alignment of Diffusion Models via Fleming-Viot Resampling

論文の概要: FVD: Inference-Time Alignment of Diffusion Models via Fleming-Viot Resampling

arxiv url: http://arxiv.org/abs/2604.06779v1
Date: Wed, 08 Apr 2026 07:50:00 GMT
ステータス: 翻訳完了
システム内更新日: 2026-04-09 17:30:51.406932
Title: FVD: Inference-Time Alignment of Diffusion Models via Fleming-Viot Resampling
Title（参考訳）: FVD:Fleming-Viot再サンプリングによる拡散モデルの推論時間アライメント
Authors: Shivanshu Shekhar, Sagnik Mukherjee, Jia Yi Zhang, Tong Zhang,
Abstract要約: Fleming-Viot Diffusion (FVD) は, 連続モンテカルロ (SMC) を用いた拡散サンプリング装置でよく見られる多様性の崩壊を解決する推論時間アライメント法である。 FVDはフレミング・ヴィオットの人口動態にインスパイアされ、拡散アライメントのために設計された特別な生死機構によって、多項再サンプリングを置き換える。
参考スコア（独自算出の注目度）: 18.997552937110548
License: http://creativecommons.org/publicdomain/zero/1.0/
Abstract: We introduce Fleming-Viot Diffusion (FVD), an inference-time alignment method that resolves the diversity collapse commonly observed in Sequential Monte Carlo (SMC) based diffusion samplers. Existing SMC-based diffusion samplers often rely on multinomial resampling or closely related resampling schemes, which can still reduce diversity and lead to lineage collapse under strong selection pressure. Inspired by Fleming-Viot population dynamics, FVD replaces multinomial resampling with a specialized birth-death mechanism designed for diffusion alignment. To handle cases where rewards are only approximately available and naive rebirth would collapse deterministic trajectories, FVD integrates independent reward-based survival decisions with stochastic rebirth noise. This yields flexible population dynamics that preserve broader trajectory support while effectively exploring reward-tilted distributions, all without requiring value function approximation or costly rollouts. FVD is fully parallelizable and scales efficiently with inference compute. Empirically, it achieves substantial gains across settings: on DrawBench it outperforms prior methods by 7% in ImageReward, while on class-conditional tasks it improves FID by roughly 14-20% over strong baselines and is up to 66 times faster than value-based approaches.
Abstract（参考訳）: Fleming-Viot Diffusion (FVD) は, 連続モンテカルロ (SMC) を用いた拡散サンプリング装置でよく見られる多様性の崩壊を解決する推論時間アライメント法である。既存のSMCベースの拡散サンプリングは、しばしば多相再サンプリングや密接に関連する再サンプリング方式に依存しており、これは多様性を低減し、強い選択圧力下での系統崩壊につながる。 FVDはフレミング・ヴィオットの人口動態にインスパイアされ、拡散アライメントのために設計された特別な生死機構によって、多項再サンプリングを置き換える。報酬がほぼ利用可能であり、単純再生が決定論的軌道を崩壊させるようなケースに対処するため、FVDは独立報酬に基づく生存決定を確率的再生雑音と統合する。これにより、より広い軌道支持を維持しつつ、価値関数近似やコストのかかるロールアウトを必要とせず、報酬型分布を効果的に探索する柔軟な人口動態が得られる。 FVDは完全に並列化可能で、推論計算で効率よくスケールする。 DrawBenchでは、ImageRewardでは7%、クラス条件のタスクでは、強いベースラインよりも14～20%向上し、バリューベースのアプローチよりも66倍高速です。

論文の概要: FVD: Inference-Time Alignment of Diffusion Models via Fleming-Viot Resampling

関連論文リスト