Fugu-MT 論文翻訳(概要): Trust-Region Noise Search for Black-Box Alignment of Diffusion and Flow Models

論文の概要: Trust-Region Noise Search for Black-Box Alignment of Diffusion and Flow Models

arxiv url: http://arxiv.org/abs/2603.14504v1
Date: Sun, 15 Mar 2026 17:37:38 GMT
ステータス: 翻訳完了
システム内更新日: 2026-03-17 16:19:35.85677
Title: Trust-Region Noise Search for Black-Box Alignment of Diffusion and Flow Models
Title（参考訳）: 拡散・流れモデルのブラックボックスアライメントに対する信頼関係雑音探索
Authors: Niklas Schweiger, Daniel Cremers, Karnik Ram,
Abstract要約: 信頼領域に基づく検索アルゴリズム(TRS)は、事前訓練された生成モデルと報酬モデルをブラックボックスとして扱う。我々は,テキスト・ツー・イメージ,分子・タンパク質設計タスクにおけるTRSを評価し,出力サンプルを著しく改善した。
参考スコア（独自算出の注目度）: 46.98480905892642
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Optimizing the noise samples of diffusion and flow models is an increasingly popular approach to align these models to target rewards at inference time. However, we observe that these approaches are usually restricted to differentiable or cheap reward models, the formulation of the underlying pretrained generative model, or are memory/compute inefficient. We instead propose a simple trust-region based search algorithm (TRS) which treats the pre-trained generative and reward models as a black-box and only optimizes the source noise. Our approach achieves a good balance between global exploration and local exploitation, and is versatile and easily adaptable to various generative settings and reward models with minimal hyperparameter tuning. We evaluate TRS across text-to-image, molecule and protein design tasks, and obtain significantly improved output samples over the base generative models and other inference-time alignment approaches which optimize the source noise sample, or even the entire reverse-time sampling noise trajectories in the case of diffusion models. Our source code is publicly available.
Abstract（参考訳）: 拡散モデルと流れモデルのノイズサンプルを最適化することは、これらのモデルを推論時に報酬に合わせるために、ますます一般的なアプローチである。しかし、これらの手法は通常、微分可能または安価な報酬モデル、基礎となる事前学習生成モデルの定式化、あるいはメモリ/計算非効率に制限されている。そこで我々は,事前学習した生成モデルと報奨モデルをブラックボックスとして扱い,ソースノイズのみを最適化する簡易信頼領域探索アルゴリズム(TRS)を提案する。提案手法は,グローバルな探索と局所的利用のバランスが良好であり,様々な生成的設定や報酬モデルに適応しやすく,最小限のハイパーパラメータチューニングが可能である。我々は,テキスト・ツー・イメージ,分子・タンパク質設計タスクにおけるTRSを評価し,ソースノイズサンプルを最適化するベース生成モデルや他の推論時アライメントアプローチ,あるいは拡散モデルの場合の逆時間サンプリングノイズトラジェクトリ全体に対して,大幅に改善された出力サンプルを得る。私たちのソースコードは公開されています。

論文の概要: Trust-Region Noise Search for Black-Box Alignment of Diffusion and Flow Models

関連論文リスト