Fugu-MT 論文翻訳(概要): Sampling from Flow Language Models via Marginal-Conditioned Bridges

論文の概要: Sampling from Flow Language Models via Marginal-Conditioned Bridges

arxiv url: http://arxiv.org/abs/2605.13681v1
Date: Wed, 13 May 2026 15:38:48 GMT
ステータス: 翻訳完了
システム内更新日: 2026-05-14 23:30:28.145019
Title: Sampling from Flow Language Models via Marginal-Conditioned Bridges
Title（参考訳）: Marginal-Conditioned Bridgeによる流れ言語モデルからのサンプリング
Authors: Iskander Azangulov, Leo Zhang,
Abstract要約: フロー言語モデル (FLM) は、1ホットエンコードされたトークンシーケンスに対して連続的なフローマッチングを適用する。我々は、FLMの自然なサンプリングは、代わりに後述の予測であると主張する。誘導されたワンステップブリッジカーネルはトークン単位の後方予測限界を全て保持することを示す。
参考スコア（独自算出の注目度）: 1.43163477873944
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Flow Language Models (FLMs) are a recently introduced class of language models which adapt continuous flow matching for one-hot encoded token sequences. Their denoisers have a special structure absent from generic continuous diffusion models: each block of the denoising mean is a posterior marginal distribution over the clean token at that position. Standard DDPM-style samplers collapse these marginals to a single conditional-mean endpoint and bridge toward this simplex-valued point, which is generally not a valid one-hot sequence. We argue that the natural sampler for an FLM is instead posterior-predictive. At each reverse step, we sample a clean one-hot endpoint from the factorized posterior defined by the FLM token marginals, and then sample the next continuous state from the analytic Ornstein--Uhlenbeck bridge conditioned on that endpoint. The method is training-free, uses the same model evaluations as standard sampling, and gives a principled interface for token-level decoding controls such as temperature scaling and nucleus truncation. We show that, under exact posterior marginals, the endpoint approximation error is exactly the conditional multi-information among token positions. The induced one-step bridge kernel preserves all token-wise posterior-predictive marginals and loses only the residual cross-position dependence. Finally, we prove a Girsanov path-space comparison showing that the marginal-conditioned bridge has a no-larger denoising-error term than the frozen conditional-mean bridge, with strict improvement whenever intermediate coordinate-wise bridge observations reveal additional information about the clean token. Experiments with FLMs show that the sampler improves the quality--diversity tradeoff. Code is available at: github.com/imbirik/mcb.
Abstract（参考訳）: Flow Language Models (FLMs) は、1ホットエンコードされたトークンシーケンスに対して連続的なフローマッチングを適用する言語モデルのクラスである。それらのデノワは、一般的な連続拡散モデルに欠く特別な構造を持ち、デノナイジング平均の各ブロックは、その位置にあるクリーントークン上の後縁分布である。標準DDPM型サンプリング器は、これらの限界を単一の条件付き平均点に崩壊させ、この単純な点へ橋渡しする。我々は、FLMの自然なサンプリングは、代わりに後述の予測であると主張する。各逆ステップで、FLMトークン境界によって定義される因子化後端からクリーンな1ホットエンドポイントをサンプリングし、そのエンドポイントに条件付けられた解析的オルンシュタイン-ウレンベック橋から次の連続状態をサンプリングする。この方法はトレーニング不要であり、標準サンプリングと同じモデル評価を使用し、温度スケーリングや核切断といったトークンレベルの復号制御の原則的インターフェースを提供する。その結果, 終端近似誤差は, トークン位置間の条件付き多重情報であることがわかった。誘導されたワンステップブリッジカーネルはトークン単位の後方予測限界を全て保持し、残りのクロスポジション依存だけを失う。最後に, 境界条件付き橋は凍結条件付き平均橋よりも大きなデノージング・エラー項を有しており, 中間座標方向の橋梁観測がクリーントークンに関する追加情報を明らかにすると, 厳密な改善が期待できることを示す。 FLMによる実験では、サンプルは品質、多様性のトレードオフを改善する。コードは、github.com/imbirik/mcbで入手できる。

論文の概要: Sampling from Flow Language Models via Marginal-Conditioned Bridges

関連論文リスト