Fugu-MT paper translation (abstract): Lookahead Unmasking Elicits Accurate Decoding in Diffusion Language Models

Paper abstract: Lookahead Unmasking Elicits Accurate Decoding in Diffusion Language Models

arxiv url: http://arxiv.org/abs/2511.05563v1
Date: Tue, 04 Nov 2025 02:37:37 GMT
Status: Translation complete
System update date: 2025-11-11 21:18:44.44539
Title: Lookahead Unmasking Elicits Accurate Decoding in Diffusion Language Models
Title (reference translation): Lookahead Unmasking Elicits Accurate Decoding in Diffusion Language Models (English)
Authors: Sanghyun Lee, Seungryong Kim, Jongho Park, Dongmin Park
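Abstract summary: Masked Diffusion Models (MDMs) are language models that generate by iteratively unmasking tokens, but their performance depends on the inference-time order of unmasking. The proposed Lookahead Unmasking (LookUM) addresses this by reformulating sampling as path selection over all possible unmasking orders. LookUM requires only two to three paths to achieve peak performance, demonstrating remarkably efficient path selection.
Reference score (independently computed attention score): 51.12873073612084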
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Masked Diffusion Models (MDMs) as language models generate by iteratively unmasking tokens, yet their performance crucially depends on the inference-time order of unmasking. Prevailing heuristics, such as confidence-based sampling, are myopic: they optimize locally, fail to leverage extra test-time compute, and let early decoding mistakes cascade. We propose Lookahead Unmasking (LookUM), which addresses these concerns by reformulating sampling as path selection over all possible unmasking orders without the need for an external reward model. Our framework couples (i) a path generator that proposes paths by sampling from pools of unmasking sets with (ii) a verifier that computes the uncertainty of the proposed paths and performs importance sampling to subsequently select the final paths. Empirically, erroneous unmasking measurably inflates sequence-level uncertainty, and our method exploits this to avoid error-prone trajectories. We validate our framework across six benchmarks, such as mathematics, planning, and coding, and demonstrate consistent performance improvements. LookUM requires only two to three paths to achieve peak performance, demonstrating remarkably efficient path selection. The consistent improvements on both LLaDA and post-trained LLaDA 1.5 are particularly striking: base LLaDA with LookUM rivals the performance of RL-tuned LLaDA 1.5, while LookUM further enhances LLaDA 1.5 itself, showing that uncertainty-based verification provides orthogonal benefits to reinforcement learning and underscoring the versatility of our framework. Code will be publicly released.
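Abstract (reference translation): Masked Diffusion Models (MDMs) are language models that generate by iteratively unmasking tokens, but their performance depends heavily on the inference-time order of unmasking. Prevailing heuristics such as confidence-based sampling are myopic: they optimize locally, cannot make use of extra test-time compute, and let early decoding mistakes cascade. The proposed Lookahead Unmasking (LookUM) addresses these problems by reformulating sampling as path selection over all possible unmasking orders, without requiring an external reward model. The framework couples (i) a path generator that proposes paths by sampling from pools of unmasking sets with (ii) a verifier that computes the uncertainty of the proposed paths and then performs importance sampling to select the final paths. Empirically, erroneous unmasking increases sequence-level uncertainty, and the method exploits this to avoid error-prone trajectories. The framework is validated on six benchmarks spanning mathematics, planning, and coding, and shows consistent performance improvements. LookUM requires only two to three paths to reach peak performance, demonstrating remarkably efficient path selection. Base LLaDA with LookUM rivals the performance of RL-tuned LLaDA 1.5, while LookUM further enhances LLaDA 1.5 itself, showing that uncertainty-based verification provides benefits orthogonal to reinforcement learning and underscoring the versatility of the framework. Code will be publicly released.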
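
The generator/verifier loop described in the abstract can be illustrated with a toy sketch. This is not the authors' implementation: the stand-in model `toy_predict`, the uncertainty measure (summed entropy over still-masked positions), the pool construction (most confident masked positions), the weighting `exp(-uncertainty / temperature)`, and every parameter name are assumptions chosen only to make the idea concrete.

```python
import math
import random

MASK = None  # placeholder for a masked position


def toy_predict(seq, vocab_size=4):
    """Hypothetical stand-in for an MDM forward pass (an assumption, not a real model).

    Returns one probability distribution per position. A masked position's
    distribution sharpens as more of its neighbours are unmasked.
    """
    probs = []
    for i, tok in enumerate(seq):
        if tok is not MASK:
            probs.append([1.0 if v == tok else 0.0 for v in range(vocab_size)])
            continue
        known = sum(1 for j in (i - 1, i + 1)
                    if 0 <= j < len(seq) and seq[j] is not MASK)
        logits = [(1.0 + known) if v == i % vocab_size else 0.0
                  for v in range(vocab_size)]
        z = sum(math.exp(x) for x in logits)
        probs.append([math.exp(x) / z for x in logits])
    return probs


def entropy(p):
    return -sum(x * math.log(x) for x in p if x > 0.0)


def sequence_uncertainty(seq):
    """Assumed uncertainty score: summed entropy over still-masked positions."""
    probs = toy_predict(seq)
    return sum(entropy(probs[i]) for i, tok in enumerate(seq) if tok is MASK)


def propose_paths(seq, num_paths, set_size, pool_size, rng):
    """Path generator: sample unmasking sets from a pool of confident positions."""
    probs = toy_predict(seq)
    masked = [i for i, tok in enumerate(seq) if tok is MASK]
    pool = sorted(masked, key=lambda i: max(probs[i]), reverse=True)[:pool_size]
    paths = []
    for _ in range(num_paths):
        chosen = rng.sample(pool, min(set_size, len(pool)))
        candidate = list(seq)
        for i in chosen:
            # Fill each chosen position with its most likely token.
            candidate[i] = max(range(len(probs[i])), key=probs[i].__getitem__)
        paths.append(candidate)
    return paths


def select_path(paths, temperature, rng):
    """Verifier: weight paths by exp(-uncertainty / T), then sample one."""
    scores = [sequence_uncertainty(p) for p in paths]
    low = min(scores)  # subtract the minimum for numerical stability
    weights = [math.exp(-(s - low) / temperature) for s in scores]
    return rng.choices(paths, weights=weights, k=1)[0]


def lookahead_decode(length=8, num_paths=3, set_size=2, pool_size=4,
                     temperature=0.5, seed=0):
    rng = random.Random(seed)
    seq = [MASK] * length
    while any(tok is MASK for tok in seq):
        paths = propose_paths(seq, num_paths, set_size, pool_size, rng)
        seq = select_path(paths, temperature, rng)
    return seq
```

Calling `lookahead_decode()` unmasks at least one position per step and returns a fully decoded list. In this toy the most likely token at position `i` is always `i % vocab_size`, so only the unmasking order varies between paths, which is the quantity the verifier is meant to choose. The default `num_paths=3` mirrors the abstract's remark that two to three paths suffice, but nothing here reproduces the paper's results.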


Related papers