Fugu-MT Paper Translation (Summary): Prism: Efficient Test-Time Scaling via Hierarchical Search and Self-Verification for Discrete Diffusion Language Models

Paper summary: Prism: Efficient Test-Time Scaling via Hierarchical Search and Self-Verification for Discrete Diffusion Language Models

arxiv url: http://arxiv.org/abs/2602.01842v1
Date: Mon, 02 Feb 2026 09:14:51 GMT
Status: Translation complete
Last updated in system: 2026-02-03 19:28:34.030429
Title: Prism: Efficient Test-Time Scaling via Hierarchical Search and Self-Verification for Discrete Diffusion Language Models
Title (reference translation): Prism: Efficient Test-Time Scaling via Hierarchical Search and Self-Verification for Discrete Diffusion Language Models
Authors: Jinbin Bai, Yixuan Li, Yuchen Zhu, Yi Xin, Qingyu Shi, Aosong Feng, Xiaohong Liu, Molei Tao, Jianru Xue, Xiangtai Li, Ming-Hsuan Yang
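Abstract summary: Inference-time compute has re-emerged as a practical way to improve LLM reasoning. Most test-time scaling (TTS) algorithms rely on autoregressive decoding. To address this, we propose Prism, an efficient TTS framework for dLLMs.
Reference score (independently computed attention score): 96.0074341403456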
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Inference-time compute has re-emerged as a practical way to improve LLM reasoning. Most test-time scaling (TTS) algorithms rely on autoregressive decoding, which is ill-suited to discrete diffusion language models (dLLMs) due to their parallel decoding over the entire sequence. As a result, developing effective and efficient TTS methods to unlock dLLMs' full generative potential remains an underexplored challenge. To address this, we propose Prism (Pruning, Remasking, and Integrated Self-verification Method), an efficient TTS framework for dLLMs that (i) performs Hierarchical Trajectory Search (HTS) which dynamically prunes and reallocates compute in an early-to-mid denoising window, (ii) introduces Local branching with partial remasking to explore diverse implementations while preserving high-confidence tokens, and (iii) replaces external verifiers with Self-Verified Feedback (SVF) obtained via self-evaluation prompts on intermediate completions. Across four mathematical reasoning and code generation benchmarks on three dLLMs, including LLaDA 8B Instruct, Dream 7B Instruct, and LLaDA 2.0-mini, our Prism achieves a favorable performance-efficiency trade-off, matching best-of-N performance with substantially fewer function evaluations (NFE). The code is released at https://github.com/viiika/Prism.
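Abstract (reference translation): Inference-time compute has re-emerged as a practical way to improve LLM reasoning. Most test-time scaling (TTS) algorithms rely on autoregressive decoding, which is ill-suited to discrete diffusion language models (dLLMs) because they decode the entire sequence in parallel. As a result, developing effective and efficient TTS methods that unlock the full generative potential of dLLMs remains an underexplored challenge. To address this, we propose Prism (Pruning, Remasking, and Integrated Self-verification Method), an efficient TTS framework for dLLMs that (i) performs Hierarchical Trajectory Search (HTS), which dynamically prunes and reallocates compute in an early-to-mid denoising window, (ii) introduces local branching with partial remasking to explore diverse implementations while preserving high-confidence tokens, and (iii) replaces external verifiers with Self-Verified Feedback (SVF) obtained via self-evaluation prompts on intermediate completions. Across four mathematical reasoning and code generation benchmarks on three dLLMs (LLaDA 8B Instruct, Dream 7B Instruct, and LLaDA 2.0-mini), Prism achieves a favorable performance-efficiency trade-off, matching best-of-N performance with substantially fewer function evaluations (NFE). The code is released at https://github.com/viiika/Prism.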
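
Illustrative sketch (not taken from the paper or its repository): the abstract mentions pruning candidate trajectories and branching with partial remasking that preserves high-confidence tokens. The toy Python below shows only those two ideas on plain lists. The names `partial_remask`, `prune_top_k`, and `MASK`, the fixed confidence threshold, and the example scores are all assumptions made for illustration; the actual Prism procedure, its denoising schedule, and its self-verification prompts may differ substantially.

```python
from typing import List, Optional, Sequence

# Hypothetical stand-in for a mask token; a real dLLM uses a tokenizer-specific id.
MASK: Optional[str] = None


def partial_remask(
    tokens: Sequence[str], confidences: Sequence[float], threshold: float
) -> List[Optional[str]]:
    """Keep tokens whose confidence is >= threshold and remask the rest."""
    if len(tokens) != len(confidences):
        raise ValueError("tokens and confidences must have the same length")
    return [
        tok if conf >= threshold else MASK
        for tok, conf in zip(tokens, confidences)
    ]


def prune_top_k(
    candidates: Sequence[Sequence[str]], scores: Sequence[float], k: int
) -> List[Sequence[str]]:
    """Keep the k highest-scoring candidates, returned in their original order."""
    if len(candidates) != len(scores):
        raise ValueError("candidates and scores must have the same length")
    # Stable sort: ties keep their original relative order.
    ranked = sorted(range(len(candidates)), key=lambda i: scores[i], reverse=True)
    return [candidates[i] for i in sorted(ranked[:k])]


# Toy usage: scores stand in for some self-evaluation signal (assumed, not real).
candidates = [["a", "b"], ["c", "d"], ["e", "f"]]
survivors = prune_top_k(candidates, scores=[0.2, 0.9, 0.5], k=2)
branch = partial_remask(
    ["def", "f", "(", "x", ")"], [0.9, 0.4, 0.95, 0.3, 0.8], threshold=0.5
)
```

In this sketch a remasked position would be re-predicted by the model in later denoising steps, while the preserved positions stay fixed; how many positions to remask and when to prune are design choices the abstract does not specify.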

Related papers