Fugu-MT 論文翻訳(概要): Lost in Decoding? Reproducing and Stress-Testing the Look-Ahead Prior in Generative Retrieval

論文の概要: Lost in Decoding? Reproducing and Stress-Testing the Look-Ahead Prior in Generative Retrieval

arxiv url: http://arxiv.org/abs/2604.23396v1
Date: Sat, 25 Apr 2026 17:58:15 GMT
ステータス: 翻訳完了
システム内更新日: 2026-04-28 17:12:07.313408
Title: Lost in Decoding? Reproducing and Stress-Testing the Look-Ahead Prior in Generative Retrieval
Title（参考訳）: 復号化における損失 : 生成的検索に先立つルックアヘッドの再生とストレステスト
Authors: Kidist Amde Mekonnen, Yongkang Li, Yubao Tang, Simon Lupart, Maarten de Rijke,
Abstract要約: 生成検索(GR)は、文書識別子を自動回帰生成することで文書をランク付けする。多くのGR法は三重遅延ビームサーチに依存しており、有限ビームデコードの下で関連する接頭辞の早期刈り取りに弱い。生成検索の計画 Ahead in Generative Retrieval (PAG) は、同時復号法を用いて、その後の逐次復号を導く文書レベルのルックアヘッドを計算することで、この障害モードを緩和する。
参考スコア（独自算出の注目度）: 48.348257016776195
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Generative retrieval (GR) ranks documents by autoregressively generating document identifiers. Because many GR methods rely on trie-constrained beam search, they are vulnerable to early pruning of relevant prefixes under finite-beam decoding. Planning Ahead in Generative Retrieval (PAG) mitigates this failure mode by using simultaneous decoding to compute a document-level look-ahead prior that guides subsequent sequential decoding. We reproduce PAG at inference time and stress-test its decoding behavior. Using the authors' released checkpoint and identifier/trie artifacts under the reported decoding setup, we reproduce the main effectiveness results on MS MARCO Dev and TREC-DL 2019/2020, and corroborate the reported beam-size-latency trade-off in our hardware setting. Beyond reproduction, we introduce plan drift diagnostics that quantify how intent-preserving query variations alter the planner's top-n candidate set and highest-weight planner tokens, and how these changes affect guided decoding. We find that PAG's planning signal is brittle under lexical surface-form variation: intent-preserving typos can trigger plan collapse, where the planned candidate pool shifts enough that the look-ahead bonus provides little useful guidance, effectively reverting decoding toward weaker unguided search. We further evaluate fixed-index cross-lingual robustness using non-English mMARCO queries against an English index, and assess query-side mitigation strategies that require no re-indexing; query translation provides the strongest recovery in our setting. Overall, our results confirm PAG's reported effectiveness and the benefit of planning-guided decoding under the released inference setup, while showing that these gains depend on the stability of the planning signal under realistic query variation and query-document mismatch.
Abstract（参考訳）: 生成検索(GR)は、文書識別子を自動回帰生成することで文書をランク付けする。多くのGR法は三重拘束ビームサーチに依存しているため、有限ビームデコードの下で関連する接頭辞の早期刈り取りに弱い。生成検索の計画 Ahead in Generative Retrieval (PAG) は、同時復号法を用いて、その後の逐次復号を導く文書レベルのルックアヘッドを計算することで、この障害モードを緩和する。我々は推論時にPAGを再生し、その復号動作をストレステストする。本報告では,MS MARCO DevとTREC-DL 2019/2020の主な効果を再現し,本ハードウェア設定におけるビームサイズ・遅延トレードオフの相関付けを行う。再生以外にも、意図保存クエリのバリエーションがプランナーの上位n候補セットと最上位プランナートークンをどのように変え、これらの変更がガイド付きデコーディングにどのように影響するかを定量化するプランドリフト診断を導入する。意図保存型タイポは計画崩壊を引き起こすことができ、計画された候補プールが十分にシフトし、ルックアヘッドボーナスはほとんど有用なガイダンスが得られず、事実上復号化はより弱い未案内探索に還元される。さらに、英語のmMARCOクエリを英語のインデックスに対して使用し、再インデックスを必要としないクエリ側緩和戦略を評価する。本研究は,提案手法の有効性と,提案した推論環境下での計画誘導復号化の利点を検証し,現実的なクエリ変動やクエリ文書ミスマッチによる計画信号の安定性に依存していることを示した。

論文の概要: Lost in Decoding? Reproducing and Stress-Testing the Look-Ahead Prior in Generative Retrieval

関連論文リスト