Fugu-MT 論文翻訳(概要): Retrieval-in-the-Chain: Bootstrapping Large Language Models for Generative Retrieval

論文の概要: Retrieval-in-the-Chain: Bootstrapping Large Language Models for Generative Retrieval

arxiv url: http://arxiv.org/abs/2510.13095v1
Date: Wed, 15 Oct 2025 02:29:10 GMT
ステータス: 翻訳完了
システム内更新日: 2025-10-16 20:13:28.476656
Title: Retrieval-in-the-Chain: Bootstrapping Large Language Models for Generative Retrieval
Title（参考訳）: Retrieval-in-the-Chain:生成検索のための大規模言語モデルのブートストラップ
Authors: Yingchen zhang, Ruqing zhang, Jiafeng Guo, Wenjun Peng, Sen Li, Fuyu Lv,
Abstract要約: 生成検索のための推論強化フレームワークであるReason-for-Retrieval(R4R)を提案する。 R4Rは、自由形式のチェーン・オブ・シークレット(CoT)推論をコンパクトで構造化されたフォーマットに変換し、検索プロセス中の推論を反復的に洗練する。自然質問、MS MARCO、および実世界の項目探索ベンチマークに関する広範囲な実験により、R4Rの有効性が検証された。
参考スコア（独自算出の注目度）: 40.35703097974511
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Generative retrieval (GR) is an emerging paradigm that leverages large language models (LLMs) to autoregressively generate document identifiers (docids) relevant to a given query. Prior works have focused on leveraging the generative capabilities of LLMs to improve GR, while overlooking that their reasoning capabilities could likewise help. This raises a key question: Can explicit reasoning benefit GR? To investigate, we first conduct a preliminary study where an LLM is prompted to generate free-form chain-of-thought (CoT) reasoning before performing constrained docid decoding. Although this method outperforms standard GR, the generated reasoning tends to be verbose and poorly aligned with the docid space. These limitations motivate the development of a reasoning mechanism better tailored to GR. Therefore, we propose Reason-for-Retrieval (R4R), a reasoning-augmented framework for GR that converts free-form CoT reasoning into a compact, structured format, and iteratively refines the reasoning during the retrieval process. R4R augments an existing GR method by leveraging a reasoning-capable LLM that has been instruction-tuned for GR. At inference time, R4R first uses the LLM to generate an initial structured reasoning; then the same LLM alternates between (i) constrained decoding with the chosen GR method to produce candidate docids and (ii) updating the reasoning based on retrieval results to improve the next round. R4R does not require additional models or training, and instead a single LLM serves as both the reasoning generator and the retriever. Extensive experiments on Natural Questions, MS MARCO, and a real-world item-search benchmark validate the effectiveness of R4R.
Abstract（参考訳）: 生成検索(GR)は、大規模言語モデル(LLM)を利用して、与えられたクエリに関連する文書識別子(ドシデント)を自動回帰的に生成する新興パラダイムである。以前の研究は、GRを改善するためにLLMの生成能力を活用することに注力し、それらの推論能力が同様に役立つことを見落としていた。これは重要な疑問を提起する。明確な推論はGRに利益をもたらすか? 本研究はまず,LLMが制約ドシッド復号を行う前に,自由形連鎖(CoT)推論を生成するための予備的研究を行う。この方法は標準GRより優れているが、生成された推論は冗長であり、ドシド空間と不整合である傾向がある。これらの制限はGRに合わせた推論機構の開発を動機付けている。そこで我々は,自由形式のCoT推論をコンパクトで構造化された形式に変換し,検索過程の推論を反復的に洗練するGR用推論拡張フレームワークであるReason-for-Retrieval (R4R)を提案する。 R4Rは、GRのために命令調整された推論可能なLLMを活用することで、既存のGRメソッドを拡張している。推論時、R4R はまず LLM を使用して初期構造推論を生成し、次に同じ LLM を交互に置き換える。 (i)選択したGR法で制約付き復号を行い、候補ドシデントを生成し、二検索結果に基づいて推論を更新し、次ラウンドを改善すること。 R4Rは追加のモデルや訓練を必要とせず、代わりに単一のLCMが推論ジェネレータとレトリバーの両方として機能する。自然質問、MS MARCO、および実世界の項目探索ベンチマークに関する広範囲な実験により、R4Rの有効性が検証された。

論文の概要: Retrieval-in-the-Chain: Bootstrapping Large Language Models for Generative Retrieval

関連論文リスト