Fugu-MT 論文翻訳(概要): Document Optimization for Black-Box Retrieval via Reinforcement Learning

論文の概要: Document Optimization for Black-Box Retrieval via Reinforcement Learning

arxiv url: http://arxiv.org/abs/2604.05087v1
Date: Mon, 06 Apr 2026 18:41:40 GMT
ステータス: 翻訳完了
システム内更新日: 2026-04-08 17:42:09.443782
Title: Document Optimization for Black-Box Retrieval via Reinforcement Learning
Title（参考訳）: 強化学習によるブラックボックス検索のための文書最適化
Authors: Omri Uzan, Ron Polonsky, Douwe Kiela, Christopher Potts,
Abstract要約: 我々は文書拡張を文書最適化問題として再放送する。言語モデルまたはビジョン言語モデルは、ドキュメントを、ターゲットレトリバーの下で期待されるクエリ分布とよりよく整合した表現に変換するように微調整される。学習した文書変換が検索ゲインとなり、多くの設定において、より小さく、より効率的な検索者がより大きな文書変換を上回ります。
参考スコア（独自算出の注目度）: 34.56617313585179
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Document expansion is a classical technique for improving retrieval quality, and is attractive since it shifts computation offline, avoiding additional query-time processing. However, when applied to modern retrievers, it has been shown to degrade performance, often introducing noise that obfuscates the discriminative signal. We recast document expansion as a document optimization problem: a language model or a vision language model is fine-tuned to transform documents into representations that better align with the expected query distribution under a target retriever, using GRPO with the retriever's ranking improvements as rewards. This approach requires only black-box access to retrieval ranks, and is applicable across single-vector, multi-vector and lexical retrievers. We evaluate our approach on code retrieval and visual document retrieval (VDR) tasks. We find that learned document transformations yield retrieval gains and in many settings enable smaller, more efficient retrievers to outperform larger ones. For example, applying document optimization to OpenAI text-embedding-3-small model improves nDCG5 on code (58.7 to 66.8) and VDR (53.3 to 57.6), even slightly surpassing the 6.5X more expensive OpenAI text-embedding-3-large model (66.3 on code; 57.0 on VDR). When retriever weights are accessible, document optimization is often competitive with fine-tuning, and in most settings their combination performs best, improving Jina-ColBERT-V2 from 55.8 to 63.3 on VDR and from 48.6 to 61.8 on code retrieval.
Abstract（参考訳）: ドキュメント拡張は、検索品質を改善するための古典的なテクニックであり、追加のクエリ時間処理を回避して、計算をオフラインにするので魅力的である。しかし、現代のレトリバーに適用すると性能が低下することが示され、しばしば識別シグナルを難読化するノイズがもたらされる。我々は文書拡張を文書最適化問題として再キャストする: 言語モデルまたは視覚言語モデルは、文書を、検索対象の検索者の下で期待されるクエリ分布によく適合する表現に変換するために微調整され、GRPOと検索者のランキング改善を報酬として利用する。このアプローチでは、検索ランクへのブラックボックスアクセスしか必要とせず、シングルベクタ、マルチベクタ、レキシカルレトリバーに適用される。コード検索と視覚文書検索(VDR)タスクに対する我々のアプローチを評価する。学習した文書変換が検索ゲインとなり、多くの設定において、より小さく、より効率的な検索者がより大きな文書変換を上回ります。例えば、OpenAIのテキスト埋め込み3-小モデルに文書最適化を適用すると、コード上のnDCG5(58.7から66.8)とVDR(53.3から57.6)が改善され、さらに高価なOpenAIのテキスト埋め込み3-大モデル(66.3、VDRは57.0)をわずかに上回っている。検索のウェイトが利用できる場合、文書の最適化は微調整と競合することが多く、ほとんどの環境ではJina-ColBERT-V2を55.8から63.3に改善し、コード検索では48.6から61.8に改善している。

論文の概要: Document Optimization for Black-Box Retrieval via Reinforcement Learning

関連論文リスト