Fugu-MT 論文翻訳(概要): DREAM: Dense Retrieval Embeddings via Autoregressive Modeling

論文の概要: DREAM: Dense Retrieval Embeddings via Autoregressive Modeling

arxiv url: http://arxiv.org/abs/2606.24667v1
Date: Tue, 23 Jun 2026 15:00:30 GMT
ステータス: 翻訳完了
システム内更新日: 2026-06-24 22:16:49.02166
Title: DREAM: Dense Retrieval Embeddings via Autoregressive Modeling
Title（参考訳）: DREAM: 自己回帰モデリングによるDense Retrieval Embeddings
Authors: Yixuan Tang, Yi Yang,
Abstract要約: DREAM(Dense Retrieval Embeddings via Autoregressive Modeling)を提案する。 DREAMは、フリーズ言語モデルの選択されたアテンションヘッドに、レトリバー生成クエリドキュメント類似度スコアを注入する。我々は、0.5Bから3Bパラメータの埋め込みバックボーンを用いて、BEIRとRTEBの検索ベンチマーク上でDREAMを評価する。
参考スコア（独自算出の注目度）: 12.949322198287417
License: http://creativecommons.org/licenses/by-nc-sa/4.0/
Abstract: Dense retrieval embedding models are a fundamental component of modern retrieval-based AI systems. Most dense retrievers are trained with contrastive objectives, which require labeled positive and negative document pairs that are often costly and difficult to obtain. In this work, we investigate whether the autoregressive next-token prediction objective of a large language model (LLM) can provide supervision for dense retrieval. The intuition is simple: if a document contains information relevant to a query, conditioning on that document should make the target output easier for the LLM to predict. A key challenge is that the next-token prediction loss is computed inside the LLM, while the retriever is a separate embedding model. To address this challenge, we propose DREAM (Dense Retrieval Embeddings via Autoregressive Modeling), which injects retriever-generated query-document similarity scores into selected attention heads of a frozen LLM. During training, these scores determine how much attention each candidate document receives while the LLM predicts the target output. The resulting prediction loss provides gradients for retriever training through the attention mechanism. We evaluate DREAM on retrieval benchmarks BEIR and RTEB using embedding backbones ranging from 0.5B to 3B parameters. DREAM consistently outperforms existing baselines across different model scales. These results demonstrate that DREAM provides a promising approach for training dense retrievers through autoregressive modeling.
Abstract（参考訳）: デンス検索埋め込みモデルは、現代の検索ベースのAIシステムの基本コンポーネントである。ほとんどの高密度レトリバーは、しばしばコストがかかり難い正の文書対と負の文書対をラベル付けして、対照的な目的で訓練されている。本研究では,大規模言語モデル(LLM)の自己回帰的次トーケン予測目標が,高密度検索の監視に有効かどうかを検討する。ドキュメントにクエリに関連する情報が含まれている場合、そのドキュメントの条件付けにより、LLMが予測する対象の出力がより簡単になる。重要な課題は、リトリーバーが別個の埋め込みモデルであるのに対して、次点の予測損失がLLM内で計算されることである。この課題に対処するため, 冷凍LLMの注目ヘッドに, 検索者が生成したクエリ-ドキュメント類似度スコアを注入するDREAM(Dense Retrieval Embeddings via Autoregressive Modeling)を提案する。トレーニング中、これらのスコアは、LLMが目標出力を予測している間に、各候補文書がどれだけの注意を受けるかを決定する。結果として生じる予測損失は、アテンション機構を介してレトリバートレーニングのための勾配を与える。我々は、0.5Bから3Bパラメータの埋め込みバックボーンを用いて、BEIRとRTEBの検索ベンチマーク上でDREAMを評価する。 DREAMは、さまざまなモデルスケールで既存のベースラインを一貫して上回る。これらの結果から,DREAMは自己回帰モデルを用いて高密度レトリバーの訓練に有望なアプローチを提供することが示された。

論文の概要: DREAM: Dense Retrieval Embeddings via Autoregressive Modeling

関連論文リスト