Fugu-MT 論文翻訳(概要): List-aware Reranking-Truncation Joint Model for Search and Retrieval-augmented Generation

論文の概要: List-aware Reranking-Truncation Joint Model for Search and Retrieval-augmented Generation

arxiv url: http://arxiv.org/abs/2402.02764v1
Date: Mon, 5 Feb 2024 06:52:53 GMT
ステータス: 翻訳完了
システム内更新日: 2024-02-06 17:37:22.686110
Title: List-aware Reranking-Truncation Joint Model for Search and Retrieval-augmented Generation
Title（参考訳）: 検索・検索付加生成のためのリスト対応リグレード・トランケーション・ジョイントモデル
Authors: Shicheng Xu, Liang Pang, Jun Xu, Huawei Shen, Xueqi Cheng
Abstract要約: 本稿では,2つのタスクを同時に実行可能なRe rank-Truncation joint model(GenRT)を提案する。 GenRTは、エンコーダ-デコーダアーキテクチャに基づく生成パラダイムによるリランクとトランケーションを統合している。提案手法は,Web検索および検索拡張LLMにおけるリランクタスクとトラルケーションタスクの両方においてSOTA性能を実現する。
参考スコア（独自算出の注目度）: 80.12531449946655
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: The results of information retrieval (IR) are usually presented in the form of a ranked list of candidate documents, such as web search for humans and retrieval-augmented generation for large language models (LLMs). List-aware retrieval aims to capture the list-level contextual features to return a better list, mainly including reranking and truncation. Reranking finely re-scores the documents in the list. Truncation dynamically determines the cut-off point of the ranked list to achieve the trade-off between overall relevance and avoiding misinformation from irrelevant documents. Previous studies treat them as two separate tasks and model them separately. However, the separation is not optimal. First, it is hard to share the contextual information of the ranking list between the two tasks. Second, the separate pipeline usually meets the error accumulation problem, where the small error from the reranking stage can largely affect the truncation stage. To solve these problems, we propose a Reranking-Truncation joint model (GenRT) that can perform the two tasks concurrently. GenRT integrates reranking and truncation via generative paradigm based on encoder-decoder architecture. We also design the novel loss functions for joint optimization to make the model learn both tasks. Sharing parameters by the joint model is conducive to making full use of the common modeling information of the two tasks. Besides, the two tasks are performed concurrently and co-optimized to solve the error accumulation problem between separate stages. Experiments on public learning-to-rank benchmarks and open-domain Q\&A tasks show that our method achieves SOTA performance on both reranking and truncation tasks for web search and retrieval-augmented LLMs.
Abstract（参考訳）: 情報検索(IR)の結果は通常、人間のウェブ検索や大規模言語モデル(LLM)の検索強化生成など、候補文書のランク付けされたリストの形式で提示される。リストアウェア検索は、リストレベルのコンテキスト機能をキャプチャして、リストを返却することを目的としている。リスト内の文書を細かく再スコアする。トランケーションは、ランクリストのカットオフポイントを動的に決定し、関連性全体のトレードオフと無関係な文書からの誤情報を避ける。以前の研究では、それらを2つの別々のタスクとして扱い、個別にモデル化した。しかし、分離は最適ではない。まず,2つのタスク間でランキングリストのコンテキスト情報を共有することは困難である。第二に、分離されたパイプラインは通常エラー蓄積問題に満ちており、再ランキングステージからの小さなエラーがトランザクションステージに大きく影響する可能性がある。これらの問題を解決するために,2つのタスクを同時に実行可能なRe rank-Truncation Joint Model (GenRT)を提案する。 GenRTは、エンコーダ-デコーダアーキテクチャに基づく生成パラダイムによるリランクとトランケーションを統合している。また, 協調最適化のための新しい損失関数を設計し, モデルが両方のタスクを学習できるようにする。ジョイントモデルによるパラメータの共有は、この2つのタスクの共通モデリング情報を最大限に活用することにつながる。さらに、2つのタスクを同時に実行し、異なるステージ間のエラー蓄積問題を解決するために協調最適化する。オープンドメインQ&Aタスクと公開学習ベンチマークを用いた実験により,Web検索および検索拡張 LLM における再ランク化タスクとトランケーションタスクの両方においてSOTA性能が達成された。

論文の概要: List-aware Reranking-Truncation Joint Model for Search and Retrieval-augmented Generation

関連論文リスト