Fugu-MT 論文翻訳(概要): Leveraging LLM-Assisted Query Understanding for Live Retrieval-Augmented Generation

論文の概要: Leveraging LLM-Assisted Query Understanding for Live Retrieval-Augmented Generation

arxiv url: http://arxiv.org/abs/2506.21384v1
Date: Thu, 26 Jun 2025 15:35:12 GMT
ステータス: 翻訳完了
システム内更新日: 2025-06-27 19:53:10.165114
Title: Leveraging LLM-Assisted Query Understanding for Live Retrieval-Augmented Generation
Title（参考訳）: LLMを用いたライブ検索用クエリ理解の活用
Authors: Guanting Dong, Xiaoxi Li, Yuyao Zhang, Mengjie Deng,
Abstract要約: 実世界のライブ検索拡張生成(RAG)システムは、ノイズがありあいまいで、複数の意図を含むユーザクエリを処理する場合、課題に直面します。本稿では,オープンドメイン環境におけるRAGシステムの堅牢性と有効性向上を目的とした新しいフレームワークであるOmni-RAGを紹介する。
参考スコア（独自算出の注目度）: 6.62734677678023
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Real-world live retrieval-augmented generation (RAG) systems face significant challenges when processing user queries that are often noisy, ambiguous, and contain multiple intents. While RAG enhances large language models (LLMs) with external knowledge, current systems typically struggle with such complex inputs, as they are often trained or evaluated on cleaner data. This paper introduces Omni-RAG, a novel framework designed to improve the robustness and effectiveness of RAG systems in live, open-domain settings. Omni-RAG employs LLM-assisted query understanding to preprocess user inputs through three key modules: (1) Deep Query Understanding and Decomposition, which utilizes LLMs with tailored prompts to denoise queries (e.g., correcting spelling errors) and decompose multi-intent queries into structured sub-queries; (2) Intent-Aware Knowledge Retrieval, which performs retrieval for each sub-query from a corpus (i.e., FineWeb using OpenSearch) and aggregates the results; and (3) Reranking and Generation, where a reranker (i.e., BGE) refines document selection before a final response is generated by an LLM (i.e., Falcon-10B) using a chain-of-thought prompt. Omni-RAG aims to bridge the gap between current RAG capabilities and the demands of real-world applications, such as those highlighted by the SIGIR 2025 LiveRAG Challenge, by robustly handling complex and noisy queries.
Abstract（参考訳）: 実世界のライブ検索強化世代(RAG)システムは、しばしばうるさい、曖昧で、複数の意図を含むユーザクエリを処理する場合、重大な課題に直面します。 RAGは外部の知識で大きな言語モデル(LLM)を強化するが、現在のシステムは多くの場合、よりクリーンなデータでトレーニングまたは評価されるため、このような複雑な入力に苦しむ。本稿では,オープンドメイン環境におけるRAGシステムの堅牢性と有効性向上を目的とした新しいフレームワークであるOmni-RAGを紹介する。 Omni-RAG は LLM を利用したクエリ理解を 3 つの主要なモジュールを通じてユーザ入力に導入している。(1) クエリ理解と分解(Deep Query Understanding and Decomposition)によりクエリ(例えば、スペルエラーの訂正)を識別し、マルチインテントクエリを構造化サブクエリに分解する; (2) コーパス(例えば、OpenSearch を使用したファインウェブ)から各サブクエリの検索を実行し、結果を集約する Intent-Aware Knowledge Retrieval (3) リランダ(BGE)がLLM(例えば、Falcon-10)によって最終応答が生成される前に文書の選択を洗練する。 Omni-RAGはSIGIR 2025 LiveRAG Challengeで強調されたような、現在のRAG機能と現実世界のアプリケーションの要求のギャップを埋めることを目的としており、複雑でノイズの多いクエリを堅牢に処理している。

論文の概要: Leveraging LLM-Assisted Query Understanding for Live Retrieval-Augmented Generation

関連論文リスト