Fugu-MT 論文翻訳(概要): Fishing for Answers: Exploring One-shot vs. Iterative Retrieval Strategies for Retrieval Augmented Generation

論文の概要: Fishing for Answers: Exploring One-shot vs. Iterative Retrieval Strategies for Retrieval Augmented Generation

arxiv url: http://arxiv.org/abs/2509.04820v1
Date: Fri, 05 Sep 2025 05:44:50 GMT
ステータス: 翻訳完了
システム内更新日: 2025-09-08 14:27:25.488049
Title: Fishing for Answers: Exploring One-shot vs. Iterative Retrieval Strategies for Retrieval Augmented Generation
Title（参考訳）: アンサーのための釣り:一発対再検索戦略を探求して
Authors: Huifeng Lin, Gang Su, Jintao Liang, You Wu, Rui Zhao, Ziyue Li,
Abstract要約: Retrieval-Augmented Generation (RAG) は、LLM(Large Language Models)に基づく、業界におけるクローズドソース文書の理解とクエリのための強力なソリューションである。しかしながら、基本的なRAGは、法律および規制領域における複雑なQAタスクに苦しむことが多い。証拠のカバレッジを改善し、品質に答えるための2つの戦略を探求する。
参考スコア（独自算出の注目度）: 11.180502261031789
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Retrieval-Augmented Generation (RAG) based on Large Language Models (LLMs) is a powerful solution to understand and query the industry's closed-source documents. However, basic RAG often struggles with complex QA tasks in legal and regulatory domains, particularly when dealing with numerous government documents. The top-$k$ strategy frequently misses golden chunks, leading to incomplete or inaccurate answers. To address these retrieval bottlenecks, we explore two strategies to improve evidence coverage and answer quality. The first is a One-SHOT retrieval method that adaptively selects chunks based on a token budget, allowing as much relevant content as possible to be included within the model's context window. Additionally, we design modules to further filter and refine the chunks. The second is an iterative retrieval strategy built on a Reasoning Agentic RAG framework, where a reasoning LLM dynamically issues search queries, evaluates retrieved results, and progressively refines the context over multiple turns. We identify query drift and retrieval laziness issues and further design two modules to tackle them. Through extensive experiments on a dataset of government documents, we aim to offer practical insights and guidance for real-world applications in legal and regulatory domains.
Abstract（参考訳）: Retrieval-Augmented Generation (RAG) は、LLM(Large Language Models)に基づく、業界におけるクローズドソース文書の理解とクエリのための強力なソリューションである。しかしながら、基本的なRAGは、法律および規制領域における複雑なQAタスク、特に多くの政府文書を扱う際に、しばしば苦労する。最上位の$戦略は、しばしば黄金の塊を見逃し、不完全または不正確な答えをもたらす。これらの発見ボトルネックに対処するため、証拠のカバレッジを改善し、品質に答えるための2つの戦略を探求する。 1つ目は、トークン予算に基づいてチャンクを適応的に選択するOne-SHOT検索手法で、モデルのコンテキストウィンドウに可能な限り関連コンテンツを含めることができる。さらに、チャンクをさらにフィルタし、洗練するためのモジュールも設計しています。 2つ目は、Reasoning Agentic RAGフレームワーク上に構築された反復的検索戦略である。クエリドリフトと検索遅延の問題を識別し,さらに2つのモジュールを設計する。政府文書のデータセットに関する広範な実験を通じて、法律及び規制領域における現実の応用に対する実践的な洞察とガイダンスを提供することを目指している。

論文の概要: Fishing for Answers: Exploring One-shot vs. Iterative Retrieval Strategies for Retrieval Augmented Generation

関連論文リスト