Fugu-MT 論文翻訳(概要): Structure Guided Retrieval-Augmented Generation for Factual Queries

論文の概要: Structure Guided Retrieval-Augmented Generation for Factual Queries

arxiv url: http://arxiv.org/abs/2604.22843v1
Date: Tue, 21 Apr 2026 14:43:53 GMT
ステータス: 翻訳完了
システム内更新日: 2026-04-28 17:12:06.982387
Title: Structure Guided Retrieval-Augmented Generation for Factual Queries
Title（参考訳）: ファクチュアルクエリのための構造ガイド付き検索型生成法
Authors: Miao Xie, Xiao Zhang, Yi Li, Chunli Lv,
Abstract要約: Retrieval-Augmented Generation (RAG) は、大規模言語モデル(LLM)における幻覚を軽減するために提案されている。我々は、Exact Retrieval Problem (ERP)という新しい研究問題を導入する。我々の知る限りでは、全てのクエリ条件を満たすためにRAGに構造情報を明示的に組み込んだ最初の問題定式化である。我々は,20のドメインにまたがる複雑な条件を含む,120000の事実指向QAペアからなる大規模データセットであるExact Retrieval Question Answering (ERQA)を構築し,公開する。
参考スコア（独自算出の注目度）: 6.076287232501258
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Retrieval-Augmented Generation (RAG) has been proposed to mitigate hallucinations in large language models (LLMs), where generated outputs may be factually incorrect. However, existing RAG approaches predominantly rely on vector similarity for retrieval, which is prone to semantic noise and fails to ensure that generated responses fully satisfy the complex conditions specified by factual queries, often leading to incorrect answers. To address this challenge, we introduce a novel research problem, named Exact Retrieval Problem (ERP). To the best of our knowledge, this is the first problem formulation that explicitly incorporates structural information into RAG for factual questions to satisfy all query conditions. For this novel problem, we propose Structure Guided Retrieval-Augmented Generation (SG-RAG), which models the retrieval process as an embedding-based subgraph matching task, and uses the retrieved topological structures to guide the LLM to generate answers that meet all specified query conditions. To facilitate evaluation of ERP, we construct and publicly release Exact Retrieval Question Answering (ERQA), a large-scale dataset comprising 120000 fact-oriented QA pairs, each involving complex conditions, spanning 20 diverse domains. The experimental results demonstrate that SG-RAG significantly outperforms strong baselines on ERQA, delivering absolute improvements from 20.68 to 50.88 points across all evaluation metrics, while maintaining reasonable computational overhead.
Abstract（参考訳）: Retrieval-Augmented Generation (RAG) は、大規模言語モデル(LLM)における幻覚を軽減するために提案されている。しかし、既存のRAGアプローチは、主に検索のベクトル類似性に依存しており、これは意味的なノイズに起因し、生成した応答が事実クエリによって特定される複雑な条件を完全に満たすことを確実にすることができない。この課題に対処するため,我々はExact Retrieval Problem (ERP) という新しい研究課題を紹介した。我々の知る限りでは、全てのクエリ条件を満たすためにRAGに構造情報を明示的に組み込んだ最初の問題定式化である。本稿では,検索プロセスを埋め込みベースのサブグラフマッチングタスクとしてモデル化し,検索したトポロジ構造を用いてLLMを誘導し,特定のクエリ条件をすべて満たす回答を生成する構造ガイド型検索拡張生成(SG-RAG)を提案する。 ERPの評価を容易にするために,我々は,20のドメインにまたがる複雑な条件を含む120000の事実指向QAペアからなる大規模データセットであるExact Retrieval Question Answering (ERQA)を構築し,公開する。実験の結果、SG-RAGはERQAの強いベースラインを著しく上回り、全ての評価指標において20.68から50.88ポイントの絶対的な改善を達成し、計算オーバーヘッドを合理的に維持することを示した。

論文の概要: Structure Guided Retrieval-Augmented Generation for Factual Queries

関連論文リスト