Fugu-MT 論文翻訳(概要): CoCR-RAG: Enhancing Retrieval-Augmented Generation in Web Q&A via Concept-oriented Context Reconstruction

論文の概要: CoCR-RAG: Enhancing Retrieval-Augmented Generation in Web Q&A via Concept-oriented Context Reconstruction

arxiv url: http://arxiv.org/abs/2603.23989v1
Date: Wed, 25 Mar 2026 06:38:09 GMT
ステータス: 翻訳完了
システム内更新日: 2026-03-26 21:06:11.165442
Title: CoCR-RAG: Enhancing Retrieval-Augmented Generation in Web Q&A via Concept-oriented Context Reconstruction
Title（参考訳）: CoCR-RAG:概念指向コンテキスト再構成によるWebQ&Aの検索強化
Authors: Kaize Shi, Xueyao Sun, Qika Lin, Firoj Alam, Qing Li, Xiaohui Tao, Guandong Xu,
Abstract要約: Retrieval-augmented Generation (RAG)は、Webや他の外部ソースからの情報を取り入れることで、Q&Aの強化に有望な結果を示している。本稿では,マルチソース情報融合問題に対処するフレームワークである概念指向コンテキスト再構成RAG(CoCR-RAG)を提案する。
参考スコア（独自算出の注目度）: 26.159576535656274
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Retrieval-augmented generation (RAG) has shown promising results in enhancing Q&A by incorporating information from the web and other external sources. However, the supporting documents retrieved from the heterogeneous web often originate from multiple sources with diverse writing styles, varying formats, and inconsistent granularity. Fusing such multi-source documents into a coherent and knowledge-intensive context remains a significant challenge, as the presence of irrelevant and redundant information can compromise the factual consistency of the inferred answers. This paper proposes the Concept-oriented Context Reconstruction RAG (CoCR-RAG), a framework that addresses the multi-source information fusion problem in RAG through linguistically grounded concept-level integration. Specifically, we introduce a concept distillation algorithm that extracts essential concepts from Abstract Meaning Representation (AMR), a stable semantic representation that structures the meaning of texts as logical graphs. The distilled concepts from multiple retrieved documents are then fused and reconstructed into a unified, information-intensive context by Large Language Models, which supplement only the necessary sentence elements to highlight the core knowledge. Experiments on the PopQA and EntityQuestions datasets demonstrate that CoCR-RAG significantly outperforms existing context-reconstruction methods across these Web Q&A benchmarks. Furthermore, CoCR-RAG shows robustness across various backbone LLMs, establishing itself as a flexible, plug-and-play component adaptable to different RAG frameworks.
Abstract（参考訳）: Retrieval-augmented Generation (RAG)は、Webや他の外部ソースからの情報を取り入れることで、Q&Aの強化に有望な結果を示している。しかし、異種Webから取得したサポートドキュメントは、多種多様な書式、様々なフォーマット、一貫性のない粒度を持つ複数のソースに由来することが多い。このような多ソース文書を一貫性のある、知識集約的なコンテキストに融合することは、無関係で冗長な情報の存在が、推測された回答の事実的一貫性を損なう可能性があるため、依然として重要な課題である。本稿では,言語的に基礎付けられた概念レベルの統合を通じて,RAGのマルチソース情報融合問題に対処するフレームワークである概念指向コンテキスト再構成RAG(CoCR-RAG)を提案する。具体的には、テキストの意味を論理グラフとして構造化する安定な意味表現である抽象的意味表現(AMR)から重要な概念を抽出する概念蒸留アルゴリズムを提案する。複数の検索された文書から抽出された概念を融合して、大言語モデルによって統合された情報集約的な文脈に再構築し、コア知識を強調するために必要な文要素のみを補完する。 PopQAデータセットとEntityQuestionsデータセットの実験は、CoCR-RAGがこれらのWeb Q&Aベンチマークで既存のコンテキスト再構成メソッドを大幅に上回っていることを示している。さらに、CoCR-RAG は様々なバックボーン LLM に対して堅牢性を示し、様々なRAG フレームワークに適応可能なフレキシブルなプラグイン・アンド・プレイコンポーネントとしての地位を確立している。

論文の概要: CoCR-RAG: Enhancing Retrieval-Augmented Generation in Web Q&A via Concept-oriented Context Reconstruction

関連論文リスト