Fugu-MT 論文翻訳(概要): Efficient RAG with Intent-Aware Retrieval and Semantics-Preserving Chunking

論文の概要: Efficient RAG with Intent-Aware Retrieval and Semantics-Preserving Chunking

arxiv url: http://arxiv.org/abs/2606.01240v1
Date: Sun, 31 May 2026 13:42:36 GMT
ステータス: 翻訳完了
システム内更新日: 2026-06-02 21:34:29.466083
Title: Efficient RAG with Intent-Aware Retrieval and Semantics-Preserving Chunking
Title（参考訳）: インテント認識検索とセマンティックス保存を併用した効率的なRAG
Authors: Fachrina Dewi Puspitasari, Chaoning Zhang, Jiaquan Zhang, Zhicheng Wang, Hafiz Shakeel Ahmad Awan, Rizwan Qureshi, Jewon Lee, Tae-Ho Kim, Yang Yang,
Abstract要約: InSemRAGは、検索拡張世代(RAG)システムのためのフレームワークである。 IARは、クエリインテントに基づいて、検索チャネルを適応的に重み付けする動的ハイブリッド検索手法を実装している。 SPCは、意味的整合性を維持するために破損した証拠チャンクの検出と修復を行う。提案手法は,HotPotQAではF1が2.65ポイント,FEVERでは1.5ポイントの精度向上を実現した。
参考スコア（独自算出の注目度）: 18.698122955613584
License: http://creativecommons.org/licenses/by/4.0/
Abstract: The demand for powerful instruction following and reasoning capability of large language models (LLMs) has promoted rapid development of retrieval-augmented generation (RAG). The RAG system assists LLM generation by retrieving chunks of query-fit supplementary knowledge from an external database. Conventional RAG systems, however, suffer from information insufficiency due to two factors, which are intent-agnostic retrieval and information fragmentation. Our work proposes a RAG framework, termed InSemRAG, that addresses these challenges via an iterative retrieve-and-check mechanism with two supporting modules, an intention-aware retriever (IAR) and semantics-preserving chunking (SPC). IAR implements a dynamic hybrid retrieval method that adaptively weights the retrieval channels based on the query intent, while SPC performs detection and reparation to the damaged evidence chunks to preserve the semantic integrity. To alleviate the computational latency brought by our iterative mechanism, we leverage small language models (SLMs). Extensive experiments across several benchmark datasets consistently demonstrate the competitiveness of our method against recent state-of-the-art RAG mechanisms. Particularly, our method achieves significant gains on multi-hop and evidence-sensitive tasks, with a 2.65-point improvement in F1 on HotPotQA and a 1.5-point increase in accuracy on FEVER. Our method also achieves competitive performance to Multi-Hop RAG with 4.32$\times$ lower latency with the utilization of SLM.
Abstract（参考訳）: 大規模言語モデル(LLM)の強力な命令追従と推論能力の要求は,検索強化世代(RAG)の急速な発展を促している。 RAGシステムは、外部データベースからクエリに適した補助知識の塊を取得することにより、LCM生成を支援する。しかし、従来のRAGシステムは、意図に依存しない検索と情報の断片化という2つの要因により、情報不足に悩まされている。 InSemRAGと呼ばれるRAGフレームワークは,2つのモジュール(IAR)とセマンティックス保存チャンキング(SPC)を備えた反復的検索・チェック機構を通じて,これらの課題に対処する。 IARは、クエリインテントに基づいて検索チャネルを適応的に重み付けする動的ハイブリッド検索手法を実装し、SPCは損傷したエビデンスチャンクの検出と修復を行い、セマンティックな整合性を維持する。反復的なメカニズムによってもたらされる計算遅延を軽減するために,我々は小型言語モデル(SLM)を活用している。いくつかのベンチマークデータセットにわたる大規模な実験は、最近の最先端のRAGメカニズムに対する我々の手法の競争力を一貫して実証している。特に,HotPotQAではF1の2.65ポイント,FEVERでは1.5ポイントの精度向上が達成された。また,SLMを用いたMulti-Hop RAGに対して4.32$\times$低レイテンシで競合性能を実現する。

論文の概要: Efficient RAG with Intent-Aware Retrieval and Semantics-Preserving Chunking

関連論文リスト