Fugu-MT 論文翻訳(概要): M-RAG: Making RAG Faster, Stronger, and More Efficient

論文の概要: M-RAG: Making RAG Faster, Stronger, and More Efficient

arxiv url: http://arxiv.org/abs/2603.26667v1
Date: Tue, 06 Jan 2026 15:14:54 GMT
ステータス: 翻訳完了
システム内更新日: 2026-04-06 02:36:13.046681
Title: M-RAG: Making RAG Faster, Stronger, and More Efficient
Title（参考訳）: M-RAG: RAGをより速く、より強く、より効率的にする
Authors: Sun Xu, Tongkai Xu, Baiheng Xie, Li Huang, Qiang Gao, Kunpeng Zhang,
Abstract要約: 大規模言語モデルのための新しいチャンクフリー検索戦略であるM-RAGを提案する。 M-RAGは構造化されたk-v分解メタマーカーを軽量で意図に整合した検索キーで抽出する。 M-RAGは高い効率で解答フレンドリーな証拠を回収できることを示す。
参考スコア（独自算出の注目度）: 15.147969102210759
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Retrieval-Augmented Generation (RAG) has become a widely adopted paradigm for enhancing the reliability of large language models (LLMs). However, RAG systems are sensitive to retrieval strategies that rely on text chunking to construct retrieval units, which often introduce information fragmentation, retrieval noise, and reduced efficiency. Recent work has even questioned the necessity of RAG, arguing that long-context LLMs may eliminate multi-stage retrieval pipelines by directly processing full documents. Nevertheless, expanded context capacity alone does not resolve the challenges of relevance filtering, evidence prioritization, and isolating answer-bearing information. To this end, we proposed M-RAG, a novel Chunk-free retrieval strategy. Instead of retrieving coarse-grained textual chunks, M-RAG extracts structured, k-v decomposition meta-markers, with a lightweight, intent-aligned retrieval key for retrieval and a context-rich information value for generation. Under this setting, M-RAG enables efficient and stable query-key similarity matching without sacrificing expressive ability. Experimental results on the LongBench subtasks demonstrate that M-RAG outperforms chunk-based RAG baselines across varying token budgets, particularly under low-resource settings. Extensive analysis further reveals that M-RAG retrieves more answer-friendly evidence with high efficiency, validating the effectiveness of decoupling retrieval representation from generation and highlighting the proposed strategy as a scalable and robust alternative to existing chunk-based methods.
Abstract（参考訳）: Retrieval-Augmented Generation (RAG)は、大規模言語モデル(LLM)の信頼性を高めるためのパラダイムとして広く採用されている。しかし、RAGシステムはテキストチャンキングに依存した検索戦略に敏感であり、情報断片化、検索ノイズ、効率の低下をもたらすことが多い。最近の研究はRAGの必要性に疑問を投げかけ、LLMは文書を直接処理することで多段階検索パイプラインを排除できると主張している。それでも、拡張されたコンテキストキャパシティだけでは、関連フィルタリング、エビデンス優先順位付け、および回答を含む情報の分離という課題は解決しない。そこで我々は,新しいチャンクフリー検索戦略であるM-RAGを提案する。粗いテキストチャンクを検索する代わりに、M-RAGは構造化されたk-v分解メタマーカーを抽出する。この設定の下で、M-RAGは表現能力を犠牲にすることなく、効率的で安定したクエリキー類似性マッチングを可能にする。 LongBenchサブタスクの実験結果から、M-RAGは、特に低リソース設定下で、様々なトークン予算でチャンクベースのRAGベースラインを上回っていることが示された。さらに、M-RAGは、より解答フレンドリな証拠を高い効率で回収し、検索表現を生成から切り離すことの有効性を検証し、既存のチャンクベースの手法のスケーラブルで堅牢な代替手段として提案された戦略を強調した。

論文の概要: M-RAG: Making RAG Faster, Stronger, and More Efficient

関連論文リスト