Fugu-MT 論文翻訳(概要): NeocorRAG: Less Irrelevant Information, More Explicit Evidence, and More Effective Recall via Evidence Chains

論文の概要: NeocorRAG: Less Irrelevant Information, More Explicit Evidence, and More Effective Recall via Evidence Chains

arxiv url: http://arxiv.org/abs/2604.27852v1
Date: Thu, 30 Apr 2026 13:37:01 GMT
ステータス: 翻訳完了
システム内更新日: 2026-05-01 16:31:54.112637
Title: NeocorRAG: Less Irrelevant Information, More Explicit Evidence, and More Effective Recall via Evidence Chains
Title（参考訳）: NeocorRAG:無関係な情報、より明確なエビデンス、エビデンスチェーンによるより効果的なリコール
Authors: Shiyao Peng, Qianhe Zheng, Zhuodi Hao, Zichen Tang, Rongjin Li, Qing Huang, Jiayu Huang, Jiacheng Liu, Yifan Zhu, Haihong E,
Abstract要約: Retrieval-Augmented Generation (RAG)の改善は、下流の推論において、常に相反する利得に変換されない。本稿では,リコール変換率(Recall Conversion Rate, RCR)を提案する。我々は,Evidence Chainsを体系的にマイニングし,活用することにより,総合的な検索品質の最適化を実現するNeocorRAGフレームワークを提案する。
参考スコア（独自算出の注目度）: 14.980207267884674
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Although precise recall is a core objective in Retrieval-Augmented Generation (RAG), a critical oversight persists in the field: improvements in retrieval performance do not consistently translate to commensurate gains in downstream reasoning. To diagnose this gap, we propose the Recall Conversion Rate (RCR), a novel evaluation metric to quantify the contribution of retrieval to reasoning accuracy. Our quantitative analysis of mainstream RAG methods reveals that as Recall@5 improves, the RCR exhibits a near-linear decay. We identify the neglect of retrieval quality in these methods as the underlying cause. In contrast, approaches that focus solely on quality optimization often suffer from inferior recall performance. Both categories lack a comprehensive understanding of retrieval quality optimization, resulting in a trade-off dilemma. To address these challenges, we propose comprehensive retrieval quality optimization criteria and introduce the NeocorRAG framework. This framework achieves holistic retrieval quality optimization by systematically mining and utilizing Evidence Chains. Specifically, NeocorRAG first employs an innovative activated search algorithm to obtain a refined candidate space. Then it ensures precise evidence chain generation through constrained decoding. Finally, the retrieved set of evidence chains guides the retrieval optimization process. Evaluated on benchmarks including HotpotQA, 2WikiMultiHopQA, MuSiQue, and NQ, NeocorRAG achieves SOTA performance on both 3B and 70B parameter models, while consuming less than 20% of tokens used by comparable methods. This study presents an efficient, training-free paradigm for RAG enhancement that effectively optimizes retrieval quality while maintaining high recall. Our code is released at https://github.com/BUPT-Reasoning-Lab/NeocorRAG.
Abstract（参考訳）: Retrieval-Augmented Generation (RAG) における正確なリコールは中核的な目的であるが、重要な監視はこの分野に留まっている。このギャップを診断するために,検索と推論の精度を定量化する新しい評価指標であるリコール変換率(Recall Conversion Rate, RCR)を提案する。主流RAG法を定量的に分析したところ、Recall@5が改良されるにつれて、RCRはほぼ直線的な崩壊を示すことがわかった。これらの手法における検索品質の欠如を根本原因とみなす。対照的に、品質最適化にのみ焦点をあてるアプローチは、しばしばリコール性能が劣る。どちらのカテゴリも、検索品質の最適化に関する包括的な理解が欠けているため、トレードオフジレンマが発生する。これらの課題に対処するため、我々は総合的な検索品質最適化基準を提案し、NeocorRAGフレームワークを導入した。このフレームワークは,エビデンス・チェーンを体系的にマイニングし,活用することにより,総合的な検索品質の最適化を実現する。具体的には、NeocorRAGはまず革新的活性化探索アルゴリズムを用いて、洗練された候補空間を得る。そして、制約付き復号による正確なエビデンスチェーン生成を保証する。最後に、得られたエビデンスチェーンの集合が、検索最適化プロセスのガイドとなる。 HotpotQA, 2WikiMultiHopQA, MuSiQue, NQ などのベンチマークで評価すると,NeocorRAG は 3B と 70B のパラメータモデルで SOTA のパフォーマンスを達成し,同等のメソッドで使用されるトークンの20%未満を消費する。本研究では,高いリコールを維持しつつ,検索品質を効果的に最適化するRAG強化のための,効率的かつトレーニング不要なパラダイムを提案する。私たちのコードはhttps://github.com/BUPT-Reasoning-Lab/NeocorRAGで公開されています。

論文の概要: NeocorRAG: Less Irrelevant Information, More Explicit Evidence, and More Effective Recall via Evidence Chains

関連論文リスト