Fugu-MT 論文翻訳(概要): Principled Context Engineering for RAG: Statistical Guarantees via Conformal Prediction

論文の概要: Principled Context Engineering for RAG: Statistical Guarantees via Conformal Prediction

arxiv url: http://arxiv.org/abs/2511.17908v1
Date: Sat, 22 Nov 2025 04:17:06 GMT
ステータス: 翻訳完了
システム内更新日: 2025-11-25 18:34:24.508253
Title: Principled Context Engineering for RAG: Statistical Guarantees via Conformal Prediction
Title（参考訳）: RAGの原理的文脈工学:コンフォーマル予測による統計的保証
Authors: Debashish Chakraborty, Eugene Yang, Daniel Khashabi, Dawn Lawrie, Kevin Duh,
Abstract要約: Retrieval-Augmented Generation (RAG)は、大規模言語モデルにおける現実的な基盤を強化する。既存の前世代のフィルターは信頼性のスコアに依存しており、保持された証拠に対する統計的制御を提供していない。本研究では,コンフォメーション予測によるコンテキスト工学の実証を行う。
参考スコア（独自算出の注目度）: 40.28465841863481
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Retrieval-Augmented Generation (RAG) enhances factual grounding in large language models (LLMs) by incorporating retrieved evidence, but LLM accuracy declines when long or noisy contexts exceed the model's effective attention span. Existing pre-generation filters rely on heuristics or uncalibrated LLM confidence scores, offering no statistical control over retained evidence. We evaluate and demonstrate context engineering through conformal prediction, a coverage-controlled filtering framework that removes irrelevant content while preserving recall of supporting evidence. Using both embedding- and LLM-based scoring functions, we test this approach on the NeuCLIR and RAGTIME collections. Conformal filtering consistently meets its target coverage, ensuring that a specified fraction of relevant snippets are retained, and reduces retained context by 2-3x relative to unfiltered retrieval. On NeuCLIR, downstream factual accuracy measured by ARGUE F1 improves under strict filtering and remains stable at moderate coverage, indicating that most discarded material is redundant or irrelevant. These results demonstrate that conformal prediction enables reliable, coverage-controlled context reduction in RAG, offering a model-agnostic and principled approach to context engineering.
Abstract（参考訳）: Retrieval-Augmented Generation (RAG) は、検索された証拠を組み込むことで、大規模言語モデル(LLM)の現実的な基盤を強化するが、LLMの精度は、長期またはノイズの多いコンテキストがモデルの効果的な注意範囲を超えると低下する。既存の前世代のフィルターはヒューリスティックスやLLMの信頼性スコアに依存しており、保持された証拠に対する統計的制御を提供していない。我々は,証拠の記憶を保ちながら,無関係なコンテンツを除去するカバー制御フィルタリングフレームワークであるコンフォメーション予測を用いて,文脈工学を評価し,実証する。埋め込み型とLLM型の両方のスコアリング関数を用いて、NeuCLIRとRAGTIMEのコレクション上でこのアプローチをテストする。コンフォーマルフィルタリングはその対象のカバレッジを一貫して満たし、特定のスニペットの断片が保持されることを保証し、未フィルタリングの検索に対して保持されたコンテキストを2～3倍削減する。 NeuCLIRでは、ARGUE F1によって測定された下流の事実精度は厳密なフィルタリングの下で改善され、適度な範囲で安定であり、ほとんどの廃棄された物質は冗長であるか無関係であることを示している。これらの結果は、コンフォメーション予測がRAGの信頼性、カバレッジ制御されたコンテキスト還元を可能にし、コンテキスト工学に対するモデルに依存しない、原則化されたアプローチを提供することを示す。

論文の概要: Principled Context Engineering for RAG: Statistical Guarantees via Conformal Prediction

関連論文リスト