Fugu-MT paper translation (abstract): Vector RAG vs LLM-Compiled Wiki: A Preregistered Comparison on a Small Multi-Domain Research

Paper summary: Vector RAG vs LLM-Compiled Wiki: A Preregistered Comparison on a Small Multi-Domain Research

arxiv url: http://arxiv.org/abs/2605.18490v1
Date: Mon, 18 May 2026 14:41:16 GMT
Status: translation complete
Last updated in system: 2026-05-19 17:57:49.801396
Title: Vector RAG vs LLM-Compiled Wiki: A Preregistered Comparison on a Small Multi-Domain Research
Title (reference translation): Vector RAG vs LLM-Compiled Wiki: A Preregistered Comparison in Small Multi-Domain Research
Authors: Theodore O. Cochran
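Abstract summary: We compared a single-round Vector RAG system with an LLM-compiled markdown wiki. Both systems answered the same 13 questions over 24 papers using the same answer-generating model. The wiki was much better at connecting findings across papers, but its advantage in answer organization was not strong after judge adjustment.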
Reference score (independently computed attention score): 0.0
License: http://creativecommons.org/licenses/by/4.0/
Abstract: We preregistered a comparison of two ways to help an LLM answer questions over a small research corpus: a single-round Vector RAG system and an LLM-compiled markdown wiki. Both systems answered the same 13 questions over 24 papers using the same answer-generating model, and their answers were scored by blinded LLM judges. The wiki scored much better at connecting findings across papers, but its advantage in answer organization was not strong after judge adjustment. RAG met the preregistered test for single-fact lookup questions. The clean query-side cost result went against the expected wiki advantage: under the tested setup, the wiki used far more query tokens than RAG, so it could not recover any upfront build cost through cheaper queries. Two exploratory analyses changed how we interpret the result. First, claim-level citation checking favored the wiki: its cited pages more often supported the exact claims being made, even though RAG scored better on the overall groundedness rubric. Second, a decomposition-based RAG variant recovered most of the wiki's advantage on cross-paper synthesis at lower LLM-token cost, but it did not recover the wiki advantage in claim-by-claim citation support. The main conclusion is that grounded research synthesis is not a single capability. Systems can differ in how well they organize evidence, how well their citations support each claim, and how much they cost to run. In this study, no architecture was best on all three.
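Abstract (reference translation): We compared two ways of helping an LLM answer questions over a small research corpus: a single-round Vector RAG system and an LLM-compiled markdown wiki. Both systems answered the same 13 questions over 24 papers using the same answer-generating model, and the answers were scored by blinded LLM judges. The wiki was much better at connecting findings across papers, but its advantage in answer organization was not strong after judge adjustment. RAG passed the preregistered test for single-fact lookup questions. Under the tested setup, the wiki used far more query tokens than RAG, so it could not recover its upfront build cost through cheaper queries. Two exploratory analyses changed how the result is interpreted. First, claim-level citation checking favored the wiki: its cited pages more often supported the exact claims being made, even though RAG scored better on the overall groundedness rubric. Second, a decomposition-based RAG variant recovered most of the wiki's advantage on cross-paper synthesis at lower LLM-token cost, but it did not recover the wiki's advantage in claim-by-claim citation support. The main conclusion is that grounded research synthesis is not a single capability. Systems can differ in how well they organize evidence, how well their citations support each claim, and how much they cost to run. In this study, no architecture was best on all three.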
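
The cost argument in the abstract (a system with an upfront build cost can only pay that cost back if each of its queries is cheaper) can be illustrated with a short break-even calculation. This is a minimal sketch, not the paper's cost model: the function name `break_even_queries` and all token counts below are hypothetical values chosen for illustration, and none of them come from the study.

```python
import math
from typing import Optional


def break_even_queries(
    build_cost: int,
    baseline_per_query: int,
    candidate_per_query: int,
) -> Optional[int]:
    """Return the smallest number of queries after which the candidate system
    (which pays `build_cost` upfront) is strictly cheaper in total than the
    baseline (which has no upfront cost). Return None if that never happens.

    All costs are in the same unit, e.g. LLM tokens (an assumption here).
    """
    saving_per_query = baseline_per_query - candidate_per_query
    if saving_per_query <= 0:
        # The candidate costs at least as much per query, so the upfront
        # build cost can never be recovered.
        return None
    # Need n * saving_per_query > build_cost, i.e. n > build_cost / saving.
    return math.floor(build_cost / saving_per_query) + 1


# Hypothetical numbers only: a wiki-like system that is MORE expensive per
# query than the RAG-like baseline never breaks even.
print(break_even_queries(build_cost=1000, baseline_per_query=10, candidate_per_query=50))

# Hypothetical counter-case: if the candidate were cheaper per query,
# a finite break-even point would exist.
print(break_even_queries(build_cost=1000, baseline_per_query=50, candidate_per_query=10))
```

The first call corresponds to the situation the abstract reports, where the wiki used more query tokens than RAG, so no break-even point exists regardless of query volume.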

Related papers