Fugu-MT 論文翻訳(概要): BridgeRAG: Training-Free Bridge-Conditioned Retrieval for Multi-Hop Question Answering

論文の概要: BridgeRAG: Training-Free Bridge-Conditioned Retrieval for Multi-Hop Question Answering

arxiv url: http://arxiv.org/abs/2604.03384v1
Date: Fri, 03 Apr 2026 18:27:51 GMT
ステータス: 翻訳完了
システム内更新日: 2026-04-07 15:49:18.54683
Title: BridgeRAG: Training-Free Bridge-Conditioned Retrieval for Multi-Hop Question Answering
Title（参考訳）: BridgeRAG:Multi-Hop Question Answeringのためのトレーニングフリーブリッジコンディション検索
Authors: Andre Bacellar,
Abstract要約: 本稿では,BridgeRAGを提案する。 BridgeRAGは、スコアリングからカバーを分離する: 二重エンタテイティーANNは、第2のホップ候補プールを広げる。ブリッジコンディショニングが生産的再ランク付けを引き起こすことを示す(並列チェーンでは18.7%、単一チェーンでは0.6%)。
参考スコア（独自算出の注目度）: 0.0
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Multi-hop retrieval is not a single-step relevance problem: later-hop evidence should be ranked by its utility conditioned on retrieved bridge evidence, not by similarity to the original query alone. We present BridgeRAG, a training-free, graph-free retrieval method for retrieval-augmented generation (RAG) over multi-hop questions that operationalizes this view with a tripartite scorer s(q,b,c) over (question, bridge, candidate). BridgeRAG separates coverage from scoring: dual-entity ANN expansion broadens the second-hop candidate pool, while a bridge-conditioned LLM judge identifies the active reasoning chain among competing candidates without any offline graph or proposition index. Across four controlled experiments we show that this conditioning signal is (i) selective: +2.55pp on parallel-chain queries (p<0.001) vs. ~0 on single-chain subtypes; (ii) irreplaceable: substituting the retrieved passage with generated SVO query text reduces R@5 by 2.1pp, performing worse than even the lowest-SVO-similarity pool passage; (iii) predictable: cos(b,g2) correlates with per-query gain (Spearman rho=0.104, p<0.001); and (iv) mechanistically precise: bridge conditioning causes productive re-rankings (18.7% flip-win rate on parallel-chain vs. 0.6% on single-chain), not merely more churn. Combined with lightweight coverage expansion and percentile-rank score fusion, BridgeRAG achieves the best published training-free R@5 under matched benchmark evaluation on all three standard MHQA benchmarks without a graph database or any training: 0.8146 on MuSiQue (+3.1pp vs. PropRAG, +6.8pp vs. HippoRAG2), 0.9527 on 2WikiMultiHopQA (+1.2pp vs. PropRAG), and 0.9875 on HotpotQA (+1.35pp vs. PropRAG).
Abstract（参考訳）: マルチホップ検索は、単一ステップの関連性の問題ではない: 後続ホップ証拠は、検索されたブリッジエビデンスに条件づけられた実用性によってランク付けされるべきであり、元のクエリのみと類似性によってランク付けされるべきである。我々は、このビューを3部構成のスコアラーs(q,b,c)で操作するマルチホップ質問に対して、学習不要でグラフのない検索手法であるBridgeRAGを提案する。デュアルエンタリティ ANN拡張は第2のホップ候補プールを広げる一方、ブリッジ条件のLCM判事は、オフライングラフや命題インデックスのない競合候補間のアクティブな推論チェーンを識別する。 4つの制御された実験で、この条件付け信号が示される。 (i)選択:+2.55pp on parallel-chain query (p<0.001) vs. ~0 on single-chain subtypes (ii)置き換え不能:生成されたSVOクエリテキストで検索されたパスを置換すると、R@5が2.1pp減少し、最低のSVO類似プールパスよりもパフォーマンスが悪くなります。 (iii)予測可能:cos(b,g2)は、クエリ当たりのゲイン(Spearman rho=0.104, p<0.001);および (4) 機械的正確性: ブリッジコンディショニングはパラレルチェーンでは18.7%のフリップウィンレート、シングルチェーンでは0.6%のフリップウィンレートを引き起こす。軽量なカバレッジ拡張とパーセンタイル-ランクスコアの融合と組み合わせて、BridgeRAGは、グラフデータベースなしの3つの標準MHQAベンチマークと、いかなるトレーニングでも0.8146 on MuSiQue (+3.1pp vs. PropRAG, +6.8pp vs. HippoRAG2)、0.9527 on 2WikiMultiHopQA (+1.2pp vs. PropRAG)、0.9875 on HotpotQA (+1.35pp vs. PropRAG)という、マッチしたベンチマークで最高のトレーニングフリーのR@5を達成している。

論文の概要: BridgeRAG: Training-Free Bridge-Conditioned Retrieval for Multi-Hop Question Answering

関連論文リスト