Fugu-MT 論文翻訳(概要): FVA-RAG: Falsification-Verification Alignment for Mitigating Sycophantic Hallucinations

論文の概要: FVA-RAG: Falsification-Verification Alignment for Mitigating Sycophantic Hallucinations

arxiv url: http://arxiv.org/abs/2512.07015v1
Date: Sun, 07 Dec 2025 21:28:42 GMT
ステータス: 翻訳完了
システム内更新日: 2025-12-09 22:03:54.639307
Title: FVA-RAG: Falsification-Verification Alignment for Mitigating Sycophantic Hallucinations
Title（参考訳）: FVA-RAG:Sycophanic Hallucinationを緩和するためのFalsification-Verification Alignment
Authors: Mayank Ravishankara,
Abstract要約: Falsification-Verification Alignment RAG(FVA-RAG)は、検索パラダイムをインダクティブ・検証(探索支援)からデダクティブ・ファルシフィケーション(耐障害性を参照)へ移行するフレームワークである。この「アンチ・コンテクスト」に対する草案回答を明示的に重み付けする二重検証機構を導入する。一般的な誤解のデータセットに関する予備実験では、FVA-RAGは標準的なRAGベースラインと比較して、幻覚に対する堅牢性を大幅に向上することが示された。
参考スコア（独自算出の注目度）: 0.0
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Retrieval-Augmented Generation (RAG) systems have significantly reduced hallucinations in Large Language Models (LLMs) by grounding responses in external context. However, standard RAG architectures suffer from a critical vulnerability: Retrieval Sycophancy. When presented with a query based on a false premise or a common misconception, vector-based retrievers tend to fetch documents that align with the user's bias rather than objective truth, leading the model to "hallucinate with citations." In this work, we introduce Falsification-Verification Alignment RAG (FVA-RAG), a framework that shifts the retrieval paradigm from Inductive Verification (seeking support) to Deductive Falsification (seeking disproof). Unlike existing "Self-Correction" methods that rely on internal consistency, FVA-RAG deploys a distinct Adversarial Retrieval Policy that actively generates "Kill Queries"-targeted search terms designed to surface contradictory evidence. We introduce a dual-verification mechanism that explicitly weighs the draft answer against this "Anti-Context." Preliminary experiments on a dataset of common misconceptions demonstrate that FVA-RAG significantly improves robustness against sycophantic hallucinations compared to standard RAG baselines, effectively acting as an inference-time "Red Team" for factual generation.
Abstract（参考訳）: Retrieval-Augmented Generation (RAG) システムは、外部コンテキストでの応答をグラウンド化することによって、Large Language Models (LLM) における幻覚を著しく低減した。しかし、標準的なRAGアーキテクチャは重大な脆弱性に悩まされている。虚偽の前提や共通の誤解に基づくクエリを提示すると、ベクトルベースの検索者は、客観的な真理ではなく、ユーザのバイアスに沿った文書を取得する傾向にあり、モデルが「引用に幻滅する」傾向にある。本研究では、帰納的検証(インダクティブ・検証)から帰納的Falsification(ディダクティブ・ファルシフィケーション)へ、学習パラダイムを移行させるフレームワークであるFVA-RAG(Falsification-Verification Alignment RAG)を紹介する。内部整合性に依存する既存の「自己補正」手法とは異なり、FVA-RAGは「キルクェリ」をターゲットとした探索用語を積極的に生成する独自の逆検索ポリシーを展開している。本稿では、この「アンチコンテキスト」に対するドラフト回答を明示的に重み付けする二重検証機構を導入する。一般的な誤解のデータセットに関する予備実験では、FVA-RAGは標準的なRAGベースラインと比較して、幻覚に対する堅牢性を大幅に改善し、事実上、実際の生成のための推論時間「レッドチーム」として機能することを示した。

論文の概要: FVA-RAG: Falsification-Verification Alignment for Mitigating Sycophantic Hallucinations

関連論文リスト