Fugu-MT 論文翻訳(概要): Faithfulness-QA: A Counterfactual Entity Substitution Dataset for Training Context-Faithful RAG Models

論文の概要: Faithfulness-QA: A Counterfactual Entity Substitution Dataset for Training Context-Faithful RAG Models

arxiv url: http://arxiv.org/abs/2604.25313v2
Date: Wed, 29 Apr 2026 17:00:16 GMT
ステータス: 翻訳完了
システム内更新日: 2026-04-30 13:51:54.146661
Title: Faithfulness-QA: A Counterfactual Entity Substitution Dataset for Training Context-Faithful RAG Models
Title（参考訳）: Faithfulness-QA: コンテキスト指向RAGモデルのトレーニングのための対実エンティティ置換データセット
Authors: Li Ju, Junzhe Wang, Qi Zhang,
Abstract要約: Retrieval-Augmented Generationモデルは、検索されたコンテキストではなく、パラメトリックメモリにグラウンドされた回答を生成する。本稿では,99,094サンプルの大規模データセットであるFithfulness-QAについて紹介する。
参考スコア（独自算出の注目度）: 13.478990954968316
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Retrieval-Augmented Generation (RAG) models frequently produce answers grounded in parametric memory rather than the retrieved context, undermining the core promise of retrieval augmentation. A fundamental obstacle to fixing this unfaithfulness is the lack of training data that explicitly requires models to prefer context over internal knowledge. We introduce Faithfulness-QA, a large-scale dataset of 99,094 samples constructed through counterfactual entity substitution. Starting from two established extractive QA benchmarks--SQuAD and TriviaQA--we automatically identify answer-bearing named entities in each context, replace them with type-consistent alternatives drawn from a curated bank of 76,953 entities, and thereby manufacture controlled knowledge conflicts between context and parametric memory. Rigorous quality filtering ensures 100% pass rates across four automated checks on random 200-sample audits. We release the full dataset, the construction pipeline, and a typed entity bank covering eight named entity categories. Faithfulness-QA is designed as a training resource for attention-based faithfulness objectives and as an evaluation benchmark for measuring context-grounding behavior in RAG systems. Data and code are available at https://github.com/qzhangFDU/faithfulness-qa-dataset.
Abstract（参考訳）: Retrieval-Augmented Generation (RAG)モデルは、検索されたコンテキストではなく、パラメトリックメモリに基礎を置く回答を頻繁に生成し、検索強化のコア約束を損なう。この不誠実さを修正するための根本的な障害は、内部知識よりもコンテキストを好むモデルを必要とするトレーニングデータの欠如である。本稿では,99,094サンプルの大規模データセットであるFithfulness-QAについて紹介する。 SQuADとTriviaQAという2つの確立されたQAベンチマークから始めると、各コンテキストにおける応答を持つ名前付きエンティティを自動的に識別し、76,953個のキュレートされたバンクから引き出された型一貫性の代替品に置き換え、コンテキストとパラメトリックメモリ間の制御された知識衝突を発生させる。厳格な品質フィルタリングにより、ランダムな200サンプル監査の4つの自動チェックで100%パスレートが保証される。完全なデータセット、構築パイプライン、および8つの名前付きエンティティカテゴリをカバーする型付きエンティティバンクをリリースします。 Faithfulness-QAは、注意に基づく忠実度目標のためのトレーニングリソースとして設計され、RAGシステムにおけるコンテキストグラウンド動作を測定するための評価ベンチマークとして設計されている。データとコードはhttps://github.com/qzhangFDU/faithfulness-qa-dataset.comで入手できる。

論文の概要: Faithfulness-QA: A Counterfactual Entity Substitution Dataset for Training Context-Faithful RAG Models

関連論文リスト