Fugu-MT 論文翻訳(概要): Trust but Verify: Introducing DAVinCI -- A Framework for Dual Attribution and Verification in Claim Inference for Language Models

論文の概要: Trust but Verify: Introducing DAVinCI -- A Framework for Dual Attribution and Verification in Claim Inference for Language Models

arxiv url: http://arxiv.org/abs/2604.21193v1
Date: Thu, 23 Apr 2026 01:37:50 GMT
ステータス: 翻訳完了
システム内更新日: 2026-04-24 14:40:06.231854
Title: Trust but Verify: Introducing DAVinCI -- A Framework for Dual Attribution and Verification in Claim Inference for Language Models
Title（参考訳）: Trust but Verify: DAVinCIの紹介 - 言語モデルに対するクレーム推論におけるデュアル属性と検証のためのフレームワーク
Authors: Vipula Rawte, Ryan Rossi, Franck Dernoncourt, Nedim Lipka,
Abstract要約: 大規模言語モデル(LLM)は、広範囲のNLPタスクにおいて顕著な流速と汎用性を示してきたが、実際的な不正確さと制限が伴う傾向にある。これは、信頼と妥当性が最優先される医療、法律、科学コミュニケーションといった、高リスク領域に重大なリスクをもたらす。 LLM出力の事実的信頼性と解釈可能性を高めるために設計されたDAVinCI - Dual Attribution and Verificationフレームワークを紹介する。
参考スコア（独自算出の注目度）: 53.279391576560755
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Large Language Models (LLMs) have demonstrated remarkable fluency and versatility across a wide range of NLP tasks, yet they remain prone to factual inaccuracies and hallucinations. This limitation poses significant risks in high-stakes domains such as healthcare, law, and scientific communication, where trust and verifiability are paramount. In this paper, we introduce DAVinCI - a Dual Attribution and Verification framework designed to enhance the factual reliability and interpretability of LLM outputs. DAVinCI operates in two stages: (i) it attributes generated claims to internal model components and external sources; (ii) it verifies each claim using entailment-based reasoning and confidence calibration. We evaluate DAVinCI across multiple datasets, including FEVER and CLIMATE-FEVER, and compare its performance against standard verification-only baselines. Our results show that DAVinCI significantly improves classification accuracy, attribution precision, recall, and F1-score by 5-20%. Through an extensive ablation study, we isolate the contributions of evidence span selection, recalibration thresholds, and retrieval quality. We also release a modular DAVinCI implementation that can be integrated into existing LLM pipelines. By bridging attribution and verification, DAVinCI offers a scalable path to auditable, trustworthy AI systems. This work contributes to the growing effort to make LLMs not only powerful but also accountable.
Abstract（参考訳）: 大規模言語モデル (LLM) は、広範囲のNLPタスクにおいて顕著な流布性と汎用性を示してきたが、実際的な不正確さや幻覚の傾向は残されている。この制限は、信頼と妥当性が最優先される医療、法律、科学コミュニケーションといった、高リスク領域において重大なリスクをもたらす。本稿では,LDM出力の事実的信頼性と解釈可能性を高めるために設計されたDAVinCI(Dual Attribution and Verification)フレームワークについて紹介する。 DAVinCIは以下の2段階で動作する。 i) 内部モデルコンポーネント及び外部ソースに生成されたクレームを属性とする。二エンテーメントに基づく推論及び信頼度校正により各クレームを検証すること。我々は、FEVERやCLIMATE-FEVERを含む複数のデータセットにわたるDAVinCIを評価し、その性能を標準検証のみのベースラインと比較した。以上の結果から,DAVinCIは分類精度,属性精度,リコール,F1スコアを5～20%向上させることがわかった。広範囲にわたるアブレーション研究を通じて, 証拠の寄与は, 選別, 校正基準, 検索品質にまたがる。既存のLLMパイプラインに統合可能なモジュール型DAVinCI実装もリリースしています。属性と検証をブリッジすることで、DAVinCIは監査可能な信頼性の高いAIシステムへのスケーラブルなパスを提供する。この作業は、LSMを強力にだけでなく、説明責任も持てるものにする努力の増加に寄与します。

論文の概要: Trust but Verify: Introducing DAVinCI -- A Framework for Dual Attribution and Verification in Claim Inference for Language Models

関連論文リスト