Fugu-MT 論文翻訳(概要): Unmasking Hallucinations: A Causal Graph-Attention Perspective on Factual Reliability in Large Language Models

論文の概要: Unmasking Hallucinations: A Causal Graph-Attention Perspective on Factual Reliability in Large Language Models

arxiv url: http://arxiv.org/abs/2604.04020v1
Date: Sun, 05 Apr 2026 08:37:36 GMT
ステータス: 翻訳完了
システム内更新日: 2026-04-07 15:49:18.879269
Title: Unmasking Hallucinations: A Causal Graph-Attention Perspective on Factual Reliability in Large Language Models
Title（参考訳）: 幻覚を解き放つ:大規模言語モデルにおける現実的信頼性に関する因果グラフ的視点
Authors: Sailesh kiran kurra, Shiek Ruksana, Vishal Borusu,
Abstract要約: 幻覚は、医学的診断や法的推論のようなシナリオで深刻な問題を引き起こす。本稿では,トランスアーキテクチャ内における内部の注意フローの解釈を通じて幻覚を減少させる因果グラフ注意ネットワーク(GCAN)フレームワークを提案する。 TruthfulQAやHotpotQAといった標準ベンチマークの実験では、幻覚率が27.8%低下し、実際の精度が16.4%向上した。
参考スコア（独自算出の注目度）: 0.0
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: This paper primarily focuses on the hallucinations caused due to AI language models(LLMs).LLMs have shown extraordinary Language understanding and generation capabilities .Still it has major a disadvantage hallucinations which give outputs which are factually incorrect ,misleading or unsupported by input data . These hallucinations cause serious problems in scenarios like medical diagnosis or legal reasoning.Through this work,we propose causal graph attention network (GCAN) framework that reduces hallucinations through interpretation of internal attention flow within a transformer architecture with the help of constructing token level graphs that combine self attention weights and gradient based influence scores.our method quantifies each tokens factual dependency using a new metric called the Causal Contribution Score (CCS). We further introduce a fact-anchored graph reweighting layer that dynamically reduces the influence of hallucination prone nodes during generation. Experiments on standard benchmarks such as TruthfulQA and HotpotQA show a 27.8 percent reduction in hallucination rate and 16.4 percent improvement in factual accuracy over baseline retrieval-augmented generation (RAG) models. This work contributes to the interpretability,robustness, and factual reliability of future LLM architectures.
Abstract（参考訳）: 本稿では主に,AI言語モデル(LLM)による幻覚に焦点を当てる。 LLMは並外れた言語理解と生成能力を示している。いまだに、入力データによって非推奨または非推奨の、事実的に誤った出力を与える不利な幻覚がある。これらの幻覚は、医学的診断や法的推論などのシナリオにおいて深刻な問題を引き起こすが、本研究では、自己の注意重みと勾配に基づく影響スコアを組み合わせたトークンレベルグラフの構築の支援により、トランスフォーマーアーキテクチャ内の内部の注意フローの解釈を通じて幻覚を低減する因果グラフ注意ネットワーク(GCAN)フレームワークを提案する。さらに、生成中の幻覚傾向ノードの影響を動的に低減するファクトアンコレッドグラフ再重み付け層を導入する。 TruthfulQAやHotpotQAといった標準ベンチマークの実験では、幻覚率が27.8%低下し、16.4%の精度でRAGモデルよりも精度が向上した。この研究は、将来のLLMアーキテクチャの解釈可能性、ロバスト性、および現実的な信頼性に寄与する。

論文の概要: Unmasking Hallucinations: A Causal Graph-Attention Perspective on Factual Reliability in Large Language Models

関連論文リスト