Fugu-MT 論文翻訳(概要): Mitigating Hallucinations in Large Language Models via Causal Reasoning

論文の概要: Mitigating Hallucinations in Large Language Models via Causal Reasoning

arxiv url: http://arxiv.org/abs/2508.12495v1
Date: Sun, 17 Aug 2025 20:51:06 GMT
ステータス: 翻訳完了
システム内更新日: 2025-08-19 14:49:10.802052
Title: Mitigating Hallucinations in Large Language Models via Causal Reasoning
Title（参考訳）: 因果推論による大規模言語モデルにおける幻覚の緩和
Authors: Yuangang Li, Yiqing Shen, Yi Nian, Jiechao Gao, Ziyi Wang, Chenxiao Yu, Shawn Li, Jie Wang, Xiyang Hu, Yue Zhao,
Abstract要約: 大規模言語モデル(LLM)は論理的に矛盾した幻覚を示し、一貫性があるように見えるが、理性原理に反する。我々は因果DAG構築と推論(CDCR-SFT)を紹介する。 CDCR-SFT は LLM を訓練し、変数レベル指向非巡回グラフ (DAG) を明示的に構築し、その上で推論を行う。
参考スコア（独自算出の注目度）: 16.830640147232295
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Large language models (LLMs) exhibit logically inconsistent hallucinations that appear coherent yet violate reasoning principles, with recent research suggesting an inverse relationship between causal reasoning capabilities and such hallucinations. However, existing reasoning approaches in LLMs, such as Chain-of-Thought (CoT) and its graph-based variants, operate at the linguistic token level rather than modeling the underlying causal relationships between variables, lacking the ability to represent conditional independencies or satisfy causal identification assumptions. To bridge this gap, we introduce causal-DAG construction and reasoning (CDCR-SFT), a supervised fine-tuning framework that trains LLMs to explicitly construct variable-level directed acyclic graph (DAG) and then perform reasoning over it. Moreover, we present a dataset comprising 25,368 samples (CausalDR), where each sample includes an input question, explicit causal DAG, graph-based reasoning trace, and validated answer. Experiments on four LLMs across eight tasks show that CDCR-SFT improves the causal reasoning capability with the state-of-the-art 95.33% accuracy on CLADDER (surpassing human performance of 94.8% for the first time) and reduces the hallucination on HaluEval with 10% improvements. It demonstrates that explicit causal structure modeling in LLMs can effectively mitigate logical inconsistencies in LLM outputs. Code is available at https://github.com/MrLYG/CDCR-SFT.
Abstract（参考訳）: 大規模言語モデル (LLMs) は論理的に矛盾した幻覚を示し, 因果的推論能力とそのような幻覚との逆関係を示唆している。しかし、Chain-of-Thought (CoT) やグラフベースの変種のような LLM の既存の推論手法は、変数間の因果関係をモデル化するのではなく、言語トークンレベルで機能する。このギャップを埋めるために、我々は、LLMに可変レベル指向非巡回グラフ(DAG)を明示的に構築するよう訓練し、その上で推論を行う教師付き微調整フレームワークであるCDCR-SFT(Casal-DAG construction and reasoning)を導入する。さらに,25,368個のサンプルからなるデータセット(CausalDR)について,各サンプルは入力質問,明示的因果DAG,グラフベースの推論トレース,検証された回答を含む。 8つのタスクにわたる4つのLCMの実験では、CDCR-SFTはCLADDERの最先端95.33%の精度で因果推論能力を改善し、HaluEvalの幻覚を10%改善した。 LLM出力における論理的不整合を効果的に軽減できることを示す。コードはhttps://github.com/MrLYG/CDCR-SFTで入手できる。

論文の概要: Mitigating Hallucinations in Large Language Models via Causal Reasoning

関連論文リスト