Fugu-MT 論文翻訳(概要): Exploring and Mitigating Fawning Hallucinations in Large Language Models

論文の概要: Exploring and Mitigating Fawning Hallucinations in Large Language Models

arxiv url: http://arxiv.org/abs/2509.00869v1
Date: Sun, 31 Aug 2025 14:29:54 GMT
ステータス: 翻訳完了
システム内更新日: 2025-09-04 15:17:03.437426
Title: Exploring and Mitigating Fawning Hallucinations in Large Language Models
Title（参考訳）: 大規模言語モデルにおけるfawning Hallucinationの探索と緩和
Authors: Zixuan Shangguan, Yanjie Dong, Lanjun Wang, Xiaoyi Fan, Victor C. M. Leung, Xiping Hu,
Abstract要約: 各種自然言語処理タスクにおけるハエの幻覚の分析を行う。我々は、フェニング・ハロシン化緩和のための、いわゆるコントラストデコーディング手法をカスタマイズする。
参考スコア（独自算出の注目度）: 52.444712272909435
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Large language models (LLMs) have demonstrated exceptional proficiency in language understanding. However, when LLMs align their outputs with deceptive and/or misleading prompts, the generated responses could deviate from the de facto information. Such observations are known as fawning hallucinations, where the model prioritizes alignment with the input's implied perspective over accuracy and truthfulness. In this work, we analyze fawning hallucinations in various natural language processing tasks and tailor the so-termed contrastive decoding method for fawning-hallucination mitigation. Specifically, we design two paradigms to generate corresponding deceptive and/or misleading inputs for the consistent fawning hallucinations induction. Then, we propose the collaborative contrastive decoding (CCD) to handle the fawning hallucinations across different tasks in LLMs. By contrasting the deviation in output distribution between induced and transformed neutral inputs, the proposed CCD can reduce reliance on deceptive and/or misleading information without requiring additional training. Extensive experiments demonstrate that the proposed CCD can effectively mitigate fawning hallucinations and improve the factuality of the generated responses over various tasks.
Abstract（参考訳）: 大規模言語モデル(LLM)は、言語理解において非常に優れた能力を示している。しかし、LCMが出力を偽りや誤解を招くプロンプトと一致させると、生成された応答はデファクト情報から逸脱する可能性がある。このような観察はfawning Hallucinationsと呼ばれ、モデルが正確さと真理性よりも入力の暗黙の視点との整合を優先する。本研究では,様々な自然言語処理タスクにおけるfawning Hallucinationを分析し,fawning-hallucinationの緩和のためのコントラスト復号法をカスタマイズする。具体的には、2つのパラダイムを設計し、一貫したfawning Hallucinationインジェクションに対して、それに対応する偽りの入力および/または誤解を招く入力を生成する。そこで本研究では,LLM における様々なタスクにまたがる幻覚を扱うために,協調的コントラスト復号法 (CCD) を提案する。誘導された中性入力と変換された中性入力の出力分布のずれを対照的に、提案したCCDは、追加のトレーニングを必要とせず、偽情報および/または誤誘導情報への依存を減らすことができる。広汎な実験により、提案したCCDは、ハエの幻覚を効果的に軽減し、様々なタスクにおいて生成された反応の事実性を改善することができることが示された。

論文の概要: Exploring and Mitigating Fawning Hallucinations in Large Language Models

関連論文リスト