Fugu-MT 論文翻訳(概要): Uncertainty-Aware Attention Heads: Efficient Unsupervised Uncertainty Quantification for LLMs

論文の概要: Uncertainty-Aware Attention Heads: Efficient Unsupervised Uncertainty Quantification for LLMs

arxiv url: http://arxiv.org/abs/2505.20045v1
Date: Mon, 26 May 2025 14:28:37 GMT
ステータス: 翻訳完了
システム内更新日: 2025-05-27 16:58:43.511312
Title: Uncertainty-Aware Attention Heads: Efficient Unsupervised Uncertainty Quantification for LLMs
Title（参考訳）: 不確かさを意識した注意頭:LLMの効率的な教師なし不確実性定量化
Authors: Artem Vazhentsev, Lyudmila Rvanova, Gleb Kuzmin, Ekaterina Fadeeva, Ivan Lazichny, Alexander Panchenko, Maxim Panov, Timothy Baldwin, Mrinmaya Sachan, Preslav Nakov, Artem Shelmanov,
Abstract要約: 大型言語モデル(LLM)は、顕著な流布を示すが、しばしば「幻覚」として知られる致命的な誤りを引き起こす。本稿では,非教師的アプローチであるRAUQ(Recurrent Attention-based Uncertainty Quantification)を提案する。 4つのLLMと12の質問応答、要約、翻訳タスクにわたる実験は、RAUQが優れた結果をもたらすことを示した。
参考スコア（独自算出の注目度）: 114.46334319795785
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Large language models (LLMs) exhibit impressive fluency, but often produce critical errors known as "hallucinations". Uncertainty quantification (UQ) methods are a promising tool for coping with this fundamental shortcoming. Yet, existing UQ methods face challenges such as high computational overhead or reliance on supervised learning. Here, we aim to bridge this gap. In particular, we propose RAUQ (Recurrent Attention-based Uncertainty Quantification), an unsupervised approach that leverages intrinsic attention patterns in transformers to detect hallucinations efficiently. By analyzing attention weights, we identified a peculiar pattern: drops in attention to preceding tokens are systematically observed during incorrect generations for certain "uncertainty-aware" heads. RAUQ automatically selects such heads, recurrently aggregates their attention weights and token-level confidences, and computes sequence-level uncertainty scores in a single forward pass. Experiments across 4 LLMs and 12 question answering, summarization, and translation tasks demonstrate that RAUQ yields excellent results, outperforming state-of-the-art UQ methods using minimal computational overhead (<1% latency). Moreover, it requires no task-specific labels and no careful hyperparameter tuning, offering plug-and-play real-time hallucination detection in white-box LLMs.
Abstract（参考訳）: 大型言語モデル(LLM)は、顕著な流感を示すが、しばしば「幻覚」と呼ばれる致命的な誤りを引き起こす。不確実性定量化(UQ)手法はこの根本的な欠点に対処するための有望なツールである。しかし、既存のUQ手法では、高い計算オーバーヘッドや教師あり学習への依存といった課題に直面している。ここでは、このギャップを埋めることを目指しています。特にRAUQ(Recurrent Attention-based Uncertainty Quantification)を提案する。これは、トランスフォーマにおける固有の注意パターンを利用して、幻覚を効率的に検出する教師なしの手法である。注意重みを解析することにより、特定の「不確かさを認識した」頭部に対して、前者のトークンに対する注意の低下が、不正確な世代の間に体系的に観察されるという特異なパターンを同定した。 RAUQはこれらのヘッドを自動的に選択し、注意重みとトークンレベルの信頼度を繰り返し集計し、シーケンスレベルの不確実性スコアを1回のフォワードパスで計算する。 4つのLLMと12の質問応答、要約、翻訳タスクにわたる実験により、RAUQは計算オーバーヘッドを最小限に抑える(1%レイテンシ)、最先端のUQ手法よりも優れた結果が得られることが示された。さらに、タスク固有のラベルを必要とせず、注意深いハイパーパラメータチューニングも必要とせず、ホワイトボックスLSMのプラグアンドプレイによる幻覚検出を提供する。

論文の概要: Uncertainty-Aware Attention Heads: Efficient Unsupervised Uncertainty Quantification for LLMs

関連論文リスト