Fugu-MT 論文翻訳(概要): Towards Generation-Efficient Uncertainty Estimation in Large Language Models

論文の概要: Towards Generation-Efficient Uncertainty Estimation in Large Language Models

arxiv url: http://arxiv.org/abs/2605.06053v1
Date: Thu, 07 May 2026 11:39:13 GMT
ステータス: 翻訳完了
システム内更新日: 2026-05-08 22:27:11.730764
Title: Towards Generation-Efficient Uncertainty Estimation in Large Language Models
Title（参考訳）: 大規模言語モデルにおける生成効率の良い不確かさ推定に向けて
Authors: Mingcheng Zhu, Yu Liu, Tingting Zhu,
Abstract要約: 医療やファイナンスといった高度なアプリケーションにLLMをデプロイする上で,不確実性評価が重要である。既存の手法では、不確実性を推定するために1つ以上の完全な自己回帰世代が必要である。部分生成や入力のみの情報を用いて有効不確実性を推定できるかどうかを検討する。
参考スコア（独自算出の注目度）: 6.111244917760012
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Uncertainty estimation is important for deploying LLMs in high-stakes applications such as healthcare and finance, where hallucinations can appear fluent and plausible while being factually incorrect, making it difficult for users to judge whether an output should be trusted. Existing methods require one or more full autoregressive generations to estimate uncertainty, which introduces substantial inference cost and often delays uncertainty assessment. In this paper, we investigate whether effective uncertainty estimation can be achieved with partial generation or even input-only information. Specifically, we first develop a unified framework that formulates uncertainty estimation as an early estimation problem over the autoregressive generation process of LLMs. This framework organises existing and proposed estimators by the information they observe, ranging from multi-generation to input-only prediction, and clarifies the performance-cost trade-off underlying different uncertainty estimation methods. Building on this view, we study two largely underexplored low-cost settings: estimating uncertainty with part of the generation, and predicting uncertainty from the input prompt. We propose Logit Magnitude, which uses top-M logit evidence to estimate uncertainty from an early-stopped generation prefix, and MetaUE, which distils generation-based uncertainty into a lightweight input-only estimator trained with uncertainty scores. Extensive experiments on general and domain-specific benchmarks show that Logit Magnitude achieves strong performance, and partial generations of LLMs are often sufficient for effective uncertainty estimation. MetaUE further provides a competitive input-only approximation in several settings. These findings suggest that effective uncertainty estimation requires less generation than commonly assumed, enabling unreliable responses to be identified earlier.
Abstract（参考訳）: 不確実性の推定は、医療や金融などの高度なアプリケーションにLCMを配置する上で重要である。既存の手法では、不確実性を推定するために1つ以上の完全な自己回帰世代が必要であり、これはかなりの推論コストをもたらし、しばしば不確実性評価を遅らせる。本稿では,部分生成や入出力のみの情報による有効不確実性推定が可能であるかを検討する。具体的には,LLMの自己回帰生成過程における初期推定問題として不確実性推定を定式化する統一フレームワークを開発する。このフレームワークは,複数世代から入力のみの予測まで,観測した情報に基づいて既存の推定器と提案した推定器を編成し,異なる不確実性推定手法に基づく性能・コストトレードオフを明らかにする。この観点から、我々は、主に過小評価されている2つの低コスト設定について、世代の一部を不確実性として推定し、入力プロンプトから不確実性を予測する。本稿では,最上位のロジットエビデンスを用いて早期に停止した生成プレフィックスから不確実性を推定するLogit Magnitudeと,不確実性スコアをトレーニングした軽量入力専用推定器に生成ベースの不確実性を悪用するMetaUEを提案する。一般およびドメイン固有のベンチマークに関する大規模な実験により、ロジトマグニチュードは高い性能を達成し、LLMの部分世代は効果的な不確実性推定に十分であることが示された。 MetaUEはさらに、いくつかの設定で競合する入力のみの近似を提供する。これらの結果から, 有効不確実性推定は一般的に想定されるよりも少ない生成を必要としており, 信頼できない応答を早期に特定できることが示唆された。

論文の概要: Towards Generation-Efficient Uncertainty Estimation in Large Language Models

関連論文リスト