Fugu-MT 論文翻訳(概要): GrACE: A Generative Approach to Better Confidence Elicitation in Large Language Models

論文の概要: GrACE: A Generative Approach to Better Confidence Elicitation in Large Language Models

arxiv url: http://arxiv.org/abs/2509.09438v1
Date: Thu, 11 Sep 2025 13:25:40 GMT
ステータス: 翻訳完了
システム内更新日: 2025-09-12 16:52:24.396594
Title: GrACE: A Generative Approach to Better Confidence Elicitation in Large Language Models
Title（参考訳）: GrACE: 大規模言語モデルにおける信頼性向上のためのジェネレーティブアプローチ
Authors: Zhaohan Zhang, Ziquan Liu, Ioannis Patras,
Abstract要約: 信頼の喚起は、医療やファイナンスといった高度なアプリケーションにおいて、AIの安全性に対する顕著なアプローチである。既存の手法では、高価な計算オーバーヘッドを必要とするか、キャリブレーションの悪さに悩まされ、現実の展開には実用的でなく信頼性が低い。我々は,LLMのスケーラブルで信頼性の高い信頼性推論を可能にする,信頼緩和のためのジェネレーティブアプローチであるGrACEを提案する。
参考スコア（独自算出の注目度）: 37.3708358049548
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Assessing the reliability of Large Language Models (LLMs) by confidence elicitation is a prominent approach to AI safety in high-stakes applications, such as healthcare and finance. Existing methods either require expensive computational overhead or suffer from poor calibration, making them impractical and unreliable for real-world deployment. In this work, we propose GrACE, a Generative Approach to Confidence Elicitation that enables scalable and reliable confidence elicitation for LLMs. GrACE adopts a novel mechanism in which the model expresses confidence by the similarity between the last hidden state and the embedding of a special token appended to the vocabulary, in real-time. We fine-tune the model for calibrating the confidence with calibration targets associated with accuracy. Experiments with three LLMs and two benchmark datasets show that the confidence produced by GrACE achieves the best discriminative capacity and calibration on open-ended generation tasks, outperforming six competing methods without resorting to additional sampling or an auxiliary model. Moreover, we propose two strategies for improving test-time scaling based on confidence induced by GrACE. Experimental results show that using GrACE not only improves the accuracy of the final decision but also significantly reduces the number of required samples in the test-time scaling scheme, indicating the potential of GrACE as a practical solution for deploying LLMs with scalable, reliable, and real-time confidence estimation.
Abstract（参考訳）: 信頼誘導によるLLM(Large Language Models)の信頼性を評価することは、医療やファイナンスといった高度なアプリケーションにおいて、AI安全性に対する顕著なアプローチである。既存の手法では、高価な計算オーバーヘッドを必要とするか、キャリブレーションの悪さに悩まされ、現実の展開には実用的でなく信頼性が低い。本研究は,LLMのスケーラブルで信頼性の高い信頼性推論を実現するための,信頼緩和のためのジェネレーティブアプローチであるGrACEを提案する。 GrACEは、最後に隠された状態と語彙に付加された特別なトークンの埋め込みとの類似性によってモデルが信頼を表現する新しいメカニズムをリアルタイムで採用している。精度に付随する校正目標を用いて信頼性を校正するモデルを微調整する。 3つのLCMと2つのベンチマークデータセットを用いて実験したところ、GrACEが生成した信頼度は、追加サンプリングや補助モデルに頼らずに6つの競合する手法よりも優れた識別能力とキャリブレーションを達成することが示された。さらに,GrACEによる信頼度に基づくテスト時間スケーリングを改善するための2つの手法を提案する。実験結果から, GrACEは最終決定の精度を向上するだけでなく, テスト時間スケーリング方式で必要なサンプル数を大幅に削減し, 拡張性, 信頼性, リアルタイム信頼度を考慮したLCMの実用的ソリューションとしてのGrACEの可能性を示した。

論文の概要: GrACE: A Generative Approach to Better Confidence Elicitation in Large Language Models

関連論文リスト