Fugu-MT Paper Translation (Abstract): Calibrating LLM Judges: Linear Probes for Fast and Reliable Uncertainty Estimation

Paper overview: Calibrating LLM Judges: Linear Probes for Fast and Reliable Uncertainty Estimation

arxiv url: http://arxiv.org/abs/2512.22245v1
Date: Tue, 23 Dec 2025 22:08:46 GMT
Status: Translation complete
Last updated in system: 2025-12-30 22:37:29.928329
Title: Calibrating LLM Judges: Linear Probes for Fast and Reliable Uncertainty Estimation
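Title (reference translation): Calibrating LLM Judges: Linear Probes for Fast and Reliable Uncertainty Estimation
Authors: Bhaktipriya Radharapu, Eshika Saxena, Kenneth Li, Chenxi Whitehouse, Adina Williams, Nicola Cancedda
Abstract summary: This paper introduces linear probes trained with a Brier score-based loss that provide calibrated uncertainty estimates from the judges' hidden states. The approach is evaluated on both objective tasks (reasoning, mathematics, factuality, coding) and subjective human preference judgments.
Reference score (independently computed attention score): 25.80946316489521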
License: http://creativecommons.org/licenses/by/4.0/
Abstract: As LLM-based judges become integral to industry applications, obtaining well-calibrated uncertainty estimates efficiently has become critical for production deployment. However, existing techniques, such as verbalized confidence and multi-generation methods, are often either poorly calibrated or computationally expensive. We introduce linear probes trained with a Brier score-based loss to provide calibrated uncertainty estimates from reasoning judges' hidden states, requiring no additional model training. We evaluate our approach on both objective tasks (reasoning, mathematics, factuality, coding) and subjective human preference judgments. Our results demonstrate that probes achieve superior calibration compared to existing methods with $\approx10$x computational savings, generalize robustly to unseen evaluation domains, and deliver higher accuracy on high-confidence predictions. However, probes produce conservative estimates that underperform on easier datasets but may benefit safety-critical deployments prioritizing low false-positive rates. Overall, our work demonstrates that interpretability-based uncertainty estimation provides a practical and scalable plug-and-play solution for LLM judges in production.
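Abstract (reference translation): As LLM-based judges become essential to industry applications, efficiently obtaining well-calibrated uncertainty estimates has become important for production deployment. However, existing techniques such as verbalized confidence and multi-generation methods are often either poorly calibrated or computationally expensive. This paper introduces linear probes trained with a Brier score-based loss that estimate calibrated uncertainty from the hidden states of reasoning judges, with no additional model training required. The approach is evaluated on both objective tasks (reasoning, mathematics, factuality, coding) and subjective human preference judgments. The results show that the probes are better calibrated than existing methods at roughly 10x lower computational cost, generalize robustly to unseen evaluation domains, and are more accurate on high-confidence predictions. However, the probes produce conservative estimates, which underperform on easier datasets but may benefit safety-critical deployments that prioritize low false-positive rates. Overall, the work shows that interpretability-based uncertainty estimation offers a practical and scalable plug-and-play solution for LLM judges in production.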
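
The core idea in the abstract, a linear probe trained with a Brier score-based loss, can be sketched as follows. This is a minimal illustration and not the authors' implementation: the random feature matrix `X` stands in for the judge's hidden states, the labels `y` stand in for whether the judge's verdict was correct, and the names `train_brier_probe` and `brier_score` are hypothetical. The abstract does not say which layer or token position the hidden states come from, nor how the probe is optimized, so plain gradient descent is assumed here.

```python
import numpy as np


def sigmoid(z):
    """Map logits to probabilities in (0, 1)."""
    return 1.0 / (1.0 + np.exp(-z))


def brier_score(p, y):
    """Mean squared error between predicted probabilities and 0/1 labels."""
    return float(np.mean((p - y) ** 2))


def train_brier_probe(X, y, lr=0.5, steps=2000):
    """Fit p = sigmoid(X @ w + b) by gradient descent on the Brier score.

    X: (n, d) features (assumed stand-in for hidden states).
    y: (n,) labels in {0, 1} (assumed: 1 if the judge was correct).
    """
    n, d = X.shape
    w = np.zeros(d)
    b = 0.0
    for _ in range(steps):
        p = sigmoid(X @ w + b)
        # Gradient of (p - y)^2 with respect to the logit: 2 (p - y) p (1 - p).
        g = 2.0 * (p - y) * p * (1.0 - p)
        w -= lr * (X.T @ g) / n
        b -= lr * g.mean()
    return w, b


# Synthetic data: labels are drawn from a linear-logistic model of the features.
rng = np.random.default_rng(0)
d = 16
true_w = rng.normal(size=d)
X = rng.normal(size=(2000, d))
y = (rng.random(2000) < sigmoid(X @ true_w)).astype(float)

X_train, y_train = X[:1500], y[:1500]
X_test, y_test = X[1500:], y[1500:]

w, b = train_brier_probe(X_train, y_train)
p_test = sigmoid(X_test @ w + b)

# Baseline: always predict the training-set base rate.
baseline = brier_score(np.full(len(y_test), y_train.mean()), y_test)
probe = brier_score(p_test, y_test)
print(f"baseline Brier: {baseline:.3f}, probe Brier: {probe:.3f}")
```

In a real setting the rows of `X` would be activations extracted from the judge model in a single forward pass. That is presumably where the savings over multi-generation methods come from, though the abstract gives no further detail.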

Related papers