Fugu-MT 論文翻訳(概要): Uncertainty-Aware Transformers: Conformal Prediction for Language Models

論文の概要: Uncertainty-Aware Transformers: Conformal Prediction for Language Models

arxiv url: http://arxiv.org/abs/2604.08885v1
Date: Fri, 10 Apr 2026 02:48:50 GMT
ステータス: 翻訳完了
システム内更新日: 2026-04-13 17:57:53.648649
Title: Uncertainty-Aware Transformers: Conformal Prediction for Language Models
Title（参考訳）: 不確かさを意識した変圧器:言語モデルの等角予測
Authors: Abhiram Vellore, Niraj K. Jha,
Abstract要約: 本稿ではトランスフォーマーに基づく言語モデルのための不確実な定量化フレームワークを提案する。 CONFIDEは、BERTやRoBERTaのようなエンコーダのみのアーキテクチャの内部埋め込みに対して、整合予測を適用する。初期および中期のトランスフォーマー層は、しばしば共形予測のためのより良いキャリブレーションと意味論的意味のある表現をもたらす。
参考スコア（独自算出の注目度）: 4.464939140209426
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Transformers have had a profound impact on the field of artificial intelligence, especially on large language models and their variants. However, as was the case with neural networks, their black-box nature limits trust and deployment in high-stakes settings. For models to be genuinely useful and trustworthy in critical applications, they must provide more than just predictions: they must supply users with a clear understanding of the reasoning that underpins their decisions. This article presents an uncertainty quantification framework for transformer-based language models. This framework, called CONFIDE (CONformal prediction for FIne-tuned DEep language models), applies conformal prediction to the internal embeddings of encoder-only architectures, like BERT and RoBERTa, while enabling hyperparameter tuning. CONFIDE uses either [CLS] token embeddings or flattened hidden states to construct class-conditional nonconformity scores, enabling statistically valid prediction sets with instance-level explanations. Empirically, CONFIDE improves test accuracy by up to 4.09% on BERT-tiny and achieves greater correct efficiency (i.e., the expected size of the prediction set conditioned on it containing the true label) compared to prior methods, including NM2 and VanillaNN. We show that early and intermediate transformer layers often yield better-calibrated and more semantically meaningful representations for conformal prediction. In resource-constrained models and high-stakes tasks with ambiguous labels, CONFIDE offers robustness and interpretability where softmax-based uncertainty fails. We position CONFIDE as a framework for practical diagnostic and efficiency/robustness improvement over prior conformal baselines.
Abstract（参考訳）: トランスフォーマーは人工知能の分野、特に大きな言語モデルとその変種に大きな影響を与えた。しかしながら、ニューラルネットワークの場合と同様に、ブラックボックスの性質は信頼と高レベルな設定でのデプロイメントを制限する。重要なアプリケーションで真に有用で信頼できるモデルには、単なる予測以上のものを提供することが必要だ。本稿ではトランスフォーマーに基づく言語モデルのための不確実な定量化フレームワークを提案する。 CONFIDE(conformal prediction for FIne-tuned DEep language model)と呼ばれるこのフレームワークは、ハイパーパラメータチューニングを実現しつつ、BERTやRoBERTaのようなエンコーダのみのアーキテクチャの内部埋め込みに対して、コンフォーマルな予測を適用している。 CONFIDEは[CLS]トークンの埋め込みまたはフラット化された隠された状態を使用して、クラス条件の非整合性スコアを構築し、インスタンスレベルの説明を伴う統計的に有効な予測セットを可能にします。実証的に、CONFIDEはBERT-tinyの試験精度を最大4.09%向上させ、NM2やVanillaNNを含む従来の方法と比較して高い精度(すなわち、真のラベルを含む予測セットの期待サイズ)を達成する。初期および中期の変圧器層は、共形予測のためのより良い校正と意味論的意味のある表現をしばしば得ることを示す。資源制約付きモデルと曖昧なラベル付きハイテイクタスクにおいて、CONFIDEはソフトマックスベースの不確実性が失敗する堅牢性と解釈性を提供する。本研究では,ConFIDEを従来の整合性ベースラインよりも実用的な診断・効率・ロバスト性向上のためのフレームワークとして位置づける。

論文の概要: Uncertainty-Aware Transformers: Conformal Prediction for Language Models

関連論文リスト