Fugu-MT 論文翻訳(概要): Box-Constrained Softmax Function and Its Application for Post-Hoc Calibration

論文の概要: Box-Constrained Softmax Function and Its Application for Post-Hoc Calibration

arxiv url: http://arxiv.org/abs/2506.10572v1
Date: Thu, 12 Jun 2025 11:01:43 GMT
ステータス: 翻訳完了
システム内更新日: 2025-06-13 15:37:22.691649
Title: Box-Constrained Softmax Function and Its Application for Post-Hoc Calibration
Title（参考訳）: Box-Constrained Softmax関数とポストホック校正への応用
Authors: Kyohei Atarashi, Satoshi Oyama, Hiromi Arai, Hisashi Kashima,
Abstract要約: ソフトマックスモデルにおける出力確率の制御は、現代の機械学習において一般的な問題である。ボックス制約付きソフトマックス(mathrmBCSoftmax$)関数を提案する。これは$mathrmSoftmax$関数の新たな一般化であり、出力確率に対して下限と上限を明示的に強制する。
参考スコア（独自算出の注目度）: 29.222132137091368
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Controlling the output probabilities of softmax-based models is a common problem in modern machine learning. Although the $\mathrm{Softmax}$ function provides soft control via its temperature parameter, it lacks the ability to enforce hard constraints, such as box constraints, on output probabilities, which can be critical in certain applications requiring reliable and trustworthy models. In this work, we propose the box-constrained softmax ($\mathrm{BCSoftmax}$) function, a novel generalization of the $\mathrm{Softmax}$ function that explicitly enforces lower and upper bounds on output probabilities. While $\mathrm{BCSoftmax}$ is formulated as the solution to a box-constrained optimization problem, we develop an exact and efficient computation algorithm for $\mathrm{BCSoftmax}$. As a key application, we introduce two post-hoc calibration methods based on $\mathrm{BCSoftmax}$. The proposed methods mitigate underconfidence and overconfidence in predictive models by learning the lower and upper bounds of the output probabilities or logits after model training, thereby enhancing reliability in downstream decision-making tasks. We demonstrate the effectiveness of our methods experimentally using the TinyImageNet, CIFAR-100, and 20NewsGroups datasets, achieving improvements in calibration metrics.
Abstract（参考訳）: ソフトマックスモデルにおける出力確率の制御は、現代の機械学習において一般的な問題である。 $\mathrm{Softmax}$関数は温度パラメータによるソフトコントロールを提供するが、信頼性と信頼性のあるモデルを必要とする特定のアプリケーションにおいて重要な出力確率に対するボックス制約のような厳密な制約を強制する能力に欠ける。本研究では,出力確率に対する下限と上限を明確に強制する$\mathrm{BCSoftmax}$関数の新たな一般化であるボックス制約ソフトマックス(\mathrm{BCSoftmax}$)関数を提案する。ボックス制約付き最適化問題の解法として$\mathrm{BCSoftmax}$が定式化されているのに対し、$\mathrm{BCSoftmax}$に対する正確かつ効率的な計算アルゴリズムを開発する。鍵となる応用として、$\mathrm{BCSoftmax}$ に基づいた2つのポストホック校正法を導入する。提案手法は, モデル学習後の出力確率やロジットの上下境界を学習することにより, 予測モデルにおける不信感や過信感を軽減し, 下流意思決定タスクの信頼性を高める。我々はTinyImageNet, CIFAR-100, 20NewsGroupsデータセットを用いて, キャリブレーション精度の向上を図った。

論文の概要: Box-Constrained Softmax Function and Its Application for Post-Hoc Calibration

関連論文リスト