Fugu-MT 論文翻訳(概要): Auditability and the Landscape of Distance to Multicalibration

論文の概要: Auditability and the Landscape of Distance to Multicalibration

arxiv url: http://arxiv.org/abs/2509.16930v1
Date: Sun, 21 Sep 2025 05:26:48 GMT
ステータス: 翻訳完了
システム内更新日: 2025-09-23 18:58:16.039974
Title: Auditability and the Landscape of Distance to Multicalibration
Title（参考訳）: マルチキャリブレーションにおける距離の可聴性とランドスケープ
Authors: Nathan Derhake, Siddartha Devic, Dutch Hansen, Kuan Liu, Vatsal Sharan,
Abstract要約: 予測器の多重校正までの距離の監査性は$f$である。本研究は,より強力なマルチキャリブレーションアルゴリズムの開発に影響を及ぼす可能性がある。
参考スコア（独自算出の注目度）: 14.895808794051568
License: http://creativecommons.org/licenses/by-nc-nd/4.0/
Abstract: Calibration is a critical property for establishing the trustworthiness of predictors that provide uncertainty estimates. Multicalibration is a strengthening of calibration which requires that predictors be calibrated on a potentially overlapping collection of subsets of the domain. As multicalibration grows in popularity with practitioners, an essential question is: how do we measure how multicalibrated a predictor is? B{\l}asiok et al. (2023) considered this question for standard calibration by introducing the distance to calibration framework (dCE) to understand how calibration metrics relate to each other and the ground truth. Building on the dCE framework, we consider the auditability of the distance to multicalibration of a predictor $f$. We begin by considering two natural generalizations of dCE to multiple subgroups: worst group dCE (wdMC), and distance to multicalibration (dMC). We argue that there are two essential properties of any multicalibration error metric: 1) the metric should capture how much $f$ would need to be modified in order to be perfectly multicalibrated; and 2) the metric should be auditable in an information theoretic sense. We show that wdMC and dMC each fail to satisfy one of these two properties, and that similar barriers arise when considering the auditability of general distance to multigroup fairness notions. We then propose two (equivalent) multicalibration metrics which do satisfy these requirements: 1) a continuized variant of dMC; and 2) a distance to intersection multicalibration, which leans on intersectional fairness desiderata. Along the way, we shed light on the loss-landscape of distance to multicalibration and the geometry of the set of perfectly multicalibrated predictors. Our findings may have implications for the development of stronger multicalibration algorithms as well as multigroup auditing more generally.
Abstract（参考訳）: キャリブレーションは、不確実性推定を提供する予測者の信頼性を確立するための重要な特性である。マルチキャリブレーション(Multicalibration)は、ドメインの重複するサブセットの集合に対して、予測器をキャリブレーションする必要があるキャリブレーションの強化である。マルチキャリブレーション(multicalibration)が実践者の間で人気を増す中、重要な疑問は、どのように予測器がマルチキャリブレーションされているかを測定するかである。 B{\l}asiok et al (2023) は、キャリブレーションの指標が互いにどのように関係しているかを理解するために、キャリブレーションフレームワーク (dCE) に距離を導入することで、標準キャリブレーションの問題を検討した。 dCEフレームワークに基づいて、予測器の多重校正までの距離の監査可能性を考える。まず、最低群 dCE (wdMC) と多重校正距離 (dMC) の2つの自然一般化を考える。我々は、任意の多重校正誤差計量には2つの重要な性質があると主張する。 1) メトリックは、完全にマルチキャリブレーションされるために、どれだけの$f$を変更する必要があるかをキャプチャする必要がある。 2) 計量は情報理論的な意味で監査可能であるべきである。我々は、wdMC と dMC がこれら2つの特性の1つを満たすことができず、また、多群フェアネスの概念に対する一般的な距離の監査可能性を考えると、同様の障壁が生じることを示した。次に、これらの要件を満たす2つの(等価な)多重校正指標を提案する。 1) 連続したdMCの変種,及び 2) 交叉の多校正距離は交叉のフェアネス・デシダラタに依存する。その過程で、我々は、多重校正への距離の損失ランドスケープと、完全に多重校正された予測器の集合の幾何学に光を当てた。本研究は,より強力なマルチキャリブレーションアルゴリズムの開発や,より一般的なマルチグループ監査に影響を及ぼす可能性がある。

論文の概要: Auditability and the Landscape of Distance to Multicalibration

関連論文リスト