Fugu-MT Paper Translation (Abstract): Label-Free Detection of Governance Evidence Degradation in Risk Decision Systems

Paper summary: Label-Free Detection of Governance Evidence Degradation in Risk Decision Systems

arxiv url: http://arxiv.org/abs/2604.17836v1
Date: Mon, 20 Apr 2026 05:46:15 GMT
Status: Translation complete
Last updated in system: 2026-04-21 21:52:52.714655
Title: Label-Free Detection of Governance Evidence Degradation in Risk Decision Systems
Authors: Oleg Solozobov
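Abstract summary: Risk decision systems in fraud detection and credit scoring operate under structural label absence. Existing frameworks do not integrate drift detection with governance evidence assessment and operational response. This paper presents a label-free governance monitoring extension to the Governance Drift Toolkit.
Reference score (attention score computed independently by this site): 0.0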
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Risk decision systems in fraud detection and credit scoring operate under structural label absence: ground truth arrives weeks to months after decisions are made. During this blind period, model performance may degrade silently, eroding the governance evidence that justifies automated decisions. Existing drift detection methods either require labels (supervised detectors) or detect statistical change without distinguishing harmful degradation from benign distributional evolution (unsupervised detectors). No existing framework integrates drift detection with governance evidence assessment and operational response. This paper presents a label-free governance monitoring extension to the Governance Drift Toolkit that produces governance alerts rather than statistical alarms. The monitoring architecture applies composite multi-proxy monitoring across four proxy monitors (score distribution, feature drift, prediction entropy, confidence distribution), with governance-calibrated thresholds. Empirical evaluation on the Lending Club credit scoring dataset (1.37M loans, 11 years) demonstrates three findings. First, raw proxy metrics (Feature PSI delta up to 1.84, Score PSI delta up to 0.92) distinguish injected covariate degradation from natural temporal drift in an offline evaluation setting. Second, pure concept drift in P(Y|X) produces exactly zero delta across all proxy metrics in all windows, confirming the irreducible blind spot of label-free monitoring as a structural verification. Third, the composite score provides monotonic severity progression as more monitors trigger (0.583 to 0.833 to 1.000), enabling graduated governance response. Cross-domain comparison with IEEE-CIS fraud detection results shows the detectable/undetectable boundary is consistent across both domains. The toolkit and evaluation code are available as open-source artifacts.
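The abstract reports Feature PSI and Score PSI values but does not spell out how they are computed. As a rough illustration only, the following is a minimal sketch of the standard Population Stability Index, assuming quantile bins taken from the reference window and a small floor on empty bins. The function name `psi`, the bin count, and the synthetic Gaussian data are all assumptions made for this example and are not taken from the Governance Drift Toolkit.

```python
import bisect
import math
import random


def psi(reference, current, n_bins=10, eps=1e-6):
    """Population Stability Index between two samples of one numeric variable.

    Bin edges are quantiles of the reference sample (an assumption; the
    abstract does not describe the binning scheme the paper uses).
    """
    ref_sorted = sorted(reference)
    # Interior cut points at the reference quantiles.
    edges = [ref_sorted[len(ref_sorted) * k // n_bins] for k in range(1, n_bins)]

    def bin_fractions(values):
        counts = [0] * n_bins
        for v in values:
            counts[bisect.bisect_right(edges, v)] += 1
        # Floor each fraction at eps so the logarithm is always defined.
        return [max(c / len(values), eps) for c in counts]

    p = bin_fractions(reference)
    q = bin_fractions(current)
    return sum((qi - pi) * math.log(qi / pi) for pi, qi in zip(p, q))


# Synthetic data standing in for a monitored score or feature.
rng = random.Random(0)
baseline = [rng.gauss(0.0, 1.0) for _ in range(5000)]
same_distribution = [rng.gauss(0.0, 1.0) for _ in range(5000)]
shifted = [rng.gauss(1.0, 1.0) for _ in range(5000)]

psi_stable = psi(baseline, same_distribution)
psi_shifted = psi(baseline, shifted)
print(f"PSI, unchanged distribution: {psi_stable:.4f}")
print(f"PSI, mean shifted by one sigma: {psi_shifted:.4f}")
```

A window drawn from the same distribution should give a PSI near zero, while the shifted window gives a much larger value. Thresholds for turning such values into governance alerts would need to be calibrated as the paper describes and are not reproduced here.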
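The second finding in the abstract, that pure concept drift in P(Y|X) produces exactly zero delta in every proxy metric, follows from the fact that label-free proxies are functions of the inputs and model outputs only. The sketch below illustrates that structural point with a hypothetical frozen logistic model on one synthetic feature. The names `model_score`, `proxy_metrics`, and `accuracy` are assumptions for this example and do not come from the paper or its toolkit.

```python
import math
import random


def model_score(x):
    """Hypothetical frozen model: a logistic score on a single feature."""
    return 1.0 / (1.0 + math.exp(-2.0 * x))


def binary_entropy(p):
    """Entropy (in nats) of a Bernoulli prediction with probability p."""
    if p <= 0.0 or p >= 1.0:
        return 0.0
    return -(p * math.log(p) + (1.0 - p) * math.log(1.0 - p))


def proxy_metrics(features):
    """Label-free proxies: they depend on features and scores, never on labels."""
    scores = [model_score(x) for x in features]
    return {
        "mean_score": sum(scores) / len(scores),
        "mean_entropy": sum(binary_entropy(s) for s in scores) / len(scores),
    }


def accuracy(features, labels):
    """Supervised metric, only computable once ground truth has arrived."""
    hits = sum((model_score(x) >= 0.5) == bool(y) for x, y in zip(features, labels))
    return hits / len(labels)


rng = random.Random(42)
features = [rng.gauss(0.0, 1.0) for _ in range(2000)]

# Before drift: labels follow the relationship the model has learned.
labels_before = [int(rng.random() < model_score(x)) for x in features]
# Pure concept drift: identical inputs, but P(Y|X) is inverted.
labels_after = [1 - y for y in labels_before]

proxies_before = proxy_metrics(features)
proxies_after = proxy_metrics(features)  # same inputs, so same proxies
acc_before = accuracy(features, labels_before)
acc_after = accuracy(features, labels_after)

print("proxy delta:", {k: proxies_after[k] - proxies_before[k] for k in proxies_before})
print(f"accuracy before: {acc_before:.3f}, after: {acc_after:.3f}")
```

Accuracy changes sharply while every proxy delta stays at zero, because nothing the monitors can observe has changed. This is a toy illustration of the blind spot, not a reproduction of the paper's Lending Club experiment.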

Related papers