Fugu-MT 論文翻訳(概要): Knowing When Not to Predict: Self Supervised Learning and Abstention for Safer DR Screening

論文の概要: Knowing When Not to Predict: Self Supervised Learning and Abstention for Safer DR Screening

arxiv url: http://arxiv.org/abs/2605.19133v1
Date: Mon, 18 May 2026 21:32:03 GMT
ステータス: 翻訳完了
システム内更新日: 2026-05-20 15:03:08.999638
Title: Knowing When Not to Predict: Self Supervised Learning and Abstention for Safer DR Screening
Title（参考訳）: 予測すべきでないことを知る: 自己指導型学習とサファーDRスクリーニングの欠如
Authors: Muskaan Chopra, Lorenz Sparrenberg, Jan H. Terheyden, Rafet Sifa,
Abstract要約: 自己教師付き学習(SSL)は、現在、医療画像モデルを事前訓練する標準的な方法である。我々は,SSL事前トレーニングの長さが,信頼性と信頼性に基づく棄権の校正にどのように影響するかを検討する。
参考スコア（独自算出の注目度）: 1.3288901827225499
License: http://creativecommons.org/licenses/by-nc-sa/4.0/
Abstract: Self-supervised learning (SSL) is now a standard way to pretrain medical image models, but performance is still mostly judged by downstream accuracy. For safety-critical screening tasks such as diabetic retinopathy grading, this is not enough: a model must also know when its predictions are unreliable and defer uncertain cases for clinical review. In this work, we examine how the length of SSL pretraining influences calibrated confidence and confidence-based abstention. We evaluate multiple SSL checkpoints under a fixed fine-tuning protocol and assess calibrated confidence, coverage, selective accuracy, and selective macro-F1. Across datasets and data regimes, SSL pretraining improves selective prediction compared to training from scratch. Unlike prior SSL studies that primarily evaluate downstream accuracy or AUROC, we analyze how SSL pretraining duration influences confidence behavior under calibrated confidence-based abstention. However, once accuracy saturates, selective performance can still change markedly across checkpoints, and longer pretraining does not consistently improve reliability. These results underscore the importance of abstention-aware evaluation and suggest that pretraining length should be treated as an important reliability-related design choice rather than only a computational detail. Code is available at GitHub.
Abstract（参考訳）: 自己教師付き学習(SSL)は現在、医療画像モデルを事前訓練する標準的な方法であるが、パフォーマンスは依然として下流の精度で判断されている。糖尿病性網膜症のグレーディングのような安全クリティカルなスクリーニングタスクでは、これは十分ではない: モデルは、その予測が信頼できないことを認識し、臨床検査の不確実なケースを遅らせる必要がある。本研究では,SSL事前トレーニングの長さが信頼度と信頼度に基づく棄権にどのように影響するかを検討する。固定された微調整プロトコルの下で複数のSSLチェックポイントを評価し、キャリブレーションされた信頼度、カバレッジ、選択精度、選択マクロF1を評価する。データセットとデータ構造全体で、SSL事前トレーニングは、スクラッチからのトレーニングよりも選択的な予測を改善する。下流の精度やAUROCを主に評価する以前のSSL研究とは異なり、SSL事前学習期間が信頼度に基づく禁忌条件下での信頼行動にどのように影響するかを分析する。しかし、精度が飽和すると、チェックポイント間で選択的なパフォーマンスが著しく変化し、事前トレーニングが長くても信頼性は向上しない。これらの結果は, 留意点を考慮した評価の重要性を裏付けるものであり, 事前学習長は, 計算の詳細だけでなく, 重要な信頼性に関する設計選択として扱うべきであることを示唆している。コードはGitHubで入手できる。

論文の概要: Knowing When Not to Predict: Self Supervised Learning and Abstention for Safer DR Screening

関連論文リスト