Fugu-MT 論文翻訳(概要): T-DuMpRa: Teacher-guided Dual-path Multi-prototype Retrieval Augmented framework for fine-grained medical image classification

論文の概要: T-DuMpRa: Teacher-guided Dual-path Multi-prototype Retrieval Augmented framework for fine-grained medical image classification

arxiv url: http://arxiv.org/abs/2604.17360v1
Date: Sun, 19 Apr 2026 10:13:46 GMT
ステータス: 翻訳完了
システム内更新日: 2026-04-21 21:52:52.487044
Title: T-DuMpRa: Teacher-guided Dual-path Multi-prototype Retrieval Augmented framework for fine-grained medical image classification
Title（参考訳）: T-DuMpRa:きめ細かな医用画像分類のための教師誘導デュアルパス多目的検索フレームワーク
Authors: Zixuan Tang, Shen Zhao,
Abstract要約: T-DuMpRaは教師向けデュアルパスマルチプロトタイプ検索拡張フレームワークである。本稿では,T-DuMpRaを提案する。本法では5種類の背骨に対して0.68%-0.21%,0.44%-2.69%の改善が得られた。
参考スコア（独自算出の注目度）: 9.53765398422759
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Fine-grained medical image classification is challenged by subtle inter-class variations and visually ambiguous cases, where confidence estimates often exhibit uncertainty rather than being overconfident. In such scenarios, purely discriminative classifiers may achieve high overall accuracy yet still fail to distinguish between highly similar categories, leading to miscalibrated predictions. We propose T-DuMpRa, a teacher-guided dual-path multi-prototype retrieval-augmented framework, where discriminative classification and multi-prototype retrieval jointly drive both training and prediction. During training, we jointly optimize cross-entropy and supervised contrastive objectives to learn a cosine-compatible embedding geometry for reliable prototype matching. We further employ an exponential moving average (EMA) teacher to obtain smoother representations and build a multi-prototype memory bank by clustering teacher embeddings in the teacher embedding space. Our framework is plug-and-play: it can be easily integrated into existing classification models by constructing a compact prototype bank, thereby improving performance on visually ambiguous cases. At inference, we combine the classifier's predicted distribution with a similarity-based distribution computed via cosine matching to prototypes, and apply a conservative confidence-gated fusion that activates retrieval only when the classifier's prediction is uncertain and the retrieval evidence is decisive and conflicting, otherwise keeping confident predictions unchanged. On HAM10000 and ISIC2019, our method yields 0.68%-0.21% and 0.44%-2.69% improvements on 5 different backbones. And visualization analysis proves our model can enhance the model's ability to handle visually ambiguous cases.
Abstract（参考訳）: 微細な医用画像分類は、微妙なクラス間変異と視覚的不明瞭なケースによって挑戦され、信頼度推定は過度に信頼されるよりも不確実性を示すことが多い。このようなシナリオでは、純粋に識別的分類器は高い全体的な精度を達成できるが、非常に類似したカテゴリの区別に失敗し、誤った予測をもたらす。 T-DuMpRa,T-DuMpRa,T-DuMpRa,T-DuMpRa,T-DuMpRa,T-DuMpRa,T-DuMpRa,T-DuMpRa,T-DuMpRa。トレーニング中、我々はクロスエントロピーと教師付きコントラスト目標を共同で最適化し、信頼できるプロトタイプマッチングのためのコサイン互換な埋め込み幾何学を学習する。さらに、指数移動平均(EMA)教師を用いて、スムーズな表現を得るとともに、教師埋め込み空間に教師埋め込みをクラスタリングすることで、マルチプロトタイプメモリバンクを構築する。我々のフレームワークはプラグアンドプレイであり、コンパクトなプロトタイプバンクを構築することで既存の分類モデルに容易に統合することができ、視覚的にあいまいなケースの性能を向上させることができる。推定では,分類器の予測分布と,原型と一致するコサインを用いて計算した類似性に基づく分布とを組み合わせ,分類器の予測が不確実であり,検証証拠が決定的かつ矛盾している場合にのみ,検索を活性化する保守的信頼制御融合を適用する。 HAM10000とISIC2019では、5つの異なるバックボーンに対して0.68%-0.21%と0.44%-2.69%の改善が得られた。そして、可視化分析により、我々のモデルは、視覚的に曖昧なケースを扱うモデルの能力を高めることができることを示す。

論文の概要: T-DuMpRa: Teacher-guided Dual-path Multi-prototype Retrieval Augmented framework for fine-grained medical image classification

関連論文リスト