Fugu-MT 論文翻訳(概要): Dynamic Weight Adjustment for Knowledge Distillation: Leveraging Vision Transformer for High-Accuracy Lung Cancer Detection and Real-Time Deployment

論文の概要: Dynamic Weight Adjustment for Knowledge Distillation: Leveraging Vision Transformer for High-Accuracy Lung Cancer Detection and Real-Time Deployment

arxiv url: http://arxiv.org/abs/2510.20438v1
Date: Thu, 23 Oct 2025 11:19:52 GMT
ステータス: 翻訳完了
システム内更新日: 2025-10-25 03:08:17.822361
Title: Dynamic Weight Adjustment for Knowledge Distillation: Leveraging Vision Transformer for High-Accuracy Lung Cancer Detection and Real-Time Deployment
Title（参考訳）: 知識蒸留のための動的重量調整:高精度肺癌検出とリアルタイム展開のための視覚変換器の活用
Authors: Saif Ur Rehman Khan, Muhammad Nabeel Asim, Sebastian Vollmer, Andreas Dengel,
Abstract要約: FuzzyDistillViT-MobileNetモデルは肺がん(LC)分類の新しいアプローチである。本手法は, ファジィ論理を用いて蒸留重量を動的に調整し, 生徒が高信頼領域に集中できるようにする。教師モデルとして視覚変換器(ViT-B32)を用い,学生モデルであるMobileNetに効果的に知識を伝達する。
参考スコア（独自算出の注目度）: 6.432534227472963
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: This paper presents the FuzzyDistillViT-MobileNet model, a novel approach for lung cancer (LC) classification, leveraging dynamic fuzzy logic-driven knowledge distillation (KD) to address uncertainty and complexity in disease diagnosis. Unlike traditional models that rely on static KD with fixed weights, our method dynamically adjusts the distillation weight using fuzzy logic, enabling the student model to focus on high-confidence regions while reducing attention to ambiguous areas. This dynamic adjustment improves the model ability to handle varying uncertainty levels across different regions of LC images. We employ the Vision Transformer (ViT-B32) as the instructor model, which effectively transfers knowledge to the student model, MobileNet, enhancing the student generalization capabilities. The training process is further optimized using a dynamic wait adjustment mechanism that adapts the training procedure for improved convergence and performance. To enhance image quality, we introduce pixel-level image fusion improvement techniques such as Gamma correction and Histogram Equalization. The processed images (Pix1 and Pix2) are fused using a wavelet-based fusion method to improve image resolution and feature preservation. This fusion method uses the wavedec2 function to standardize images to a 224x224 resolution, decompose them into multi-scale frequency components, and recursively average coefficients at each level for better feature representation. To address computational efficiency, Genetic Algorithm (GA) is used to select the most suitable pre-trained student model from a pool of 12 candidates, balancing model performance with computational cost. The model is evaluated on two datasets, including LC25000 histopathological images (99.16% accuracy) and IQOTH/NCCD CT-scan images (99.54% accuracy), demonstrating robustness across different imaging domains.
Abstract（参考訳）: 本稿では,肺がんの新しい分類法であるFuzzyDistillViT-MobileNetモデルを提案する。固定重み付き静的KDに依存する従来のモデルとは異なり,本手法はファジィ論理を用いて蒸留重量を動的に調整し,不明瞭な領域への注意を減らしながら高信頼領域に集中できるようにする。この動的調整により、LC画像の異なる領域にわたる様々な不確実性レベルを扱うモデル能力が改善される。教師モデルとして視覚変換器(ViT-B32)を用い,学生モデルであるMobileNetに効果的に知識を伝達し,生徒の一般化能力を向上させる。さらに、トレーニング手順を適応させて収束と性能を向上させる動的待ち調整機構を用いて、トレーニングプロセスをさらに最適化する。画像品質を向上させるため,ガンマ補正やヒストグラム等化などの画素レベルの画像融合改善技術を導入する。処理された画像(Pix1、Pix2)はウェーブレットベースの融合法で融合し、画像解像度と特徴保存を改善する。この融合法では、Wavedec2関数を用いて画像を224x224の解像度に標準化し、それらをマルチスケールの周波数成分に分解し、各レベルで再帰的に平均係数を算出して特徴表現を改善する。遺伝的アルゴリズム(GA)は、12の候補のプールから最も適した事前学習された学生モデルを選択するために用いられ、モデル性能と計算コストのバランスをとる。このモデルは、LC25000の病理像(99.16%の精度)とIQOTH/NCCD CTスキャン画像(99.54%の精度)を含む2つのデータセットで評価され、異なる画像領域にわたって堅牢性を示す。

論文の概要: Dynamic Weight Adjustment for Knowledge Distillation: Leveraging Vision Transformer for High-Accuracy Lung Cancer Detection and Real-Time Deployment

関連論文リスト