Fugu-MT 論文翻訳(概要): AgriKD: Cross-Architecture Knowledge Distillation for Efficient Leaf Disease Classification

論文の概要: AgriKD: Cross-Architecture Knowledge Distillation for Efficient Leaf Disease Classification

arxiv url: http://arxiv.org/abs/2605.01355v1
Date: Sat, 02 May 2026 09:58:57 GMT
ステータス: 翻訳完了
システム内更新日: 2026-05-05 20:33:49.723054
Title: AgriKD: Cross-Architecture Knowledge Distillation for Efficient Leaf Disease Classification
Title（参考訳）: AgriKD:効率的な葉疾患分類のためのクロスアーキテクチャ知識蒸留
Authors: Minh-Dung Le, Minh-Duc Hoang, Hoang-Vu Truong, Thi-Thu-Hong Phan,
Abstract要約: AgriKDは効率的なエッジデプロイメントのためのクロスアーキテクチャ知識蒸留フレームワークである。ビジョントランスフォーマー(ViT)の教師から、コンパクトな畳み込み学生モデルに知識を移す。それは無視できる精度で一貫した予測性能を達成する。
参考スコア（独自算出の注目度）: 0.05599792629509228
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Automated leaf disease classification is critical for early disease detection in resource-constrained field environments. Vision Transformers (ViTs) provide strong representation capability by modeling long-range dependencies and inter-class relationships; however, their high computational cost makes them impractical for deployment on edge devices. As a result, existing approaches struggle to effectively transfer these rich representations to lightweight models. This paper introduces AgriKD, a cross-architecture knowledge distillation framework for efficient edge deployment, which transfers knowledge from a Vision Transformer (ViT) teacher to a compact convolutional student model. To bridge the representational gap between Transformer and CNN architectures, the proposed approach integrates multiple distillation objectives at the output, feature, and relational levels, where each objective captures a different aspect of the teacher knowledge. This enables the student model to better preserve and utilize transformer-derived global representations. Experiments on multiple leaf disease datasets show that the distilled student achieves performance comparable to the teacher while significantly improving efficiency, reducing model parameters by approximately 172 times, computational cost by 47.57 times, and inference latency by 18-22 times. Furthermore, the optimized model is deployed across multiple runtime formats, including ONNX, TFLite Float16, and TensorRT FP16, achieving consistent predictive performance with negligible accuracy degradation. Real-world deployment on NVIDIA Jetson edge devices and a mobile application demonstrates reliable real-time inference, highlighting the practicality of AgriKD for AI-powered agricultural applications in resource-constrained environments.
Abstract（参考訳）: 自動葉病分類は, 資源制約環境下での早期の病原体検出に重要である。視覚変換器(ViT)は、長距離依存とクラス間関係をモデル化することによって、強力な表現能力を提供するが、その高い計算コストは、エッジデバイスへのデプロイにおいて実用的ではない。結果として、既存のアプローチは、これらのリッチな表現を軽量モデルに効果的に転送するのに苦労している。本稿では,視覚変換器(ViT)の教師からコンパクトな畳み込み学習者モデルに知識を伝達する,効率的なエッジ展開のためのクロスアーキテクチャ知識蒸留フレームワークであるAgriKDを紹介する。トランスフォーマーとCNNアーキテクチャ間の表現的ギャップを埋めるために,提案手法は,教師の知識の異なる側面を捉えた出力,特徴,関係レベルにおいて,複数の蒸留目標を統合する。これにより、学生モデルはトランスフォーマーから派生したグローバル表現をよりよく保存し、活用することができる。複数の葉病データセットの実験により、蒸留した学生は教師に匹敵する性能を達成し、効率を大幅に向上し、モデルパラメータを約172倍、計算コストを47.57倍、推論遅延を18-22倍に削減した。さらに、最適化されたモデルは、ONNX、TFLite Float16、TensorRT FP16を含む複数のランタイムフォーマットにデプロイされ、無視できる精度の劣化で一貫した予測性能を達成する。 NVIDIA Jetsonエッジデバイスとモバイルアプリケーションの実世界展開は、リソース制約のある環境におけるAIによる農業アプリケーションのためのAgriKDの実用性を強調し、信頼性の高いリアルタイム推論を示す。

論文の概要: AgriKD: Cross-Architecture Knowledge Distillation for Efficient Leaf Disease Classification

関連論文リスト