Fugu-MT 論文翻訳(概要): Agentic Knowledge Distillation: Autonomous Training of Small Language Models for SMS Threat Detection

論文の概要: Agentic Knowledge Distillation: Autonomous Training of Small Language Models for SMS Threat Detection

arxiv url: http://arxiv.org/abs/2602.10869v1
Date: Wed, 11 Feb 2026 13:57:56 GMT
ステータス: 翻訳完了
システム内更新日: 2026-02-12 21:44:01.957276
Title: Agentic Knowledge Distillation: Autonomous Training of Small Language Models for SMS Threat Detection
Title（参考訳）: エージェント知識蒸留:SMS脅威検出のための小言語モデルの自律的訓練
Authors: Adel ElZemity, Joshua Sylvester, Budi Arief, Rogério De Lemos,
Abstract要約: エージェント知識蒸留は、人間の介入なしにセキュリティタスクにデプロイ可能な、より小さな学生SLMを微調整する強力なLLMで構成されている。 SMSスパム・スマイッシング検出における教師の役割における4つのLLMと2人の学生のSLMを比較した。以上の結果から,教師のLLMによって性能が大きく異なり,94.31%の精度と96.25%のリコールが達成された。
参考スコア（独自算出の注目度）: 0.4899818550820574
License: http://creativecommons.org/licenses/by-nc-nd/4.0/
Abstract: SMS-based phishing (smishing) attacks have surged, yet training effective on-device detectors requires labelled threat data that quickly becomes outdated. To deal with this issue, we present Agentic Knowledge Distillation, which consists of a powerful LLM acts as an autonomous teacher that fine-tunes a smaller student SLM, deployable for security tasks without human intervention. The teacher LLM autonomously generates synthetic data and iteratively refines a smaller on-device student model until performance plateaus. We compare four LLMs in this teacher role (Claude Opus 4.5, GPT 5.2 Codex, Gemini 3 Pro, and DeepSeek V3.2) on SMS spam/smishing detection with two student SLMs (Qwen2.5-0.5B and SmolLM2-135M). Our results show that performance varies substantially depending on the teacher LLM, with the best configuration achieving 94.31% accuracy and 96.25% recall. We also compare against a Direct Preference Optimisation (DPO) baseline that uses the same synthetic knowledge and LoRA setup but without iterative feedback or targeted refinement; agentic knowledge distillation substantially outperforms it (e.g. 86-94% vs 50-80% accuracy), showing that closed-loop feedback and targeted refinement are critical. These findings demonstrate that agentic knowledge distillation can rapidly yield effective security classifiers for edge deployment, but outcomes depend strongly on which teacher LLM is used.
Abstract（参考訳）: SMSベースのフィッシング(スマイシング)攻撃が急増しているが、デバイス上での効果的な検出にはラベル付き脅威データが必要である。この問題に対処するため,人間を介さずにセキュリティタスクに展開可能な,小規模のSLMを微調整する自律教師としての強力なLLM機能を備えたエージェント知識蒸留法を提案する。教師LLMは、自動で合成データを生成し、演奏台地までデバイス上のより小さな生徒モデルを反復的に洗練する。この教師の役割における4つのLCM(Claude Opus 4.5, GPT 5.2 Codex, Gemini 3 Pro, DeepSeek V3.2)と2人の学生SLM(Qwen2.5-0.5B, SmolLM2-135M)を比較した。以上の結果から,教師のLLMによって性能が大きく異なり,94.31%の精度と96.25%のリコールが達成された。また, 直接選好最適化 (DPO) ベースラインと比較し, 同じ合成知識と LoRA セットアップを用いるが, 反復的なフィードバックや目標改質を伴わず, エージェント知識の蒸留により大幅に性能が向上し(例: 86-94% 対 50-80% の精度) , 閉ループフィードバックと目標改質が重要であることを示した。これらの結果から, エージェント的知識蒸留は, エッジ展開に有効なセキュリティ分類器を迅速に得ることが示唆された。

論文の概要: Agentic Knowledge Distillation: Autonomous Training of Small Language Models for SMS Threat Detection

関連論文リスト