Fugu-MT 論文翻訳(概要): Malicious ML Model Detection by Learning Dynamic Behaviors

論文の概要: Malicious ML Model Detection by Learning Dynamic Behaviors

arxiv url: http://arxiv.org/abs/2604.19438v1
Date: Tue, 21 Apr 2026 13:12:42 GMT
ステータス: 翻訳完了
システム内更新日: 2026-04-22 22:41:49.779916
Title: Malicious ML Model Detection by Learning Dynamic Behaviors
Title（参考訳）: 学習動作による悪性MLモデル検出
Authors: Sarang Nambiar, Dhruv Pradhan, Ezekiel Soremekun,
Abstract要約: 悪意のある攻撃は、例えばモデルローディング中に、信頼されたユーザー環境上で任意のコードを実行することができる。動的解析と機械学習(ML)を用いて良性PTMの挙動を学習し、悪意のあるPTMを検出する新しい手法を提案する。我々は,Hugging FaceやMalHugなど,さまざまなソースから25,000以上の良性および悪意のあるPTMを用いてDynaHugを評価する。
参考スコア（独自算出の注目度）: 0.0
License: http://creativecommons.org/licenses/by-nc-sa/4.0/
Abstract: Pre-trained machine learning models (PTMs) are commonly provided via Model Hubs (e.g., Hugging Face) in standard formats like Pickles to facilitate accessibility and reuse. However, this ML supply chain setting is susceptible to malicious attacks that are capable of executing arbitrary code on trusted user environments, e.g., during model loading. To detect malicious PTMs, state-of-the-art detectors (e.g., PickleScan) rely on rules, heuristics, or static analysis, but ignore runtime model behaviors. Consequently, they either miss malicious models due to under-approximation (blacklisting) or miscategorize benign models due to over-approximation (static analysis or whitelisting). To address this challenge, we propose a novel technique (DynaHug) which detects malicious PTMs by learning the behavior of benign PTMs using dynamic analysis and machine learning (ML). DynaHug trains an ML classifier (one-class SVM (OCSVM)) on the runtime behaviours of task-specific benign models. We evaluate DynaHug using over 25,000 benign and malicious PTMs from different sources including Hugging Face and MalHug. We also compare DynaHug to several state-of-the-art detectors including static, dynamic and LLM-based detectors. Results show that DynaHug is up to 44% more effective than existing baselines in terms of F1-score. Our ablation study demonstrates that our design decisions (dynamic analysis, OCSVM, clustering) contribute positively to DynaHug's effectiveness.
Abstract（参考訳）: 事前トレーニングされた機械学習モデル(PTM)は一般的に、アクセシビリティと再利用を容易にするために、Picklesのような標準フォーマットのモデルハブ(例えばHugging Face)を介して提供される。しかし、このMLサプライチェーン設定は、モデルローディング中に信頼できるユーザ環境、例えば、任意のコードを実行できる悪意のある攻撃に影響を受けやすい。悪意のあるPTMを検出するために、最先端の検出器(例えばPickleScan)はルールやヒューリスティック、静的解析に依存しているが、実行時のモデル動作は無視する。その結果、過度な近似(ブラックリスト)のために悪意あるモデルを見逃すか、過度な近似(静的分析またはホワイトリスト)のために良質なモデルを誤分類する。この課題に対処するために、動的解析と機械学習(ML)を用いて良性PTMの振る舞いを学習し、悪意のあるPTMを検出する新しい手法(DynaHug)を提案する。 DynaHugはタスク固有の良性モデルの実行時の振る舞いに基づいてML分類器(一級SVM(OCSVM))をトレーニングする。我々は,Hugging FaceやMalHugなど,さまざまなソースから25,000以上の良性および悪意のあるPTMを用いてDynaHugを評価する。また、DynaHugを静的、動的、LLMベースの検出器を含む最先端の検出器と比較する。その結果、DynaHugはF1スコアの点で既存のベースラインよりも最大44%有効であることがわかった。我々のアブレーション調査は、設計決定(動的解析、OCSVM、クラスタリング)がDynaHugの有効性に肯定的な貢献をすることを示している。

論文の概要: Malicious ML Model Detection by Learning Dynamic Behaviors

関連論文リスト