Fugu-MT 論文翻訳(概要): SafePickle: Robust and Generic ML Detection of Malicious Pickle-based ML Models

論文の概要: SafePickle: Robust and Generic ML Detection of Malicious Pickle-based ML Models

arxiv url: http://arxiv.org/abs/2602.19818v1
Date: Mon, 23 Feb 2026 13:19:43 GMT
ステータス: 翻訳完了
システム内更新日: 2026-02-24 17:42:02.824224
Title: SafePickle: Robust and Generic ML Detection of Malicious Pickle-based ML Models
Title（参考訳）: SafePickle: 不正ピクルスに基づくMLモデルのロバストおよびジェネリックML検出
Authors: Hillel Ohayon, Daniel Gilkarov, Ran Dubin,
Abstract要約: Hugging Faceのようなモデルリポジトリは、Pythonのピクルスフォーマットでシリアライズされた機械学習アーティファクトを配布する傾向にある。 PickleBallのような最近の防衛は、複雑なシステムセットアップと検証された良性モデルを必要とするライブラリごとのポリシー合成に依存している。ポリシー生成やコードインスツルメンテーションなしに悪意のあるPickleベースのファイルを検出する軽量な機械学習ベースのスキャナを提案する。
参考スコア（独自算出の注目度）: 6.365889364810239
License: http://creativecommons.org/licenses/by-nc-nd/4.0/
Abstract: Model repositories such as Hugging Face increasingly distribute machine learning artifacts serialized with Python's pickle format, exposing users to remote code execution (RCE) risks during model loading. Recent defenses, such as PickleBall, rely on per-library policy synthesis that requires complex system setups and verified benign models, which limits scalability and generalization. In this work, we propose a lightweight, machine-learning-based scanner that detects malicious Pickle-based files without policy generation or code instrumentation. Our approach statically extracts structural and semantic features from Pickle bytecode and applies supervised and unsupervised models to classify files as benign or malicious. We construct and release a labeled dataset of 727 Pickle-based files from Hugging Face and evaluate our models on four datasets: our own, PickleBall (out-of-distribution), Hide-and-Seek (9 advanced evasive malicious models), and synthetic joblib files. Our method achieves 90.01% F1-score compared with 7.23%-62.75% achieved by the SOTA scanners (Modelscan, Fickling, ClamAV, VirusTotal) on our dataset. Furthermore, on the PickleBall data (OOD), it achieves 81.22% F1-score compared with 76.09% achieved by the PickleBall method, while remaining fully library-agnostic. Finally, we show that our method is the only one to correctly parse and classify 9/9 evasive Hide-and-Seek malicious models specially crafted to evade scanners. This demonstrates that data-driven detection can effectively and generically mitigate Pickle-based model file attacks.
Abstract（参考訳）: Hugging Faceのようなモデルリポジトリは、Pythonのピクルフォーマットにシリアライズされた機械学習アーティファクトを配布し、モデルローディング中にリモートコード実行(RCE)のリスクにユーザをさらしている。 PickleBallのような最近の防衛は、複雑なシステムセットアップと検証された良性モデルを必要とするライブラリごとのポリシー合成に依存しており、スケーラビリティと一般化を制限している。本研究では、ポリシー生成やコードインスツルメンテーションなしで悪意のあるピクルスベースのファイルを検出する軽量な機械学習ベースのスキャナを提案する。我々のアプローチでは、Pickleバイトコードから構造的および意味的特徴を静的に抽出し、教師なしおよび教師なしのモデルを適用して、ファイルの良さや悪意を分類する。私たちはHugging Faceから727個のピクルスベースのファイルのラベル付きデータセットを構築してリリースし、当社独自の4つのデータセット、PickleBall(配布外)、Hide-and-Seek(高度な回避悪意のある9つのモデル)、および合成ジョブリファイルでモデルを評価します。提案手法は、データセット上のSOTAスキャナ(Modelscan, Fickling, ClamAV, VirusTotal)が達成した7.23%-62.75%と比較して90.01%のF1スコアを達成する。さらに、PickleBallデータ(OOD)では、PickleBallメソッドの76.09%と比較して81.22%のF1スコアを達成するが、完全なライブラリに依存しないままである。最後に、スキャナーを回避するために特別に作られた悪質モデル9/9を正しく解析し分類する唯一の方法であることを示す。このことは、データ駆動検出がピクルスベースのモデルファイル攻撃を効果的に、そして、汎用的に軽減できることを示している。

論文の概要: SafePickle: Robust and Generic ML Detection of Malicious Pickle-based ML Models

関連論文リスト