Fugu-MT 論文翻訳(概要): Privacy-Aware Machine Unlearning with SISA for Reinforcement Learning-Based Ransomware Detection

論文の概要: Privacy-Aware Machine Unlearning with SISA for Reinforcement Learning-Based Ransomware Detection

arxiv url: http://arxiv.org/abs/2604.16760v1
Date: Sat, 18 Apr 2026 00:19:20 GMT
ステータス: 翻訳完了
システム内更新日: 2026-04-28 14:07:29.256292
Title: Privacy-Aware Machine Unlearning with SISA for Reinforcement Learning-Based Ransomware Detection
Title（参考訳）: 強化学習に基づくランサムウェア検出のためのSISAを用いたプライバシ・アウェア・マシン・アンラーニング
Authors: Jannatul Ferdous, Rafiqul Islam, Md Zahidul Islam,
Abstract要約: 本稿では、Sharded, Isolated, Sliced, Aggregated trainingをベースとしたランサムウェア検出のための、プライバシを意識しない機械学習評価フレームワークを提案する。このフレームワークは、検出器全体ではなく、影響を受けるモデルのシャードのみをトレーニングすることで、効率的なデータ削除を可能にする。
参考スコア（独自算出の注目度）: 1.8024397171920885
License: http://creativecommons.org/licenses/by-nc-nd/4.0/
Abstract: Ransomware detection systems increasingly rely on behavior-based machine learning to address evolving attack strategies. However, emerging privacy compliance, data governance, and responsible AI deployment demand not only accurate detection but also the ability to efficiently remove the influence of specific training samples without retraining the models from scratch. In this study, we present a privacy-aware machine unlearning evaluation framework for reinforcement learning (RL)-based ransomware detection built on Sharded, Isolated, Sliced, and Aggregated (SISA) training. The framework enables efficient data deletion by retraining only the affected model shards rather than the entire detector, reducing the retraining cost while preserving detection performance. We conduct a controlled comparative study using value-based RL agents, including Deep Q-Network (DQN) and Double Deep Q-Network (DDQN), under identical experimental settings with a cost-sensitive reward design and 5-fold cross-validation on Windows 11 ransomware dataset. Detection confidence is evaluated using a continuous Q-score margin, enabling ROC-AUC analysis beyond binary predictions. For unlearning, the dataset is partitioned into five shards with majority-vote aggregation, and a fast-unlearning path is evaluated by deleting 5% of the samples from a single shard and retraining only that shard. Results show that SISA-based unlearning incurs negligible utility degradation (<= 0.05 percent F1 drop) while substantially reducing retraining time relative to full SISA retraining. DDQN exhibits slightly improved stability and lower utility loss than DQN, while both agents maintain near identical in-distribution performance after unlearning. These findings indicate that SISA provides an efficient unlearning mechanism for RL-based ransomware detection, supporting privacy-aware deployment without compromising security effectiveness.
Abstract（参考訳）: ランサムウェア検出システムは、進化する攻撃戦略に対処するために、行動に基づく機械学習にますます依存している。しかしながら、プライバシコンプライアンスの出現、データガバナンス、責任あるAIデプロイメントは、正確な検出だけでなく、モデルをスクラッチからトレーニングすることなく、特定のトレーニングサンプルの影響を効率的に除去する能力も要求している。本研究では,Sharded,Isolated,Sliced,Aggregated(SISA)トレーニングをベースとした強化学習(RL)に基づくランサムウェア検出のための,プライバシを意識しない機械学習評価フレームワークを提案する。このフレームワークは、影響を受けるモデルシャードのみを検出器全体ではなく再トレーニングすることで、効率的なデータ削除を可能にし、検出性能を維持しつつ、再トレーニングコストを低減させる。我々は,DQN(Deep Q-Network)やDouble Deep Q-Network(Double Deep Q-Network)などの値ベースのRLエージェントを用いて,コスト感受性の報酬設計とWindows 11ランサムウェアデータセット上の5倍のクロスバリデーションを備えた同じ実験環境下で,制御された比較研究を行う。検出信頼度は連続したQスコアマージンを用いて評価され、2進予測を超えたROC-AUC分析が可能となる。アンラーニングでは、データセットは5つのシャードに分割され、多数決で集約され、高速なアンラーニングパスは、サンプルの5%を単一のシャードから削除し、そのシャードのみを再トレーニングすることで評価される。その結果,SISAをベースとした未学習は実用性低下(=0.05%F1ドロップ)を招き,SISAの再トレーニングに対する再トレーニング時間を大幅に短縮することがわかった。 DDQNはDQNよりも安定性がわずかに向上し、ユーティリティ損失も低かった。これらの結果から,SISA は RL ベースのランサムウェア検出のための効率的なアンラーニング機構を提供し,セキュリティ効果を損なうことなく,プライバシを意識したデプロイメントを支援することが示唆された。

論文の概要: Privacy-Aware Machine Unlearning with SISA for Reinforcement Learning-Based Ransomware Detection

関連論文リスト