Fugu-MT Paper Translation (Abstract): ShellGames: Speculative LLM-Driven SSH Deception

Paper overview: ShellGames: Speculative LLM-Driven SSH Deception

arxiv url: http://arxiv.org/abs/2606.17986v1
Date: Tue, 16 Jun 2026 14:40:08 GMT
Status: Translation complete
Last updated in system: 2026-06-17 17:15:32.48675
Title: ShellGames: Speculative LLM-Driven SSH Deception
Title (reference translation): ShellGames: Speculative LLM-Driven SSH Deception
Authors: Umberto Salviati, Fabio De Gaspari, Mauro Conti, Luigi Vincenzo Mancini
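Abstract summary: Large Language Models (LLMs) offer a promising path toward more dynamic deception systems. However, LLMs suffer from key limitations that fundamentally restrict their applicability, including lack of persistent state, output inconsistencies, hallucinations, latency, and susceptibility to behavioral subversion that may reveal the deception. We propose ShellGames, an LLM-based SSH shell simulator.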
Reference score (independently computed attention metric): 17.337274635900933
License: http://creativecommons.org/licenses/by-nc-nd/4.0/
Abstract: Cyber deception and Moving Target Defense are promising strategies that aim to disrupt adversaries by increasing uncertainty. However, sustaining long-lived, credible interactive sessions with adversaries remains an open challenge. Large Language Models (LLMs) offer a promising path toward more dynamic deception systems, but suffer from key limitations that fundamentally limit their applicability, including: lack of persistent state, output inconsistencies, hallucinations, latency, and susceptibility to behavioral subversion that may reveal the deception. We propose ShellGames, an SSH shell simulator based on LLM designed to address these limitations. ShellGames combines five complementary techniques: (i) Automatic Chain-of-Thought and few-shot learning to improve correctness; (ii) memory management to maintain system state coherency; (iii) speculative command execution to reduce response latency; (iv) smart routing of complex interactive commands to a sandboxed environment; and (v) subversion detection leveraging the constrained input-output domain of shell environments. To enable systematic evaluation, we introduce a standardized benchmarking protocol and dataset spanning correctness, consistency, state tracking, and robustness tasks. ShellGames achieves $0.898$ command accuracy on correctness ($+5.3pp$ over baselines), $0.918$ sequence-level accuracy on consistency ($+36pp$), $0.98$ state tracking accuracy ($+18.3pp$), and $0.95$ accuracy on robustness ($+37pp$). A user study with $n=20$ participants confirms that ShellGames achieves realism comparable to a real shell under free exploration and outperforms traditional honeypots on perceived command coverage.
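Abstract (reference translation): Cyber deception and Moving Target Defense are promising strategies that seek to disrupt adversaries by increasing uncertainty. However, sustaining long-lived, credible interactive sessions with adversaries remains an open challenge. Large Language Models (LLMs) offer a promising path toward more dynamic deception systems, but they suffer from key limitations that fundamentally restrict their applicability, including lack of persistent state, output inconsistencies, hallucinations, latency, and susceptibility to behavioral subversion that may reveal the deception. We propose ShellGames, an LLM-based SSH shell simulator designed to address these limitations. ShellGames combines five complementary techniques: (i) Automatic Chain-of-Thought and few-shot learning to improve correctness; (ii) memory management to maintain system state coherency; (iii) speculative command execution to reduce response latency; (iv) smart routing of complex interactive commands to a sandboxed environment; and (v) subversion detection that leverages the constrained input-output domain of shell environments. To enable systematic evaluation, we introduce a standardized benchmarking protocol and dataset covering correctness, consistency, state tracking, and robustness tasks. ShellGames achieves $0.898$ command accuracy on correctness ($+5.3pp$ over baselines), $0.918$ sequence-level accuracy on consistency ($+36pp$), $0.98$ state tracking accuracy ($+18.3pp$), and $0.95$ accuracy on robustness ($+37pp$). A user study with $n=20$ participants confirms that ShellGames achieves realism comparable to a real shell under free exploration and outperforms traditional honeypots on perceived command coverage.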
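
The abstract names speculative command execution as the latency-reduction technique but does not say how ShellGames implements it. The following is only a minimal sketch of the general idea, not the authors' method: while the attacker is reading one response, replies for likely follow-up commands are computed in the background and served from a cache on a hit. The names `SpeculativeShell`, `generate`, and `predict_next` are hypothetical, and the sketch ignores shell state, which a real system would need to track before reusing a speculated reply.

```python
from concurrent.futures import Future, ThreadPoolExecutor
from typing import Callable, Dict, Iterable


class SpeculativeShell:
    """Hypothetical wrapper that pre-computes replies for likely next commands."""

    def __init__(
        self,
        generate: Callable[[str], str],                # slow call, e.g. an LLM request (assumed)
        predict_next: Callable[[str], Iterable[str]],  # guesses follow-up commands (assumed)
        max_workers: int = 4,
    ) -> None:
        self._generate = generate
        self._predict_next = predict_next
        self._pool = ThreadPoolExecutor(max_workers=max_workers)
        self._pending: Dict[str, Future] = {}

    def run(self, command: str) -> str:
        # Cache hit: the command was predicted, so reuse the background result.
        future = self._pending.pop(command, None)
        if future is None:
            output = self._generate(command)  # cache miss: pay the full latency
        else:
            output = future.result()

        # Drop guesses that were not used; cancel() is a no-op if already running.
        for stale in self._pending.values():
            stale.cancel()

        # Speculate on the commands most likely to follow this one.
        self._pending = {
            nxt: self._pool.submit(self._generate, nxt)
            for nxt in self._predict_next(command)
        }
        return output

    def close(self) -> None:
        # Wait for outstanding background work and release the worker threads.
        self._pool.shutdown(wait=True)
```

With a predictor that returns `["ls"]` after a `cd`, calling `run("cd /tmp")` followed by `run("ls")` would serve the second reply from the background result instead of issuing a second blocking call.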


Related papers