Fugu-MT 論文翻訳(概要): LLMHoney: A Real-Time SSH Honeypot with Large Language Model-Driven Dynamic Response Generation

論文の概要: LLMHoney: A Real-Time SSH Honeypot with Large Language Model-Driven Dynamic Response Generation

arxiv url: http://arxiv.org/abs/2509.01463v1
Date: Mon, 01 Sep 2025 13:28:00 GMT
ステータス: 翻訳完了
システム内更新日: 2025-09-04 15:17:03.714839
Title: LLMHoney: A Real-Time SSH Honeypot with Large Language Model-Driven Dynamic Response Generation
Title（参考訳）: LLMHoney: 大規模言語モデル駆動型動的応答生成を備えたリアルタイムSSHHoneypot
Authors: Pranjay Malhotra,
Abstract要約: 本稿では,Large Language Models (LLMs) を利用してリアルタイムに動的コマンド出力を生成するSSHハニーポット LLMHoney について述べる。 LLMは辞書ベースの仮想ファイルシステムを統合し、低レイテンシで共通コマンドを処理する。精度(exact-match, Cosine similarity, Jaro-Winkler similarity, Levenshtein similarity and BLEU score)、応答遅延、メモリオーバーヘッドなどを含む総合的な指標を報告する。
参考スコア（独自算出の注目度）: 0.0
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Cybersecurity honeypots are deception tools for engaging attackers and gather intelligence, but traditional low or medium-interaction honeypots often rely on static, pre-scripted interactions that can be easily identified by skilled adversaries. This Report presents LLMHoney, an SSH honeypot that leverages Large Language Models (LLMs) to generate realistic, dynamic command outputs in real time. LLMHoney integrates a dictionary-based virtual file system to handle common commands with low latency while using LLMs for novel inputs, achieving a balance between authenticity and performance. We implemented LLMHoney using open-source LLMs and evaluated it on a testbed with 138 representative Linux commands. We report comprehensive metrics including accuracy (exact-match, Cosine Similarity, Jaro-Winkler Similarity, Levenshtein Similarity and BLEU score), response latency and memory overhead. We evaluate LLMHoney using multiple LLM backends ranging from 0.36B to 3.8B parameters, including both open-source models and a proprietary model(Gemini). Our experiments compare 13 different LLM variants; results show that Gemini-2.0 and moderately-sized models Qwen2.5:1.5B and Phi3:3.8B provide the most reliable and accurate responses, with mean latencies around 3 seconds, whereas smaller models often produce incorrect or out-of-character outputs. We also discuss how LLM integration improves honeypot realism and adaptability compared to traditional honeypots, as well as challenges such as occasional hallucinated outputs and increased resource usage. Our findings demonstrate that LLM-driven honeypots are a promising approach to enhance attacker engagement and collect richer threat intelligence.
Abstract（参考訳）: サイバーセキュリティのハニーポットは攻撃者を誘惑し、知性を集めるための偽装ツールであるが、伝統的な低または中規模のハニーポットは、しばしば、熟練した敵によって容易に識別できる静的で事前記述された相互作用に依存している。本稿では,Large Language Models (LLMs) を利用してリアルタイムに動的コマンド出力を生成するSSHハニーポット LLMHoney について述べる。 LLMHoneyは辞書ベースの仮想ファイルシステムを統合し、新しい入力にLLMを使用しながら低レイテンシで共通コマンドを処理する。オープンソース LLM を用いて LLMHoney を実装し,138 個のLinux コマンドでテストベッド上で評価した。精度(exact-match, Cosine similarity, Jaro-Winkler similarity, Levenshtein similarity and BLEU score)、応答遅延、メモリオーバーヘッドなどを含む総合的な指標を報告する。我々は、オープンソースモデルとプロプライエタリモデル(Gemini)の両方を含む0.36Bから3.8Bのパラメータを含む複数のLCMバックエンドを用いてLCMHoneyを評価する。実験の結果、ジェミニ2.0と中程度のモデル Qwen2.5:1.5B と Phi3:3.8B が最も信頼性が高く正確な応答を提供し、平均レイテンシは3秒程度であるのに対し、小さいモデルでは不正確または不正確な出力が生じることが示されている。また,従来のハニーポットに比べてLLM統合がハニーポットリアリズムと適応性をどのように改善するか,また時折幻覚出力や資源利用の増加といった課題についても論じる。以上の結果から,LSMを駆使したハニーポットは攻撃のエンゲージメントを高め,よりリッチな脅威知能を収集するための有望なアプローチであることが示唆された。

論文の概要: LLMHoney: A Real-Time SSH Honeypot with Large Language Model-Driven Dynamic Response Generation

関連論文リスト