Fugu-MT paper translation (abstract): Retrieval-Augmented LLMs for Security Incident Analysis

Paper summary: Retrieval-Augmented LLMs for Security Incident Analysis

arxiv url: http://arxiv.org/abs/2603.18196v1
Date: Wed, 18 Mar 2026 18:45:56 GMT
Status: translation complete
Last updated in system: 2026-03-20 17:19:05.808991
Title: Retrieval-Augmented LLMs for Security Incident Analysis
Title (reference translation): Retrieval-Augmented LLMs for Security Incident Analysis
Authors: Xavier Cadet, Aditya Vikram Singh, Harsh Mamania, Edward Koh, Alex Fitts, Dirk Van Bruggen, Simona Boboila, Peter Chin, Alina Oprea
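Abstract summary: This paper proposes a RAG-based system that performs security incident analysis through targeted query-based filtering and LLM semantic reasoning. The system is evaluated with five LLM providers on malware traffic incidents and multi-stage Active Directory attacks.
Reference score (independently computed attention score): 8.426791694746747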
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Investigating cybersecurity incidents requires collecting and analyzing evidence from multiple log sources, including intrusion detection alerts, network traffic records, and authentication events. This process is labor-intensive: analysts must sift through large volumes of data to identify relevant indicators and piece together what happened. We present a RAG-based system that performs security incident analysis through targeted query-based filtering and LLM semantic reasoning. The system uses a query library with associated MITRE ATT&CK techniques to extract indicators from raw logs, then retrieves relevant context to answer forensic questions and reconstruct attack sequences. We evaluate the system with five LLM providers on malware traffic incidents and multi-stage Active Directory attacks. We find that LLM models have different performance and tradeoffs, with Claude Sonnet 4 and DeepSeek V3 achieving 100% recall across all four malware scenarios, while DeepSeek costs 15× less ($0.008 vs. $0.12 per analysis). Attack step detection on Active Directory scenarios reaches 100% precision and 82% recall. Ablation studies confirm that a RAG architecture is essential: LLM baselines without RAG-enhanced context correctly identify victim hosts but miss all attack infrastructure including malicious domains and command-and-control servers. These results demonstrate that combining targeted query-based filtering with RAG-based retrieval enables accurate, cost-effective security analysis within LLM context limits.
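Abstract (reference translation): Investigating a cybersecurity incident requires collecting and analyzing evidence from multiple log sources, such as intrusion detection alerts, network traffic records, and authentication events. The process is labor-intensive: analysts must work through large volumes of data to identify relevant indicators and reconstruct what happened. This paper proposes a RAG-based system that performs security incident analysis through targeted query-based filtering and LLM semantic reasoning. The system uses a query library annotated with MITRE ATT&CK techniques to extract indicators from raw logs, then retrieves relevant context to answer forensic questions and reconstruct attack sequences. The system is evaluated with five LLM providers on malware traffic incidents and multi-stage Active Directory attacks. The models differ in performance and tradeoffs: Claude Sonnet 4 and DeepSeek V3 both achieve 100% recall across all four malware scenarios, while DeepSeek costs 15× less ($0.008 vs. $0.12 per analysis). Attack step detection on the Active Directory scenarios reaches 100% precision and 82% recall. Ablation studies confirm that the RAG architecture is essential: LLM baselines without RAG-enhanced context correctly identify victim hosts but miss all attack infrastructure, including malicious domains and command-and-control servers. These results show that combining targeted query-based filtering with RAG-based retrieval enables accurate, cost-effective security analysis within LLM context limits.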
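
The two-stage idea in the abstract (a query library tagged with MITRE ATT&CK techniques extracts indicators from raw logs, and relevant hits are then retrieved as context for a forensic question) can be illustrated with a minimal Python sketch. Everything here is an assumption made for illustration and is not taken from the paper: the `Query` class, the regex patterns, the log line format, and the keyword-overlap ranking, which merely stands in for whatever retriever the authors actually use. Only the ATT&CK technique IDs (T1071.004, T1110) are real identifiers.

```python
import re
from dataclasses import dataclass


@dataclass(frozen=True)
class Query:
    """One hypothetical entry in a query library."""
    name: str
    technique: str       # MITRE ATT&CK technique ID associated with the query
    pattern: re.Pattern  # regex with a named group "indicator"


# Hypothetical query library; patterns and log formats are invented for this sketch.
QUERY_LIBRARY = [
    Query("dns lookup", "T1071.004",
          re.compile(r"dns query: (?P<indicator>[\w.-]+)")),
    Query("failed logon", "T1110",
          re.compile(r"logon failure .*user=(?P<indicator>\S+)")),
]


def extract_indicators(log_lines):
    """Stage 1: filter raw logs down to indicator hits via the query library."""
    hits = []
    for line in log_lines:
        for query in QUERY_LIBRARY:
            match = query.pattern.search(line)
            if match:
                hits.append({
                    "query": query.name,
                    "technique": query.technique,
                    "indicator": match.group("indicator"),
                })
    return hits


def retrieve_context(hits, question, k=2):
    """Stage 2: rank hits by word overlap with the question (a toy retriever)."""
    question_tokens = set(re.findall(r"\w+", question.lower()))

    def score(hit):
        text = f"{hit['query']} {hit['technique']} {hit['indicator']}".lower()
        return len(question_tokens & set(re.findall(r"\w+", text)))

    # sorted() is stable, so ties keep their original log order.
    return sorted(hits, key=score, reverse=True)[:k]


# Invented sample log lines; only the first two match a library query.
SAMPLE_LOGS = [
    "2026-01-01T00:00:01 dns query: evil.example.com from 10.0.0.5",
    "2026-01-01T00:00:02 logon failure for host=dc01 user=alice",
    "2026-01-01T00:00:03 heartbeat ok",
]

hits = extract_indicators(SAMPLE_LOGS)
context = retrieve_context(hits, "Which dns domains were queried?", k=1)
```

In a full pipeline the retrieved `context` would be placed into the LLM prompt alongside the forensic question; filtering first keeps the prompt small, which is the point the abstract makes about staying within LLM context limits.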
