Fugu-MT 論文翻訳(概要): VulAgent: Hypothesis-Validation based Multi-Agent Vulnerability Detection

論文の概要: VulAgent: Hypothesis-Validation based Multi-Agent Vulnerability Detection

arxiv url: http://arxiv.org/abs/2509.11523v1
Date: Mon, 15 Sep 2025 02:25:38 GMT
ステータス: 翻訳完了
システム内更新日: 2025-09-16 17:26:23.12465
Title: VulAgent: Hypothesis-Validation based Multi-Agent Vulnerability Detection
Title（参考訳）: VulAgent:仮説Validationに基づくマルチエージェント脆弱性検出
Authors: Ziliang Wang, Ge Li, Jia Li, Hao Zhu, Zhi Jin,
Abstract要約: VulAgentは仮説検証に基づくマルチエージェント脆弱性検出フレームワークである。セマンティクスに敏感なマルチビュー検出パイプラインを実装しており、それぞれが特定の分析の観点から一致している。平均して、VulAgentは全体的な精度を6.6%改善し、脆弱性のある固定されたコードペアの正確な識別率を最大450%向上させ、偽陽性率を約36%削減する。
参考スコア（独自算出の注目度）: 55.957275374847484
License: http://creativecommons.org/publicdomain/zero/1.0/
Abstract: The application of language models to project-level vulnerability detection remains challenging, owing to the dual requirement of accurately localizing security-sensitive code and correctly correlating and reasoning over complex program context. We present VulAgent, a multi-agent vulnerability detection framework based on hypothesis validation. Our design is inspired by how human auditors review code: when noticing a sensitive operation, they form a hypothesis about a possible vulnerability, consider potential trigger paths, and then verify the hypothesis against the surrounding context. VulAgent implements a semantics-sensitive, multi-view detection pipeline: specialized agents, each aligned to a specific analysis perspective (e.g., memory, authorization), collaboratively surface and precisely localize sensitive code sites with higher coverage. Building on this, VulAgent adopts a hypothesis-validation paradigm: for each vulnerability report, it builds hypothesis conditions and a trigger path, steering the LLM to target the relevant program context and defensive checks during verification, which reduces false positives. On average across the two datasets, VulAgent improves overall accuracy by 6.6%, increases the correct identification rate of vulnerable--fixed code pairs by up to 450% (246% on average), and reduces the false positive rate by about 36% compared with state-of-the-art LLM-based baselines.
Abstract（参考訳）: 言語モデルのプロジェクトレベルの脆弱性検出への応用は、セキュリティに敏感なコードを正確にローカライズし、複雑なプログラムコンテキストを正しく関連付け、推論するという2つの要件のため、依然として困難である。仮説検証に基づくマルチエージェント脆弱性検出フレームワークであるVulAgentを提案する。センシティブな操作に気付くと、脆弱性に関する仮説を形成し、潜在的なトリガーパスを検討し、その仮説を周囲のコンテキストに対して検証します。 VulAgentはセマンティクスに敏感なマルチビュー検出パイプラインを実装している: 特殊エージェントは、それぞれが特定の分析視点(例えば、メモリ、認可)に整合し、協調的に表面化し、より高いカバレッジでセンシティブなコードサイトを正確にローカライズする。脆弱性レポート毎に仮説条件とトリガーパスを構築し、LCMをステアリングして関連するプログラムコンテキストと検証中の防御チェックを目標とすることで、偽陽性を低減します。 2つのデータセットの平均で、VulAgentは全体的な精度を6.6%改善し、脆弱性のある固定されたコードペアの正確な識別率を最大450%(平均246%)向上させ、最先端のLCMベースのベースラインと比較して、偽陽性率を約36%削減する。

論文の概要: VulAgent: Hypothesis-Validation based Multi-Agent Vulnerability Detection

関連論文リスト