Fugu-MT 論文翻訳(概要): Beyond Imprecise Distance Metrics: LLM-Predicted Target Call Stacks for Directed Greybox Fuzzing

論文の概要: Beyond Imprecise Distance Metrics: LLM-Predicted Target Call Stacks for Directed Greybox Fuzzing

arxiv url: http://arxiv.org/abs/2510.23101v1
Date: Mon, 27 Oct 2025 08:17:03 GMT
ステータス: 翻訳完了
システム内更新日: 2025-10-28 19:54:32.596391
Title: Beyond Imprecise Distance Metrics: LLM-Predicted Target Call Stacks for Directed Greybox Fuzzing
Title（参考訳）: 不正確な距離の測定値を超える: 直接グレイボックスファジングのためのLLM予測ターゲットコールスタック
Authors: Yifan Zhang, Xin Zhang,
Abstract要約: Directed graybox fuzzing (DGF)は、特定のターゲット位置でバグを効率的にトリガーすることを目的としている。既存のDGFアプローチは不正確な確率計算に悩まされている。そこで本稿では,静的解析に基づく距離測定を高精度なコールスタック表現に置き換えることを提案する。
参考スコア（独自算出の注目度）: 11.825548125173022
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Directed greybox fuzzing (DGF) aims to efficiently trigger bugs at specific target locations by prioritizing seeds whose execution paths are more likely to mutate into triggering target bugs. However, existing DGF approaches suffer from imprecise probability calculations due to their reliance on complex distance metrics derived from static analysis. The over-approximations inherent in static analysis cause a large number of irrelevant execution paths to be mistakenly considered to potentially mutate into triggering target bugs, significantly reducing fuzzing efficiency. We propose to replace static analysis-based distance metrics with precise call stack representations. Call stacks represent precise control flows, thereby avoiding false information in static analysis. We leverage large language models (LLMs) to predict vulnerability-triggering call stacks for guiding seed prioritization. Our approach constructs call graphs through static analysis to identify methods that can potentially reach target locations, then utilizes LLMs to predict the most likely call stack sequence that triggers the vulnerability. Seeds whose execution paths have higher overlap with the predicted call stack are prioritized for mutation. This is the first work to integrate LLMs into the core seed prioritization mechanism of DGF. We implement our approach and evaluate it against several state-of-the-art fuzzers. On a suite of real-world programs, our approach triggers vulnerabilities $1.86\times$ to $3.09\times$ faster compared to baselines. In addition, our approach identifies 10 new vulnerabilities and 2 incomplete fixes in the latest versions of programs used in our controlled experiments through directed patch testing, with 10 assigned CVE IDs.
Abstract（参考訳）: Directed graybox fuzzing (DGF) は、実行経路が変更されやすく、ターゲットのバグを引き起こす種子を優先順位付けすることで、特定のターゲット位置のバグを効率的にトリガーすることを目的としている。しかし、既存のDGFアプローチは、静的解析から得られた複素距離メトリクスに依存するため、不正確な確率計算に悩まされている。静的解析に固有の過度な近似は、多くの無関係な実行パスを誤って、ターゲットのバグを引き起こす可能性があるとみなし、ファジング効率を著しく低下させる。そこで本稿では,静的解析に基づく距離測定を高精度なコールスタック表現に置き換えることを提案する。コールスタックは正確な制御フローを表し、静的解析における誤った情報を避ける。我々は、大規模言語モデル(LLM)を利用して、シード優先順位付けを導くための脆弱性トリガ呼び出しスタックを予測します。我々のアプローチは静的解析によってコールグラフを構築し、ターゲット位置に到達する可能性のあるメソッドを特定し、次にLSMを使用して、脆弱性を引き起こす最も可能性が高いコールスタックシーケンスを予測する。予測されたコールスタックと高いオーバーラップした実行パスを持つ種子は、突然変異のために優先順位付けされる。これは、LDMをDGFのコアシード優先順位付け機構に統合する最初の試みである。提案手法を実装し,いくつかの最先端ファジィファジィに対して評価する。現実世界のプログラムのスイートでは、我々のアプローチは脆弱性を1.86\times$から3.09\times$にトリガーします。さらに本手法では,10個のCVEIDを割り当てた有向パッチテストにより,制御実験で使用したプログラムの最新バージョンにおいて,10個の新たな脆弱性と2個の不完全な修正を識別する。

論文の概要: Beyond Imprecise Distance Metrics: LLM-Predicted Target Call Stacks for Directed Greybox Fuzzing

関連論文リスト