Fugu-MT 論文翻訳(概要): Beyond Function-Level Analysis: Context-Aware Reasoning for Inter-Procedural Vulnerability Detection

論文の概要: Beyond Function-Level Analysis: Context-Aware Reasoning for Inter-Procedural Vulnerability Detection

arxiv url: http://arxiv.org/abs/2602.06751v1
Date: Fri, 06 Feb 2026 14:49:49 GMT
ステータス: 翻訳完了
システム内更新日: 2026-02-09 22:18:26.431583
Title: Beyond Function-Level Analysis: Context-Aware Reasoning for Inter-Procedural Vulnerability Detection
Title（参考訳）: 関数レベル解析を超えて:手続き間脆弱性検出のためのコンテキスト認識推論
Authors: Yikun Li, Ting Zhang, Jieke Shi, Chengran Yang, Junda He, Xin Zhou, Jinfeng Jiang, Huihui Huang, Wen Bin Leow, Yide Yin, Eng Lieh Ouh, Lwin Khin Shar, David Lo,
Abstract要約: 本稿では,コンテキスト認識型脆弱性検出フレームワークCPRVulを提案する。 CPRVulは関数のみのベースラインを一貫して上回ることを示す。また、処理コンテキストが構造化推論とペアリングされている場合にのみゲインが発生することを示す。
参考スコア（独自算出の注目度）: 13.077617614021863
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Recent progress in ML and LLMs has improved vulnerability detection, and recent datasets have reduced label noise and unrelated code changes. However, most existing approaches still operate at the function level, where models are asked to predict whether a single function is vulnerable without inter-procedural context. In practice, vulnerability presence and root cause often depend on contextual information. Naively appending such context is not a reliable solution: real-world context is long, redundant, and noisy, and we find that unstructured context frequently degrades the performance of strong fine-tuned code models. We present CPRVul, a context-aware vulnerability detection framework that couples Context Profiling and Selection with Structured Reasoning. CPRVul constructs a code property graph, and extracts candidate context. It then uses an LLM to generate security-focused profiles and assign relevance scores, selecting only high-impact contextual elements that fit within the model's context window. In the second phase, CPRVul integrates the target function, the selected context, and auxiliary vulnerability metadata to generate reasoning traces, which are used to fine-tune LLMs for reasoning-based vulnerability detection. We evaluate CPRVul on three high-quality vulnerability datasets: PrimeVul, TitanVul, and CleanVul. Across all datasets, CPRVul consistently outperforms function-only baselines, achieving accuracies ranging from 64.94% to 73.76%, compared to 56.65% to 63.68% for UniXcoder. Specifically, on the challenging PrimeVul benchmark, CPRVul achieves 67.78% accuracy, outperforming prior state-of-the-art approaches, improving accuracy from 55.17% to 67.78% (22.9% improvement). Our ablations further show that neither raw context nor processed context alone benefits strong code models; gains emerge only when processed context is paired with structured reasoning.
Abstract（参考訳）: MLとLLMの最近の進歩は脆弱性検出を改善し、最近のデータセットはラベルノイズと無関係なコード変更を減らす。しかし、既存のほとんどのアプローチは、関数レベルで動作しており、モデルには、プロセス間コンテキストなしで単一の関数が脆弱かどうかを予測するように求められている。実際には、脆弱性の存在と根本原因はしばしば文脈情報に依存する。現実世界のコンテキストは長く、冗長で、騒々しく、構造化されていないコンテキストは、強い微調整されたコードモデルの性能を劣化させることが多い。本稿では,コンテキストプロファイリングと構造化推論による選択を併用したコンテキスト認識型脆弱性検出フレームワークCPRVulを提案する。 CPRVulはコードプロパティグラフを構築し、候補コンテキストを抽出する。次にLLMを使用して、セキュリティを重視したプロファイルを生成し、関連するスコアを割り当て、モデルのコンテキストウィンドウに適合する高インパクトなコンテキスト要素のみを選択する。第2フェーズでは、CPRVulはターゲット関数、選択されたコンテキスト、補助的な脆弱性メタデータを統合して、推論に基づく脆弱性検出のためにLLMを微調整するために使用される推論トレースを生成する。 CPRVulを、PrimeVul、TitanVul、CleanVulの3つの高品質な脆弱性データセットで評価する。すべてのデータセットで、CPRVulは関数のみのベースラインを一貫して上回り、64.94%から73.76%のアキュラシーを達成している。具体的には、挑戦的なPrimeVulベンチマークにおいて、CPRVulは67.78%の精度を達成し、最先端のアプローチよりも優れ、精度は55.17%から67.78%(22.9%の改善)に向上した。私たちの主張は、生のコンテキストと処理されたコンテキストだけでは、強力なコードモデルに利益が得られないことを示している。

論文の概要: Beyond Function-Level Analysis: Context-Aware Reasoning for Inter-Procedural Vulnerability Detection

関連論文リスト