Fugu-MT 論文翻訳(概要): Evaluating Line-level Localization Ability of Learning-based Code Vulnerability Detection Models

論文の概要: Evaluating Line-level Localization Ability of Learning-based Code Vulnerability Detection Models

arxiv url: http://arxiv.org/abs/2510.11202v1
Date: Mon, 13 Oct 2025 09:34:40 GMT
ステータス: 翻訳完了
システム内更新日: 2025-10-14 18:06:30.300129
Title: Evaluating Line-level Localization Ability of Learning-based Code Vulnerability Detection Models
Title（参考訳）: 学習型コードの脆弱性検出モデルにおける線形レベルの局所化能力の評価
Authors: Marco Pintore, Giorgio Piras, Angelo Sotgiu, Maura Pintor, Battista Biggio,
Abstract要約: 脆弱性検出のための説明可能性に基づく評価手法を提案する。提案手法は検出アライメント(DA)として定義され,入力されたソースコード間の一致を定量化する。このようなモデルの予測は、常に非負の線に偏っていることを示す。
参考スコア（独自算出の注目度）: 9.543689542888599
License: http://creativecommons.org/licenses/by/4.0/
Abstract: To address the extremely concerning problem of software vulnerability, system security is often entrusted to Machine Learning (ML) algorithms. Despite their now established detection capabilities, such models are limited by design to flagging the entire input source code function as vulnerable, rather than precisely localizing the concerned code lines. However, the detection granularity is crucial to support human operators during software development, ensuring that such predictions reflect the true code semantics to help debug, evaluate, and fix the detected vulnerabilities. To address this issue, recent work made progress toward improving the detector's localization ability, thus narrowing down the vulnerability detection "window" and providing more fine-grained predictions. Such approaches, however, implicitly disregard the presence of spurious correlations and biases in the data, which often predominantly influence the performance of ML algorithms. In this work, we investigate how detectors comply with this requirement by proposing an explainability-based evaluation procedure. Our approach, defined as Detection Alignment (DA), quantifies the agreement between the input source code lines that most influence the prediction and the actual localization of the vulnerability as per the ground truth. Through DA, which is model-agnostic and adaptable to different detection tasks, not limited to our use case, we analyze multiple learning-based vulnerability detectors and datasets. As a result, we show how the predictions of such models are consistently biased by non-vulnerable lines, ultimately highlighting the high impact of biases and spurious correlations. The code is available at https://github.com/pralab/vuln-localization-eval.
Abstract（参考訳）: ソフトウェア脆弱性の極めて深刻な問題に対処するため、システムセキュリティは機械学習(ML)アルゴリズムに委任されることが多い。現在確立されている検出機能にもかかわらず、そのようなモデルは設計によって、関連するコード行を正確にローカライズするのではなく、入力ソースコード全体の機能を脆弱性としてフラグ付けするように制限されている。しかしながら、検出の粒度は、ソフトウェア開発において人間のオペレータをサポートするために重要であり、そのような予測が真のコードセマンティクスを反映して検出された脆弱性のデバッグ、評価、修正を支援する。この問題に対処するため、最近の研究は検出器のローカライゼーション能力の向上に向けて前進し、脆弱性検出の"ウィンドウ"を狭め、よりきめ細かな予測を提供した。しかし、このようなアプローチは、しばしばMLアルゴリズムの性能に大きく影響する、データ内の急激な相関やバイアスの存在を暗黙的に無視する。本研究では,検知器がこの要件にどのように準拠するかを,説明可能性に基づく評価手法を提案する。提案手法は,検出アライメント (DA) として定義され,入力ソースコード行間の一致を定量化する。 DAはモデルに依存しず、異なる検出タスクに適応可能であり、私たちのユースケースに限らず、複数の学習ベースの脆弱性検出とデータセットを分析します。その結果、そのようなモデルの予測は、常に非負の線によってバイアスを受けており、最終的に偏りの強い影響と刺激的な相関が浮き彫りになることを示す。コードはhttps://github.com/pralab/vuln-localization-eval.comで公開されている。

論文の概要: Evaluating Line-level Localization Ability of Learning-based Code Vulnerability Detection Models

関連論文リスト