Fugu-MT paper translation (abstract): Learning to Triage Taint Flows Reported by Dynamic Program Analysis in Node.js Packages

Paper summary: Learning to Triage Taint Flows Reported by Dynamic Program Analysis in Node.js Packages

arxiv url: http://arxiv.org/abs/2510.20739v1
Date: Thu, 23 Oct 2025 16:58:02 GMT
Status: translation complete
Last updated in system: 2025-10-25 03:08:18.432585
Title: Learning to Triage Taint Flows Reported by Dynamic Program Analysis in Node.js Packages
Title (reference translation): Learning to triage taint flows reported by dynamic program analysis of Node.js packages
Authors: Ronghao Ni, Aidan Z. H. Yang, Min-Chien Hsu, Nuno Sabino, Limin Jia, Ruben Martins, Darion Cassel, Kevin Cheang
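Abstract summary: Program analysis tools often produce large volumes of vulnerability reports. This paper investigates whether machine learning can be applied to prioritizing the vulnerabilities reported by program analysis tools.
Reference score (independently computed attention score): 10.447259698656884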
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Program analysis tools often produce large volumes of candidate vulnerability reports that require costly manual review, creating a practical challenge: how can security analysts prioritize the reports most likely to be true vulnerabilities? This paper investigates whether machine learning can be applied to prioritizing vulnerabilities reported by program analysis tools. We focus on Node.js packages and collect a benchmark of 1,883 Node.js packages, each containing one reported ACE or ACI vulnerability. We evaluate a variety of machine learning approaches, including classical models, graph neural networks (GNNs), large language models (LLMs), and hybrid models that combine GNNs and LLMs, trained on data based on a dynamic program analysis tool's output. The top LLM achieves $F_1 = 0.915$, while the best GNN and classical ML models reach $F_1 = 0.904$. At a false-negative rate below 7%, the leading model eliminates 66.9% of benign packages from manual review, taking around 60 ms per package. If the best model is tuned to operate at a precision level of 0.8 (i.e., allowing 20% false positives among all warnings), our approach can detect 99.2% of exploitable taint flows while missing only 0.8%, demonstrating strong potential for real-world vulnerability triage.
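Abstract (reference translation): Program analysis tools often produce large volumes of vulnerability reports that require costly manual review, which creates a practical challenge. This paper investigates whether machine learning can be applied to prioritizing the vulnerabilities reported by program analysis tools. We focus on Node.js packages and collect a benchmark of 1,883 Node.js packages. We evaluate a variety of machine learning approaches, including classical models, graph neural networks (GNNs), large language models (LLMs), and hybrid models that combine GNNs and LLMs, trained on data based on the output of a dynamic program analysis tool. The top LLM achieves $F_1 = 0.915$, while the best GNN and classical ML models reach $F_1 = 0.904$. At a false-negative rate below 7%, the leading model eliminates 66.9% of benign packages from manual review, taking around 60 ms per package. When the best model is tuned to operate at a precision of 0.8 (i.e., allowing 20% false positives among all warnings), the approach detects 99.2% of exploitable taint flows while missing only 0.8%, demonstrating strong potential for real-world vulnerability triage.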
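
The abstract's operating-point claim (precision fixed at 0.8, with recall reported at that point) can be illustrated with a minimal sketch. This is not the authors' code: the function name `threshold_for_precision` and the toy scores and labels are hypothetical, chosen only to show how a score cutoff might be selected so that precision meets a target while recall is as high as possible.

```python
from typing import List, Optional, Tuple


def threshold_for_precision(
    scores: List[float],
    labels: List[int],
    target_precision: float,
) -> Optional[Tuple[float, float, float]]:
    """Return (cutoff, precision, recall) for the lowest score cutoff whose
    precision is at least target_precision, or None if no cutoff qualifies.

    A report is flagged when its score is >= cutoff. Labels are 1 for a
    true (exploitable) flow and 0 for a benign one.
    """
    total_pos = sum(labels)
    ranked = sorted(zip(scores, labels), key=lambda pair: pair[0], reverse=True)

    best = None
    tp = fp = 0
    i = 0
    while i < len(ranked):
        cutoff = ranked[i][0]
        # Consume every report that shares this score so ties are handled together.
        while i < len(ranked) and ranked[i][0] == cutoff:
            if ranked[i][1] == 1:
                tp += 1
            else:
                fp += 1
            i += 1
        precision = tp / (tp + fp)
        recall = tp / total_pos if total_pos else 0.0
        # Cutoffs are visited from high to low, so recall never decreases;
        # the last qualifying cutoff therefore has the highest recall.
        if precision >= target_precision:
            best = (cutoff, precision, recall)
    return best


# Toy data (made up for illustration, not taken from the paper).
scores = [0.95, 0.90, 0.80, 0.70, 0.60, 0.40, 0.30, 0.10]
labels = [1, 1, 1, 0, 1, 0, 0, 0]

result = threshold_for_precision(scores, labels, target_precision=0.8)
print(result)
```

On this toy input the selected cutoff flags five reports, four of which are true flows, so precision is 0.8 and recall is 1.0. The miss rate is simply one minus recall, which is how a 99.2% detection rate corresponds to 0.8% missed flows in the abstract.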

Related papers