Fugu-MT Paper Translation (Abstract): Characterizing and Mitigating False-Positive Bug Reports in the Linux Kernel

Paper overview: Characterizing and Mitigating False-Positive Bug Reports in the Linux Kernel

arxiv url: http://arxiv.org/abs/2605.07678v1
Date: Fri, 08 May 2026 12:48:36 GMT
Status: Translation complete
Last updated in system: 2026-05-11 19:43:39.049837
Title: Characterizing and Mitigating False-Positive Bug Reports in the Linux Kernel
Title (reference translation): Characterization and Mitigation of False-Positive Bug Reports in the Linux Kernel
Authors: Jiashuo Tian, Dong Wang, Chen Yang, Haichi Wang, Zan Wang, Junjie Chen
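Abstract summary: This work is a comprehensive study of false-positive bug reports in the Linux kernel. The authors manually construct a dataset of 2,006 bug reports, comprising 1,509 genuine bugs and 497 false positives collected from Bugzilla and Syzkaller. Among various prompting strategies, retrieval-augmented generation (RAG) achieved 91% recall and an F1 score of 88%.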
Reference score (independently computed attention measure): 14.987479226824023
License: http://creativecommons.org/licenses/by-nc-nd/4.0/
Abstract: False-positive bug reports represent a significant yet underexplored challenge in the development and maintenance of the Linux kernel. They occur when correct system behavior is mistakenly flagged as a defect, consuming developer effort without leading to actual code improvements. Such reports can mislead developers, waste debugging resources, and delay the resolution of real bugs. In this paper, we present the first comprehensive empirical study of false-positive bug reports in the Linux kernel. We manually construct a dataset of 2,006 bug reports comprising 1,509 genuine bugs and 497 false positives collected from Bugzilla and Syzkaller. Our analysis indicates that false positives demand effort comparable to real bugs, often requiring extended discussions and non-trivial closure time. They occur in several components, especially File Systems and Drivers, mainly due to external dependencies and semantic misunderstandings. To address this challenge, we evaluate large language models (LLMs) for automated false-positive bug report mitigation. Among various prompting strategies, retrieval-augmented generation (RAG) performs best, achieving 91% recall and an F1 score of 88%. These findings highlight the non-negligible cost of false positive bug reports and show the promise of LLMs for more efficient false positive mitigation in the Linux kernel.
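Abstract (reference translation): False-positive bug reports are an important yet underexplored challenge in the development and maintenance of the Linux kernel. They arise when correct system behavior is mistakenly flagged as a defect, consuming developer effort without leading to any actual code improvement. Such reports can mislead developers, waste debugging resources, and delay the resolution of real bugs. This paper presents a comprehensive study of false-positive bug reports in the Linux kernel. We manually construct a dataset of 2,006 bug reports, comprising 1,509 genuine bugs and 497 false positives collected from Bugzilla and Syzkaller. Our analysis shows that false positives demand effort comparable to real bugs, often requiring extended discussions and non-trivial closure time. They occur in several components, especially File Systems and Drivers, mainly because of external dependencies and semantic misunderstandings. To address this challenge, we evaluate large language models (LLMs) for automated mitigation of false-positive bug reports. Among various prompting strategies, retrieval-augmented generation (RAG) achieved 91% recall and an F1 score of 88%. These results highlight the non-negligible cost of false-positive bug reports and show the promise of LLMs for more efficient false-positive mitigation in the Linux kernel.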
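
The headline figures in the abstract can be cross-checked with a little arithmetic. The sketch below is illustrative only and is not taken from the paper: it computes the share of false positives in the dataset and the precision implied by the reported recall and F1 score, using the standard definition F1 = 2PR / (P + R). The function name `implied_precision` is hypothetical, and because the abstract reports rounded percentages, the implied precision is only an approximation; the abstract itself does not state a precision value.

```python
def implied_precision(recall: float, f1: float) -> float:
    """Solve F1 = 2PR / (P + R) for P, given recall R and F1."""
    denom = 2 * recall - f1
    if denom <= 0:
        raise ValueError("no valid precision for this recall/F1 pair")
    return f1 * recall / denom


# Dataset counts as stated in the abstract.
genuine_bugs = 1509
false_positives = 497
total_reports = genuine_bugs + false_positives

# Fraction of the dataset that is false positives.
fp_share = false_positives / total_reports

# Rounded metrics from the abstract (assumption: both refer to the same
# RAG configuration and the same positive class).
recall = 0.91
f1 = 0.88
precision = implied_precision(recall, f1)

print(f"total reports: {total_reports}")
print(f"false-positive share: {fp_share:.1%}")
print(f"implied precision: {precision:.1%}")
```

Under these assumptions, roughly a quarter of the dataset consists of false positives, and the implied precision sits a few points below the reported recall.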

