Fugu-MT 論文翻訳(概要): ExplainFuzz: Explainable and Constraint-Conditioned Test Generation with Probabilistic Circuits

論文の概要: ExplainFuzz: Explainable and Constraint-Conditioned Test Generation with Probabilistic Circuits

arxiv url: http://arxiv.org/abs/2604.06559v1
Date: Wed, 08 Apr 2026 01:20:43 GMT
ステータス: 翻訳完了
システム内更新日: 2026-04-09 17:30:51.284065
Title: ExplainFuzz: Explainable and Constraint-Conditioned Test Generation with Probabilistic Circuits
Title（参考訳）: ExplainFuzz:確率回路を用いた説明可能・制約付きテスト生成
Authors: Annaëlle Baiget, Jaron Maene, Seongmin Lee, Benjie Wang, Guy Van den Broeck, Miryung Kim,
Abstract要約: 本稿では,確率回路(PC)を利用したテスト生成フレームワークであるExplainFuzzを紹介する。その結果、ExplainFuzzは、生成した入力のコヒーレンスとリアリズムを改善し、pCFGや文法を意識しないPC、LLMと比べて大きなパープレキシティ低減を実現していることがわかった。
参考スコア（独自算出の注目度）: 35.18215841558984
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Understanding and explaining the structure of generated test inputs is essential for effective software testing and debugging. Existing approaches--including grammar-based fuzzers, probabilistic Context-Free Grammars (pCFGs), and Large Language Models (LLMs)--suffer from critical limitations. They frequently produce ill-formed inputs that fail to reflect realistic data distributions, struggle to capture context-sensitive probabilistic dependencies, and lack explainability. We introduce ExplainFuzz, a test generation framework that leverages Probabilistic Circuits (PCs) to learn and query structured distributions over grammar-based test inputs interpretably and controllably. Starting from a Context-Free Grammar (CFG), ExplainFuzz compiles a grammar-aware PC and trains it on existing inputs. New inputs are then generated via sampling. ExplainFuzz utilizes the conditioning capability of PCs to incorporate test-specific constraints (e.g., a query must have GROUP BY), enabling constrained probabilistic sampling to generate inputs satisfying grammar and user-provided constraints. Our results show that ExplainFuzz improves the coherence and realism of generated inputs, achieving significant perplexity reduction compared to pCFGs, grammar-unaware PCs, and LLMs. By leveraging its native conditioning capability, ExplainFuzz significantly enhances the diversity of inputs that satisfy a user-provided constraint. Compared to grammar-aware mutational fuzzing, ExplainFuzz increases bug-triggering rates from 35% to 63% in SQL and from 10% to 100% in XML. These results demonstrate the power of a learned input distribution over mutational fuzzing, which is often limited to exploring the local neighborhood of seed inputs. These capabilities highlight the potential of PCs to serve as a foundation for grammar-aware, controllable test generation that captures context-sensitive, probabilistic dependencies.
Abstract（参考訳）: 生成されたテストインプットの構造を理解し、説明することは、効果的なソフトウェアテストとデバッグに不可欠である。既存のアプローチ - 文法ベースのファジィザ、確率論的文脈自由文法(pCFG)、Large Language Models(LLM)など - は、臨界的な制限から救われる。彼らはしばしば、現実的なデータ分布を反映できず、コンテキストに敏感な確率的依存関係を捉えようと苦労し、説明不可能な不整形入力を生成する。本稿では,確率回路(PC)を利用したテスト生成フレームワークであるExplainFuzzを紹介する。 Context-Free Grammar (CFG) から始まり、ExplainFuzzは文法を意識したPCをコンパイルし、既存の入力でそれを訓練する。新しい入力はサンプリングによって生成される。 ExplainFuzzは、PCの条件付け機能を利用して、テスト固有の制約(例えば、クエリはGROUP BYを持つ必要がある)を導入し、制約付き確率的サンプリングを可能にし、文法とユーザが提供する制約を満たす入力を生成する。その結果, ExplainFuzzは, pCFG, 文法を意識しないPC, LLMに比べて, 入力のコヒーレンスとリアリズムを向上し, パープレキシティの大幅な低減を実現していることがわかった。ネイティブ条件付け機能を活用することで、ExplainFuzzは、ユーザが提供する制約を満たす入力の多様性を大幅に向上する。文法を意識した突然変異ファズリングと比較して、ExplainFuzzは、バグトリガ率をSQLで35%から63%、XMLで10%から100%に向上させる。これらの結果は、しばしばシード入力の局所的な近傍を探索することに制限される突然変異ファジィングに対する学習された入力分布のパワーを示す。これらの機能は、文脈に敏感で確率的な依存関係をキャプチャする文法を意識した制御可能なテスト生成の基礎として機能するPCの可能性を強調している。

論文の概要: ExplainFuzz: Explainable and Constraint-Conditioned Test Generation with Probabilistic Circuits

関連論文リスト