Fugu-MT Paper Translation (Abstract): Boosting Skeleton-Driven SMT Solver Fuzzing by Leveraging LLM to Produce Formula Generators

Paper abstract: Boosting Skeleton-Driven SMT Solver Fuzzing by Leveraging LLM to Produce Formula Generators

arxiv url: http://arxiv.org/abs/2508.20340v1
Date: Thu, 28 Aug 2025 01:21:26 GMT
Status: Translation complete
Last updated in system: 2025-08-29 18:12:01.884997
Title: Boosting Skeleton-Driven SMT Solver Fuzzing by Leveraging LLM to Produce Formula Generators
Title (reference translation): Boosting Skeleton-Driven SMT Solver Fuzzing by Leveraging LLM to Produce Formula Generators
Authors: Maolin Sun, Yibiao Yang, Yuming Zhou,
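Abstract summary: Satisfiability Modulo Theory (SMT) solvers are foundational to modern systems and programming languages research. Prior testing techniques performed well on earlier solver versions, but they struggle to keep pace with rapidly evolving features. Recent approaches based on Large Language Models (LLMs) show promise in exploring advanced solver capabilities.
Reference score (independently computed attention metric): 5.527936960933817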
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Satisfiability Modulo Theory (SMT) solvers are foundational to modern systems and programming languages research, providing the foundation for tasks like symbolic execution and automated verification. Because these solvers sit on the critical path, their correctness is essential, and high-quality test formulas are key to uncovering bugs. However, while prior testing techniques performed well on earlier solver versions, they struggle to keep pace with rapidly evolving features. Recent approaches based on Large Language Models (LLMs) show promise in exploring advanced solver capabilities, but two obstacles remain: nearly half of the generated formulas are syntactically invalid, and iterative interactions with the LLMs introduce substantial computational overhead. In this study, we present Chimera, a novel LLM-assisted fuzzing framework that addresses both issues by shifting from direct formula generation to the synthesis of reusable term (i.e., logical expression) generators. Particularly, Chimera uses LLMs to (1) automatically extract context-free grammars (CFGs) for SMT theories, including solver-specific extensions, from documentation, and (2) synthesize composable Boolean term generators that adhere to these grammars. During fuzzing, Chimera populates structural skeletons derived from existing formulas with the terms iteratively produced by the LLM-synthesized generators. This design ensures syntactic validity while promoting semantic diversity. Notably, Chimera requires only one-time LLM interaction investment, dramatically reducing runtime cost. We evaluated Chimera on two leading SMT solvers: Z3 and cvc5. Our experiments show that Chimera has identified 43 confirmed bugs, 40 of which have already been fixed by developers.
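Abstract (reference translation): Satisfiability Modulo Theory (SMT) solvers are foundational to modern systems and programming languages research, underpinning tasks such as symbolic execution and automated verification. Because these solvers sit on the critical path, their correctness is essential, and high-quality test formulas are key to uncovering bugs. However, while prior testing techniques worked well on earlier solver versions, they struggle to keep up with rapidly evolving features. Recent approaches based on Large Language Models (LLMs) show promise in exploring advanced solver capabilities, but two obstacles remain: nearly half of the generated formulas are syntactically invalid, and iterative interaction with the LLM adds substantial computational overhead. This study presents Chimera, a new LLM-assisted fuzzing framework that addresses both problems by shifting from direct formula generation to the synthesis of reusable term (i.e., logical expression) generators. Specifically, Chimera uses LLMs to (1) automatically extract context-free grammars (CFGs) for SMT theories, including solver-specific extensions, from documentation, and (2) synthesize composable Boolean term generators that conform to those grammars. During fuzzing, Chimera populates structural skeletons derived from existing formulas with terms produced iteratively by the LLM-synthesized generators. This design guarantees syntactic validity while promoting semantic diversity. Notably, Chimera requires only a one-time investment in LLM interaction, which dramatically reduces runtime cost. Chimera was evaluated on two leading SMT solvers, Z3 and cvc5. The experiments show that Chimera found 43 confirmed bugs, 40 of which have already been fixed by the developers.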
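
The skeleton-filling idea described in the abstract can be illustrated with a minimal sketch. It is not Chimera's implementation: the toy integer-arithmetic grammar is hand-written here rather than extracted by an LLM from solver documentation, and the names `gen_int`, `gen_bool`, `fill_skeleton`, `make_formula`, `SKELETON`, and the `<HOLE>` placeholder are all hypothetical. It only shows why terms produced by a grammar-following generator remain syntactically well-formed once they are placed into a skeleton.

```python
import random
import re

# Assumption: a tiny hand-written grammar over two integer constants.
# In the paper, grammars are extracted by an LLM from solver documentation.
INT_VARS = ["x", "y"]

# Assumption: a skeleton is a formula whose Boolean atoms were replaced by holes.
SKELETON = "(assert (and <HOLE> (or <HOLE> (not <HOLE>))))"


def gen_int(rng, depth):
    """Generate an integer-sorted term; stop recursing when depth runs out."""
    if depth <= 0 or rng.random() < 0.4:
        return rng.choice(INT_VARS + [str(rng.randint(0, 9))])
    op = rng.choice(["+", "-", "*"])
    return f"({op} {gen_int(rng, depth - 1)} {gen_int(rng, depth - 1)})"


def gen_bool(rng, depth):
    """Generate a Boolean-sorted term that follows the toy grammar."""
    if depth <= 0 or rng.random() < 0.4:
        op = rng.choice(["<", "<=", "="])
        return f"({op} {gen_int(rng, depth)} {gen_int(rng, depth)})"
    op = rng.choice(["not", "and", "or"])
    if op == "not":
        return f"(not {gen_bool(rng, depth - 1)})"
    return f"({op} {gen_bool(rng, depth - 1)} {gen_bool(rng, depth - 1)})"


def fill_skeleton(skeleton, rng, depth=2):
    """Replace every hole with a freshly generated Boolean term."""
    return re.sub(r"<HOLE>", lambda _match: gen_bool(rng, depth), skeleton)


def make_formula(seed):
    """Build a complete SMT-LIB script: declarations, assertion, check-sat."""
    rng = random.Random(seed)
    decls = "\n".join(f"(declare-const {v} Int)" for v in INT_VARS)
    return f"{decls}\n{fill_skeleton(SKELETON, rng)}\n(check-sat)"


if __name__ == "__main__":
    print(make_formula(0))
```

Because the generators are ordinary functions, they can be called repeatedly during fuzzing with no further LLM queries, which is the cost argument the abstract makes. A real harness would feed each generated script to Z3 and cvc5 and compare their answers.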

Related papers