Fugu-MT 論文翻訳(概要): Neuro-Symbolic Generation and Validation of Memory-Aware Formal Function Specifications

論文の概要: Neuro-Symbolic Generation and Validation of Memory-Aware Formal Function Specifications

arxiv url: http://arxiv.org/abs/2603.13414v1
Date: Thu, 12 Mar 2026 15:02:26 GMT
ステータス: 翻訳完了
システム内更新日: 2026-03-17 16:19:35.178569
Title: Neuro-Symbolic Generation and Validation of Memory-Aware Formal Function Specifications
Title（参考訳）: 記憶機能仕様のニューロ・シンボリック生成と検証
Authors: Liao Zhang, Tong Chen, Xiwei Wu, Qi Liu, Xiyu Zhai, Xinqi Wang, Qinxiang Cao,
Abstract要約: メモリ操作プログラムの形式的検証は、専門家によって書かれたメモリ状態をキャプチャする正確な機能仕様に依存している。本稿では,Cプログラムのメモリ対応形式関数仕様を自動生成するニューロシンボリックフレームワークを提案する。我々は,メモリ対応の形式関数仕様を生成するための200Cプログラミング問題のベンチマークであるLeetCode-C-Specを紹介する。
参考スコア（独自算出の注目度）: 12.783777562919383
License: http://creativecommons.org/licenses/by-nc-nd/4.0/
Abstract: Formal verification of memory-manipulating programs critically depends on precise function specifications that capture memory states written by experts. This requirement has become a major bottleneck as large language models (LLMs) increasingly generate low-level systems code whose correctness cannot be assumed. To enable scalable formal verification, we focus exclusively on function specification generation, deliberately avoiding the synthesis of complex loop invariants that are central to traditional verification pipelines. We propose a neuro-symbolic framework for automatically generating memory-aware formal function specifications for C programs from natural language problem descriptions and function signatures. The pipeline first produces candidate specifications via in-context learning, and then iteratively refines them using compiler diagnostics from symbolic provers and the verification toolchain. In particular, we validate candidate specifications by constructing a proof for the negation of the specification with concrete examples, enabling machine-checked rejection of plausible-but-incorrect specifications. To support systematic evaluation, we introduce LeetCode-C-Spec, a new benchmark of 200 C programming problems for generating memory-aware formal function specifications. Experiments show that iterative refinement substantially improves syntactic validity, while symbolic prover-based refutation significantly enhances correctness assessment by filtering false positives that LLM-only judges frequently accept. Our results demonstrate that combining neural generation with symbolic feedback provides an effective approach to formal specification synthesis for memory-safe systems software.
Abstract（参考訳）: メモリ操作プログラムの形式的検証は、専門家によって書かれたメモリ状態をキャプチャする正確な機能仕様に依存している。この要件は、大規模言語モデル(LLM)が、正確性を想定できない低レベルのシステムコードを生成するにつれて、大きなボトルネックとなっている。スケーラブルな形式検証を実現するため,従来の検証パイプラインの中心となる複雑なループ不変量の合成を意図的に回避して,関数仕様生成にのみ焦点をあてる。自然言語問題記述と関数シグネチャから,C言語プログラムのメモリ対応形式関数仕様を自動生成するニューラルシンボリックフレームワークを提案する。パイプラインはまず、コンテキスト内学習を通じて候補仕様を生成し、その後、シンボリックプローバーと検証ツールチェーンのコンパイラ診断を使用して繰り返し洗練する。特に、具体例で仕様の否定の証明を構築することで、候補仕様を検証し、妥当な仕様の機械検査による拒否を可能にする。本稿では,メモリアウェアな形式関数仕様を生成するための200Cプログラミング問題のベンチマークであるLeetCode-C-Specを紹介する。実験の結果,反復的改善は構文的妥当性を著しく向上させる一方で,記号的証明に基づく難読化は,LLMのみの判断者が頻繁に受け入れる偽陽性をフィルタリングすることにより,精度を著しく向上させることが示された。この結果から,ニューラルネットワークとシンボルフィードバックを組み合わせることで,メモリセーフなシステムソフトウェアのための形式的仕様合成に有効なアプローチが得られた。

論文の概要: Neuro-Symbolic Generation and Validation of Memory-Aware Formal Function Specifications

関連論文リスト