Fugu-MT 論文翻訳(概要): A Tale of 1001 LoC: Potential Runtime Error-Guided Specification Synthesis for Verifying Large-Scale Programs

論文の概要: A Tale of 1001 LoC: Potential Runtime Error-Guided Specification Synthesis for Verifying Large-Scale Programs

arxiv url: http://arxiv.org/abs/2512.24594v1
Date: Wed, 31 Dec 2025 03:31:51 GMT
ステータス: 翻訳完了
システム内更新日: 2026-01-01 23:27:28.555935
Title: A Tale of 1001 LoC: Potential Runtime Error-Guided Specification Synthesis for Verifying Large-Scale Programs
Title（参考訳）: 1001 LoCの物語:大規模プログラム検証のための潜在的実行時エラーガイド型仕様合成
Authors: Zhongyi Wang, Tengjie Lin, Mingshuai Chen, Haokun Li, Mingqi Yang, Xiao Yi, Shengchao Qin, Yixing Luo, Xiaofeng Li, Bin Gu, Liqiang Lu, Jianwei Yin,
Abstract要約: 本稿では、フォーマルな仕様の生成と改善を自動化するためのモジュラーできめ細かいフレームワークであるPregussについて述べる。以上の結果から, Preguss は最先端の LLM ベースのアプローチを大きく上回っていることがわかった。約1000LOCを超える実世界のプログラムに対して高度に自動化されたRTE-freeness検証を可能にし、80.6%88.9%の人間による検証を削減した。
参考スコア（独自算出の注目度）: 34.387390697713556
License: http://creativecommons.org/licenses/by-nc-sa/4.0/
Abstract: Fully automated verification of large-scale software and hardware systems is arguably the holy grail of formal methods. Large language models (LLMs) have recently demonstrated their potential for enhancing the degree of automation in formal verification by, e.g., generating formal specifications as essential to deductive verification, yet exhibit poor scalability due to long-context reasoning limitations and, more importantly, the difficulty of inferring complex, interprocedural specifications. This paper presents Preguss -- a modular, fine-grained framework for automating the generation and refinement of formal specifications. Preguss synergizes between static analysis and deductive verification by steering two components in a divide-and-conquer fashion: (i) potential runtime error-guided construction and prioritization of verification units, and (ii) LLM-aided synthesis of interprocedural specifications at the unit level. We show that Preguss substantially outperforms state-of-the-art LLM-based approaches and, in particular, it enables highly automated RTE-freeness verification for real-world programs with over a thousand LoC, with a reduction of 80.6%~88.9% human verification effort.
Abstract（参考訳）: 大規模なソフトウェアとハードウェアシステムの完全な自動検証は、間違いなく正式な方法の聖杯である。大規模言語モデル(LLM)は、例えば、帰納的検証に不可欠な形式仕様を生成することで、形式的検証における自動化の度合いを向上する可能性を最近証明した。本稿では、フォーマルな仕様の生成と改善を自動化するためのモジュラーできめ細かいフレームワークであるPregussについて述べる。 Pregussは2つのコンポーネントを分割・分散方式で操り、静的解析と導出検証を相乗化する。一検証ユニットの潜在的実行時エラー誘導構築及び優先順位付け (II)単位レベルでのLLM支援による相互運用仕様の合成。 Preguss は最先端の LLM ベースのアプローチよりも大幅に優れており、特に、1000 LoC 以上の実世界のプログラムに対して、高度に自動化された RTE-freeness 検証を可能にし、80.6%~88.9% の人的検証努力を削減できることを示す。

論文の概要: A Tale of 1001 LoC: Potential Runtime Error-Guided Specification Synthesis for Verifying Large-Scale Programs

関連論文リスト