Fugu-MT Paper Translation (Abstract): PAT-Agent: Autoformalization for Model Checking

Paper summary: PAT-Agent: Autoformalization for Model Checking

arxiv url: http://arxiv.org/abs/2509.23675v1
Date: Sun, 28 Sep 2025 06:32:14 GMT
Status: Translation complete
Last updated in system: 2025-09-30 22:32:19.367363
Title: PAT-Agent: Autoformalization for Model Checking
Title (reference translation): PAT-Agent: Automation of Model Checking
Authors: Xinyue Zuo, Yifan Zhang, Hongshu Wang, Yufan Cai, Zhe Hou, Jing Sun, Jin Song Dong
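Abstract summary: PAT-Agent is an end-to-end framework for natural language autoformalization and formal model repair. It combines the generative capabilities of large language models with the rigor of formal verification.
Reference score (independently computed attention score): 17.082027022913998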
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Recent advances in large language models (LLMs) offer promising potential for automating formal methods. However, applying them to formal verification remains challenging due to the complexity of specification languages, the risk of hallucinated output, and the semantic gap between natural language and formal logic. We introduce PAT-Agent, an end-to-end framework for natural language autoformalization and formal model repair that combines the generative capabilities of LLMs with the rigor of formal verification to automate the construction of verifiable formal models. In PAT-Agent, a Planning LLM first extracts key modeling elements and generates a detailed plan using semantic prompts, which then guides a Code Generation LLM to synthesize syntactically correct and semantically faithful formal models. The resulting code is verified using the Process Analysis Toolkit (PAT) model checker against user-specified properties, and when discrepancies occur, a Repair Loop is triggered to iteratively correct the model using counterexamples. To improve flexibility, we built a web-based interface that enables users, particularly non-FM-experts, to describe, customize, and verify system behaviors through user-LLM interactions. Experimental results on 40 systems show that PAT-Agent consistently outperforms baselines, achieving high verification success with superior efficiency. The ablation studies confirm the importance of both planning and repair components, and the user study demonstrates that our interface is accessible and supports effective formal modeling, even for users with limited formal methods experience.
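Abstract (reference translation): Recent advances in large language models (LLMs) hold promising potential for automating formal methods. However, applying them to formal verification remains difficult because of the complexity of specification languages, the risk of hallucinated output, and the semantic gap between natural language and formal logic. PAT-Agent is an end-to-end framework for natural language autoformalization and formal model repair that combines the generative capabilities of LLMs with the rigor of formal verification to automate the construction of verifiable formal models. In PAT-Agent, a Planning LLM first extracts the key modeling elements and generates a detailed plan using semantic prompts; the plan then guides a Code Generation LLM to synthesize syntactically correct and semantically faithful formal models. The resulting code is verified against user-specified properties using the Process Analysis Toolkit (PAT) model checker, and when discrepancies occur, a Repair Loop iteratively corrects the model using counterexamples. To improve flexibility, we built a web-based interface that lets users, particularly non-FM-experts, describe, customize, and verify system behaviors through user-LLM interactions. Experimental results on 40 systems show that PAT-Agent consistently outperforms the baselines, achieving high verification success with superior efficiency. The ablation studies confirm the importance of both the planning and repair components, and the user study shows that the interface is accessible and supports effective formal modeling, even for users with limited formal methods experience.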
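
Illustrative sketch (not from the paper): the abstract describes a plan, generate, verify, repair pipeline, and the control flow can be outlined roughly as follows. This is a minimal sketch under stated assumptions. The names `autoformalize`, `CheckResult`, `plan`, `generate`, `check`, `repair`, and `max_rounds` are all hypothetical; the two LLM stages and the PAT model checker are passed in as plain callables because the abstract does not specify their interfaces, and the toy stand-ins at the bottom exist only so the example runs.

```python
from dataclasses import dataclass
from typing import Callable, Optional


@dataclass
class CheckResult:
    """Outcome of one model-checking run (hypothetical structure)."""
    ok: bool
    counterexample: Optional[str] = None


def autoformalize(
    description: str,
    properties: list[str],
    plan: Callable[[str], str],
    generate: Callable[[str, str], str],
    check: Callable[[str, list[str]], CheckResult],
    repair: Callable[[str, str], str],
    max_rounds: int = 3,
) -> tuple[Optional[str], int]:
    """Return (verified model or None, number of repair rounds used)."""
    # Planning stage feeds the code generation stage.
    model = generate(description, plan(description))
    for rounds in range(max_rounds + 1):
        result = check(model, properties)
        if result.ok:
            return model, rounds
        if rounds == max_rounds:
            break  # repair budget exhausted
        # Repair stage: revise the model using the counterexample.
        model = repair(model, result.counterexample or "")
    return None, max_rounds


# Toy stand-ins (assumptions) so the sketch runs without an LLM or PAT.
def toy_check(model: str, props: list[str]) -> CheckResult:
    if "fixed" in model:
        return CheckResult(ok=True)
    return CheckResult(ok=False, counterexample="trace: a -> b")


model, rounds = autoformalize(
    "a two-state toggle",
    ["deadlock-free"],
    plan=lambda d: f"plan for {d}",
    generate=lambda d, p: "draft model",
    check=toy_check,
    repair=lambda m, cex: m + " fixed",
)
print(model, rounds)
```

Passing the stages as callables keeps the loop independent of any particular LLM client or checker invocation, and the `max_rounds` bound is an assumption added here so the loop always terminates.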
