Fugu-MT paper translation (summary): Inferring multiple helper Dafny assertions with LLMs

Paper summary: Inferring multiple helper Dafny assertions with LLMs

arxiv url: http://arxiv.org/abs/2511.00125v1
Date: Fri, 31 Oct 2025 09:45:39 GMT
Title: Inferring multiple helper Dafny assertions with LLMs
Authors: Álvaro Silva, Alexandra Mendes, Ruben Martins
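Abstract summary: This study investigates the use of Large Language Models to automatically infer missing helper assertions in Dafny programs. It introduces a taxonomy of assertion types to analyze inference difficulty. The results indicate that automated assertion inference can substantially reduce proof engineering effort.
Reference score (attention score computed independently by this site): 47.33158055894705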
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: The Dafny verifier provides strong correctness guarantees but often requires numerous manual helper assertions, creating a significant barrier to adoption. We investigate the use of Large Language Models (LLMs) to automatically infer missing helper assertions in Dafny programs, with a primary focus on cases involving multiple missing assertions. To support this study, we extend the DafnyBench benchmark with curated datasets where one, two, or all assertions are removed, and we introduce a taxonomy of assertion types to analyze inference difficulty. Our approach refines fault localization through a hybrid method that combines LLM predictions with error-message heuristics. We implement this approach in a new tool called DAISY (Dafny Assertion Inference SYstem). While our focus is on multiple missing assertions, we also evaluate DAISY on single-assertion cases. DAISY verifies 63.4% of programs with one missing assertion and 31.7% with multiple missing assertions. Notably, many programs can be verified with fewer assertions than originally present, highlighting that proofs often admit multiple valid repair strategies and that recovering every original assertion is unnecessary. These results demonstrate that automated assertion inference can substantially reduce proof engineering effort and represent a step toward more scalable and accessible formal verification.
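The dataset construction mentioned in the abstract (variants with one, two, or all assertions removed) can be illustrated with a minimal sketch. This is not the authors' tooling: the function names `assertion_lines` and `remove_assertions`, the toy Dafny method, and the assumption that every assertion is a single-line `assert ...;` statement are all hypothetical simplifications. Real Dafny programs also contain multi-line and `assert ... by { ... }` forms that this sketch ignores.

```python
from itertools import combinations


def assertion_lines(source: str) -> list[int]:
    """Return indices of lines holding a single-line `assert ...;` statement.

    Assumption: assertions never span several lines (not true in general).
    """
    return [
        i
        for i, line in enumerate(source.splitlines())
        if line.strip().startswith("assert ") and line.strip().endswith(";")
    ]


def remove_assertions(source: str, k: int) -> list[str]:
    """Return every variant of `source` with exactly k assertion lines deleted."""
    lines = source.splitlines()
    variants = []
    for drop in combinations(assertion_lines(source), k):
        kept = [text for i, text in enumerate(lines) if i not in drop]
        variants.append("\n".join(kept))
    return variants


# Toy Dafny method, used only as input text (hypothetical example).
EXAMPLE = """method M(a: int, b: int)
  requires a > 0 && b > 0
{
  assert a + b > 0;
  assert a * b > 0;
  assert a + b > 1;
}"""

one_missing = remove_assertions(EXAMPLE, 1)
two_missing = remove_assertions(EXAMPLE, 2)
all_missing = remove_assertions(EXAMPLE, len(assertion_lines(EXAMPLE)))
```

Each variant would then be passed to the Dafny verifier to check whether it still verifies. That step is omitted here because it depends on a local Dafny installation.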
