Fugu-MT 論文翻訳(概要): Keep the Proof State Live: Snapshotting for Efficient Tactic Search in Lean 4

論文の概要: Keep the Proof State Live: Snapshotting for Efficient Tactic Search in Lean 4

arxiv url: http://arxiv.org/abs/2605.25556v2
Date: Wed, 27 May 2026 19:06:07 GMT
ステータス: 翻訳完了
システム内更新日: 2026-05-30 02:45:54.642513
Title: Keep the Proof State Live: Snapshotting for Efficient Tactic Search in Lean 4
Title（参考訳）: リーン4の効果的な戦術探索のためのスナップショッティング
Authors: Austin Shen, Yunong Shi,
Abstract要約: これは、一度精巧な証明状態をキャプチャし、Lean 4言語サーバーへの小さな拡張を通じてブランチ間で再利用します。 48のミニF2F-v2問題に対して,本手法は標準的なフォールバックよりも5.6～50倍の高速化を実現する。
参考スコア（独自算出の注目度）: 1.2816802110958607
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Automated theorem proving systems built on Lean 4 increasingly rely on parallel tactic search over partially specified proofs, such as those generated by Draft-Sketch-Prove (DSP) pipelines. In current systems, each search branch reconstructs a proof state by re-running elaboration, leading to substantial per-branch overhead. In Lean 4 with Mathlib, this cost has two components: (1) import loading, which deserializes pre-compiled libraries (~60 s per branch); and (2) theorem-body elaboration, which re-checks the theorem context up to the target goal (estimated 18-735 s depending on proof complexity). Together, these account for >99% of per-branch wall time, making portfolio-based search impractical at scale. We observe that this overhead arises from a mismatch between the structure of proof search and its execution model: branching is implemented via repeated reconstruction of proof states rather than direct reuse. To address this, we introduce proof-state snapshotting, which captures the elaborated proof state once and reuses it across branches via a small extension to the Lean 4 language server. Across 48 miniF2F-v2 problems (45 prove-phase benchmarks and 3 full end-to-end runs), our approach achieves a 5.6-50x wall-time speedup over the standard fallback (average 14x, median 9.7x). Speedup increases with the number of proof branches. Our method is orthogonal to import-level caching (e.g., Kimina Lean Server), which avoids import loading but not theorem-body elaboration. The patched Lean binary and the Snapshot-DSP pipeline will be released as open source upon publication.
Abstract（参考訳）: Lean 4上に構築された自動定理証明システムは、Draft-Sketch-Prove(DSP)パイプラインで生成されたような、部分的に指定された証明よりも、ますます並列的な戦術探索に依存している。現在のシステムでは、各探索枝はエラボレーションを再実行することで証明状態を再構築し、実質的なブランチ毎のオーバーヘッドをもたらす。このコストは、(1)事前コンパイルされたライブラリ(ブランチあたり約60秒)をデシリアライズするインポートローディング(import loading)、(2)定理ボディのエラボレーション(the theorem-body elaboration)、(2)定理コンテキストを目標目標(証明の複雑さに応じて18～735秒)まで再チェックする(estimated 18～735秒)。同時に、これらはブランチごとのウォールタイムの99%以上を占めており、ポートフォリオベースの検索を大規模に非現実的にしている。このオーバヘッドは,証明検索の構造と実行モデルとのミスマッチから生じることを観察する。これを解決するために、実証状態スナップショットを導入し、一度精巧な証明状態をキャプチャし、Lean 4言語サーバーへの小さな拡張を通じてブランチ間で再利用します。 48 の miniF2F-v2 問題 (45 の証明段階ベンチマークと 3 の完全なエンドツーエンド実行) に対して,本手法は標準フォールバック(平均 14 倍,中央 9.7 倍)よりも5.6-50 倍のウォールタイム高速化を実現する。証明ブランチの数によってスピードアップが増加する。本手法は,輸入レベルのキャッシュ(例えばKimina Lean Server)に直交する。パッチされたLeanバイナリとSnapshot-DSPパイプラインは、公開時にオープンソースとしてリリースされる。

論文の概要: Keep the Proof State Live: Snapshotting for Efficient Tactic Search in Lean 4

関連論文リスト