Fugu-MT 論文翻訳(概要): What are the Right Symmetries for Formal Theorem Proving?

論文の概要: What are the Right Symmetries for Formal Theorem Proving?

arxiv url: http://arxiv.org/abs/2605.22257v1
Date: Thu, 21 May 2026 10:00:47 GMT
ステータス: 翻訳完了
システム内更新日: 2026-05-22 16:35:42.199655
Title: What are the Right Symmetries for Formal Theorem Proving?
Title（参考訳）: 形式理論の正しい対称性は何か?
Authors: Krzysztof Olejniczak, Radoslav Dimitrov, Xingyue Huang, Bernardo Cuenca Grau, Jinwoo Kim, İsmail İlkan Ceylan,
Abstract要約: 意味論的に等価な文は、非常に異なる証明成功率を示すことを示す。これは中心的な疑問を提起する: 形式的定理証明の適切な対称性は何か? 証明戦術によって誘導される構成的、一般的には非可逆な変換をキャプチャーするカテゴリ理論フレームワークである書字カテゴリを導入する。
参考スコア（独自算出の注目度）: 23.981613344642152
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Formal theorem provers based on large language models (LLMs) are highly sensitive to superficial variations in problem representation: semantically equivalent statements can exhibit drastically different proof success rates, revealing a failure to respect structural symmetries inherent in formal mathematics. This raises a central question: what are the right symmetries for formal theorem proving? We introduce rewriting categories, a category-theoretic framework capturing the compositional, generally non-invertible transformations induced by proof tactics, and use it to formalize two symmetry notions: proof equivariance, governing how proof distributions transform under rewrites, and success invariance (i.e., invariance of success probability), requiring equivalent statements to be solved with the same probability. We observe that state-based next-tactic provers naturally satisfy proof equivariance by operating on proof states. In contrast, state-of-the-art LLM-based provers satisfy neither property, exhibiting large performance variation across equivalent formulations. To mitigate this, we propose test-time methods that aggregate over equivalent rewritings of the input, showing theoretically that they recover success invariance in the sampling limit, and empirically, that they improve robustness and performance under fixed inference budgets. Our results highlight symmetry as a key missing inductive bias in LLM-based theorem proving and suggest test-time computation as a practical route to approximate it.
Abstract（参考訳）: 大言語モデル(LLM)に基づく形式的定理証明は、問題表現における表面的変動に非常に敏感である:意味論的に等価なステートメントは、形式数学に固有の構造的対称性を尊重するのに失敗することを明らかにする、劇的に異なる証明成功率を示す。これは中心的な疑問を提起する: 形式的定理証明の適切な対称性は何か? 我々は、証明戦術によって引き起こされる構成的、一般的には非可逆変換をキャプチャするカテゴリ理論のフレームワークである書き換え圏を導入し、証明の同値性、証明分布が書き換えの下でどのように変換されるかの制御、成功確率の不変性、同じ確率で解決される同等の文を必要とする2つの対称性の概念を定式化する。状態ベースの次戦術プローバーは証明状態の操作により自然に証明等価性を満足する。対照的に、最先端のLLMベースのプローサはどちらの特性も満足せず、等価な定式化にまたがって大きな性能変化を示す。そこで本研究では,入力の等価な書き直しを集約し,サンプリング限界のばらつきを再現し,固定された推論予算下での堅牢性と性能を向上する試験時間法を提案する。本研究では, LLMに基づく定理の証明において, 対称性を欠落した帰納的バイアスとして強調し, 実測経路としてテスト時間計算を提案する。

論文の概要: What are the Right Symmetries for Formal Theorem Proving?

関連論文リスト