Fugu-MT 論文翻訳(概要): OptProver: Bridging Olympiad and Optimization through Continual Training in Formal Theorem Proving

論文の概要: OptProver: Bridging Olympiad and Optimization through Continual Training in Formal Theorem Proving

arxiv url: http://arxiv.org/abs/2604.23712v1
Date: Sun, 26 Apr 2026 13:54:28 GMT
ステータス: 翻訳完了
システム内更新日: 2026-04-28 17:12:07.51588
Title: OptProver: Bridging Olympiad and Optimization through Continual Training in Formal Theorem Proving
Title（参考訳）: OptProver: 形式的理論証明における継続トレーニングによるオリンピアドのブリッジと最適化
Authors: Chenyi Li, Yanchen Nie, Zhengyu Ming, Gong Zhang, Kun Yuan, Zaiwen Wen,
Abstract要約: 正式なオリンピアド級数学の最近の進歩は、大半が単純ドメイン領域を残している。本稿では,証明手順の検証を行うためのメカニズムを取り入れた特殊パーリティ重み付き最適化を導入する。このベンチマークでは、最先端のドメイン知識を実証することで、ステート・オブ・ワンかつ比較可能な大きさのドメインが達成される。
参考スコア（独自算出の注目度）: 11.256978472844786
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Recent advances in formal theorem proving have focused on Olympiad-level mathematics, leaving undergraduate domains largely unexplored. Optimization, fundamental to machine learning, operations research, and scientific computing, remains underserved by existing provers. Its reliance on domain-specific formalisms (convexity, optimality conditions, and algorithmic analysis) creates significant distribution shift, making naive domain transfer ineffective. We present OptProver, a trained model that achieves robust transfer from Olympiad to undergraduate optimization. Starting from a strong Olympiad-level prover, our pipeline mitigates distribution shift through two key innovations. First, we employ large-scale optimization-focused data curation via expert iteration. Second, we introduce a specialized preference learning objective that integrates perplexity-weighted optimization with a mechanism to penalize valid but non-progressing proof steps. This not only addresses distribution shifts but also guides the search toward efficient trajectories. To enable rigorous evaluation, we construct a novel benchmark in Lean 4 focused on optimization. On this benchmark, OptProver achieves state-of-the-art Pass@1 and Pass@32 among comparably sized models while maintaining competitive performance on general theorem-proving tasks, demonstrating effective domain transfer without catastrophic forgetting.
Abstract（参考訳）: 形式定理の証明の最近の進歩は、オリンピアード級数学に焦点を合わせており、学部の領域はほとんど探索されていない。機械学習、オペレーションリサーチ、科学コンピューティングの基本となる最適化は、まだ既存の証明者によって守られていない。ドメイン固有の形式(凸性、最適性条件、アルゴリズム解析)に依存しているため、大きな分布シフトが生じ、単純で効率的なドメイン転送が不可能となる。 Olympiadから学部最適化への堅牢な移行を実現するためのトレーニングモデルであるOpsProverを提案する。強力なOlympiadレベルの証明から始めて、パイプラインは2つの重要なイノベーションを通じて分散シフトを緩和します。まず、専門家の反復による大規模最適化に焦点を当てたデータキュレーションを用いる。第2に、複雑度重み付け最適化と、有効だが非進行的な証明ステップをペナルティ化するためのメカニズムを統合する、特殊選好学習の目的を導入する。これは分布シフトに対処するだけでなく、効率的な軌跡への探索も導く。厳格な評価を可能にするため、最適化にフォーカスした新しいベンチマークをLean 4で構築する。このベンチマークでは、OptProverは、一般的な定理証明タスクにおける競合性能を維持しながら、比較可能なサイズのモデルの中で最先端のPass@1とPass@32を達成する。

論文の概要: OptProver: Bridging Olympiad and Optimization through Continual Training in Formal Theorem Proving

関連論文リスト