Fugu-MT Paper Translation (Abstract): Autoformalizer with Tool Feedback

Paper abstract: Autoformalizer with Tool Feedback

arxiv url: http://arxiv.org/abs/2510.06857v1
Date: Wed, 08 Oct 2025 10:25:12 GMT
Status: Translation complete
Last updated in system: 2025-10-09 16:41:20.423492
Title: Autoformalizer with Tool Feedback
Authors: Qi Guo, Jianing Wang, Jianfei Zhang, Deyang Kong, Xiangzhou Huang, Xiangyu Xi, Wei Wang, Jingang Wang, Xunliang Cai, Shikun Zhang, Wei Ye
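Abstract summary: Autoformalization addresses the scarcity of data for Automated Theorem Proving (ATP) by translating mathematical problems from natural language into formal statements. Existing formalizers struggle to consistently generate statements that are both syntactically valid and semantically consistent. This paper proposes the Autoformalizer with Tool Feedback (ATF).
Reference score (independently computed attention metric): 52.334957386319864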
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Autoformalization addresses the scarcity of data for Automated Theorem Proving (ATP) by translating mathematical problems from natural language into formal statements. Recent work has shifted from directly prompting large language models to training end-to-end formalizer models from scratch, achieving remarkable advances. However, existing formalizers still struggle to consistently generate statements that are both syntactically valid and semantically consistent. To address this issue, we propose the Autoformalizer with Tool Feedback (ATF), a novel approach that incorporates syntactic and consistency information as tools into the formalization process. By integrating Lean 4 compilers for syntax corrections and employing a multi-LLMs-as-judge approach for consistency validation, the model can adaptively refine generated statements according to the tool feedback, enhancing both syntactic validity and semantic consistency. The training of ATF involves a cold-start phase on synthetic tool-calling data, an expert iteration phase to improve formalization capabilities, and Direct Preference Optimization to alleviate ineffective revisions. Experimental results show that ATF markedly outperforms a range of baseline formalizer models, with its superior performance further validated by human evaluations. Subsequent analysis reveals that ATF demonstrates excellent inference scaling properties. Moreover, we open-source Numina-ATF, a dataset containing 750K synthetic formal statements to facilitate advancements in autoformalization and ATP research.
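The refinement loop described in the abstract can be pictured roughly as follows. This is a minimal sketch, not the authors' implementation: in ATF the model itself decides when to call tools, whereas here an external loop drives the calls. The names `ToolResult`, `refine_with_tool_feedback`, `generate`, `check_syntax`, and `judges` are hypothetical, as are the `max_rounds` limit and the rule that every judge must agree. The toy stubs stand in for a formalizer model, a Lean 4 compiler check, and LLM judges.

```python
from dataclasses import dataclass
from typing import Callable, List, Optional


@dataclass
class ToolResult:
    ok: bool        # True if the tool accepted the statement
    feedback: str   # error or critique text when ok is False


def refine_with_tool_feedback(
    problem: str,
    generate: Callable[[str, Optional[str]], str],
    check_syntax: Callable[[str], ToolResult],
    judges: List[Callable[[str, str], ToolResult]],
    max_rounds: int = 3,  # assumption: a fixed revision budget
) -> Optional[str]:
    """Generate a statement, then revise it until all tools accept it."""
    feedback: Optional[str] = None
    for _ in range(max_rounds):
        statement = generate(problem, feedback)
        syntax = check_syntax(statement)
        if not syntax.ok:
            feedback = "syntax: " + syntax.feedback
            continue
        votes = [judge(problem, statement) for judge in judges]
        # Assumption: the statement passes only if every judge agrees.
        if all(v.ok for v in votes):
            return statement
        feedback = "consistency: " + "; ".join(
            v.feedback for v in votes if not v.ok
        )
    return None  # budget exhausted without a valid statement


# Toy stand-ins so the sketch runs without a model or a Lean toolchain.
def toy_generate(problem: str, feedback: Optional[str]) -> str:
    if feedback is None:
        return "theorem t (a b : Nat) : a + b = b + a"
    if feedback.startswith("syntax"):
        return "theorem t (a b : Nat) : a * b = b * a := by sorry"
    return "theorem t (a b : Nat) : a + b = b + a := by sorry"


def toy_check_syntax(statement: str) -> ToolResult:
    ok = statement.endswith(":= by sorry")
    return ToolResult(ok, "" if ok else "missing ':= by sorry'")


def toy_judge(problem: str, statement: str) -> ToolResult:
    ok = "+" in statement
    return ToolResult(ok, "" if ok else "statement does not mention addition")


result = refine_with_tool_feedback(
    "Show that addition of natural numbers is commutative.",
    toy_generate,
    toy_check_syntax,
    [toy_judge, toy_judge],
)
print(result)
```

In this toy run the first draft fails the syntax check, the second compiles but is rejected by the judges, and the third is accepted. Running syntax checking before consistency judging is a design choice of the sketch: it avoids spending judge calls on statements that would not compile anyway.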


Related papers