Fugu-MT 論文翻訳(概要): Lean Atlas: An Integrated Proof Environment for Scalable Human-AI Collaborative Formalization

論文の概要: Lean Atlas: An Integrated Proof Environment for Scalable Human-AI Collaborative Formalization

arxiv url: http://arxiv.org/abs/2604.16347v1
Date: Mon, 16 Mar 2026 14:19:33 GMT
ステータス: 翻訳完了
システム内更新日: 2026-05-04 02:32:13.915185
Title: Lean Atlas: An Integrated Proof Environment for Scalable Human-AI Collaborative Formalization
Title（参考訳）: Lean Atlas: スケーラブルなHuman-AIコラボレーションの形式化のための統合的な証明環境
Authors: Banri Yanahama, Akiyoshi Sannai,
Abstract要約: 本稿では,人間科学者とAIが共同で公式な証明を行う,ループ型アプローチを提案する。 Lean Atlasはリーン4プロジェクトの依存性グラフをインタラクティブなWebビューアとして視覚化するツールです。コア機能であるLean Compassは、選択された定理セットが与えられたら、自動的にプロジェクト固有のノードを抽出するアルゴリズムである。
参考スコア（独自算出の注目度）: 1.711666249985278
License: http://creativecommons.org/licenses/by/4.0/
Abstract: AI-driven autoformalization of mathematics is advancing rapidly. However, the type checker of a proof assistant guarantees only the logical correctness of proofs; it does not verify whether propositions and definitions faithfully capture their intended mathematical content. Consequently, AI-generated formal proofs can exhibit semantic hallucination-passing the type checker yet failing to express the intended mathematics. We propose a human-in-the-loop approach in which human scientists and AI collaboratively produce formal proofs, with humans responsible for the semantic verification of propositions and definitions. To realize this approach, we develop Lean Atlas, a Lean 4 tool that visualizes the dependency graph of a Lean 4 project as an interactive web viewer, enabling human scientists to grasp the overall structure of a formalization efficiently. Its core feature, Lean Compass, is an algorithm that, given a selected theorem set, automatically extracts the project-specific nodes whose semantic correctness can affect those target statements, thereby reducing the candidate set for semantic review in large-scale formalizations. We further define *aligned Lean code* as formalization code that has undergone human semantic verification, and propose it as a quality standard for AI-generated formalizations. We evaluate the tool on six Lean 4 formalization projects with different structural characteristics; proof-heavy projects (PrimeNumberTheoremAnd, Carleson, Brownian Motion) achieved 94-99% average node reduction, a 6-theorem milestone subset of FLT achieved 59.8%, mixed PhysLib 69.0%, and definition-heavy XMSS 27.3%. Lean Atlas is available as open-source software at https://github.com/NyxFoundation/lean-atlas .
Abstract（参考訳）: AIによる数学の自己形式化は急速に進んでいる。しかし、証明アシスタントの型チェッカーは証明の論理的正当性のみを保証する。その結果、AIが生成した形式証明は、意図した数学を表現できない型チェッカーを通した意味幻覚を示すことができる。本稿では,人間科学者とAIが共同で公式な証明を作成し,命題や定義のセマンティックな検証に責任を負う,ループ型アプローチを提案する。このアプローチを実現するために,リーン4プロジェクトの依存性グラフをインタラクティブなWebビューアとして視覚化するLean 4ツールであるLean Atlasを開発した。その中核的な特徴であるLean Compassは、選択された定理セットを与えられたアルゴリズムであり、意味的正しさが対象のステートメントに影響を与える可能性のあるプロジェクト固有のノードを自動的に抽出し、大規模な形式化における意味的レビューの候補セットを減らす。我々はさらに、人間の意味的検証を行う形式コードとして*整列したリーンコード*を定義し、AI生成の形式化の品質標準として提案する。我々は,このツールを6つのLean 4形式化プロジェクトで異なる構造特性で評価した。証明量の多いプロジェクト(PrimeNumberTheoremAnd,Carleson,Brownian Motion)は94～99%の平均ノード削減,FLTの6つの理論的なサブセットサブセットは59.8%,混合PhysLib 69.0%,定義量の多いXMSS 27.3%を達成した。 Lean Atlasはオープンソースソフトウェアとしてhttps://github.com/NyxFoundation/lean-atlasで公開されている。

論文の概要: Lean Atlas: An Integrated Proof Environment for Scalable Human-AI Collaborative Formalization

関連論文リスト