Fugu-MT 論文翻訳(概要): MatClaw: An Autonomous Code-First LLM Agent for End-to-End Materials Exploration

論文の概要: MatClaw: An Autonomous Code-First LLM Agent for End-to-End Materials Exploration

arxiv url: http://arxiv.org/abs/2604.02688v1
Date: Fri, 03 Apr 2026 03:32:15 GMT
ステータス: 翻訳完了
システム内更新日: 2026-04-06 17:20:24.303158
Title: MatClaw: An Autonomous Code-First LLM Agent for End-to-End Materials Exploration
Title（参考訳）: MatClaw: エンド・ツー・エンドの材料探査のための自律型コードファーストLLMエージェント
Authors: Chenmu Zhang, Boris I. Yakobson,
Abstract要約: 我々は、Pythonを直接書き、実行するためのコードファーストエージェントであるMatchClawを提示する。 MatClawはコード生成を確実に扱うが、暗黙のドメイン知識に苦慮している。その結果、ガイド付きと完全に自律的な計算材料研究のギャップはこれまで以上に狭くなっていることが明らかとなった。
参考スコア（独自算出の注目度）: 0.0
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Existing LLM agents for computational materials science are constrained by pipeline-bounded architectures tied to specific simulation codes and by dependence on manually written tool functions that grow with task scope. We present MatClaw, a code-first agent that writes and executes Python directly, composing any installed domain library to orchestrate multi-code workflows on remote HPC clusters without predefined tool functions. To sustain coherent execution across multi-day workflows, MatClaw uses a four-layer memory architecture that prevents progressive context loss, and retrieval-augmented generation over domain source code that raises per-step API-call accuracy to ${\sim}$99 %. Three end-to-end demonstrations on ferroelectric CuInP2S6 (machine-learning force field training via active learning, Curie temperature prediction, and heuristic parameter-space search) reveal that the agent handles code generation reliably but struggles with tacit domain knowledge. The missing knowledge, such as appropriate simulation timescales, equilibration protocols, and sampling strategies, is the kind that researchers accumulate through experience but rarely formalize. Two lightweight interventions, literature self-learning and expert-specified constraints, bridge these gaps, defining a guided autonomy model in which the researcher provides high-level domain knowledge while the agent handles workflow execution. Our results demonstrate that the gap between guided and fully autonomous computational materials research is narrower than ever before: LLMs already handle code generation and scientific interpretation reliably, and the rapid improvement in their capabilities will accelerate materials discovery beyond what manual workflows can achieve. All code and benchmarks are open-source.
Abstract（参考訳）: 既存の計算材料科学のLLMエージェントは、特定のシミュレーションコードに結びついたパイプラインバウンドアーキテクチャと、タスクスコープに応じて成長する手書きのツール関数に依存して制約される。我々は,Pythonを直接書き,実行するためのコードファーストエージェントであるMatClawを紹介し,事前に定義されたツール関数を使わずに,リモートHPCクラスタ上でマルチコードワークフローをオーケストレーションするインストール済みのドメインライブラリを構成する。マルチ日ワークフロー間のコヒーレントな実行を維持するため、MatchClawでは、プログレッシブなコンテキスト損失を防止する4層メモリアーキテクチャと、ステップごとのAPIコール精度を${\sim}$99 %に上げるドメインソースコードに対する検索拡張生成を使用している。強誘電体CuInP2S6(アクティブラーニング、キュリー温度予測、ヒューリスティックパラメータ空間探索による機械学習力場トレーニング)のエンドツーエンドの3つのデモでは、エージェントがコード生成を確実に処理するが、暗黙のドメイン知識に苦しむことが明らかになった。適切なシミュレーションタイムスケール、均衡プロトコル、サンプリング戦略などの不足した知識は、研究者が経験を通じて蓄積するが、形式化されることは滅多にない。 2つの軽量な介入、文献の自己学習と専門家が特定した制約は、これらのギャップを埋め、エージェントがワークフローの実行を処理している間に、研究者が高度なドメイン知識を提供するガイド付き自律モデルを定義する。 LLMは、すでにコード生成と科学的解釈を確実に処理しており、その能力の急速な向上は、手動のワークフローが達成できる範囲を超えて、材料発見を加速します。すべてのコードとベンチマークはオープンソースである。

論文の概要: MatClaw: An Autonomous Code-First LLM Agent for End-to-End Materials Exploration

関連論文リスト