Fugu-MT 論文翻訳(概要): EvoMaster: A Foundational Evolving Agent Framework for Agentic Science at Scale

論文の概要: EvoMaster: A Foundational Evolving Agent Framework for Agentic Science at Scale

arxiv url: http://arxiv.org/abs/2604.17406v2
Date: Tue, 21 Apr 2026 06:36:45 GMT
ステータス: 翻訳完了
システム内更新日: 2026-04-22 14:04:47.92179
Title: EvoMaster: A Foundational Evolving Agent Framework for Agentic Science at Scale
Title（参考訳）: EvoMaster: 大規模エージェントサイエンスのための基礎的進化型エージェントフレームワーク
Authors: Xinyu Zhu, Yuzhu Cai, Zexi Liu, Cheng Wang, Fengyang Li, Wenkai Jin, Wanxu Liu, Zehao Bing, Bingyang Zheng, Jingyi Chai, Shuo Tang, Rui Ye, Yuwen Du, Xianghe Pang, Yaxin Du, Tingjia Miao, Yuzhi Zhang, Ruoxue Liao, Zhaohan Ding, Linfeng Zhang, Yanfeng Wang, Weinan E, Siheng Chen,
Abstract要約: EvoMasterは、Agentic Science at Scaleのために開発された、基本的な進化型エージェントフレームワークである。継続的自己進化の中核的な原則によって、EvoMasterはエージェントに仮説を反復的に洗練させ、自己批判し、徐々に知識を蓄積させる権限を与える。 EvoMasterは、非常に簡単にスケールアップでき、開発者は高度に機能し、自己進化する科学エージェントを構築およびデプロイできる。
参考スコア（独自算出の注目度）: 59.1699095181965
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: The convergence of large language models and agents is catalyzing a new era of scientific discovery: Agentic Science. While the scientific method is inherently iterative, existing agent frameworks are predominantly static, narrowly scoped, and lack the capacity to learn from trial and error. To bridge this gap, we present EvoMaster, a foundational evolving agent framework engineered specifically for Agentic Science at Scale. Driven by the core principle of continuous self-evolution, EvoMaster empowers agents to iteratively refine hypotheses, self-critique, and progressively accumulate knowledge across experimental cycles, faithfully mirroring human scientific inquiry. Crucially, as a domain-agnostic base harness, EvoMaster is exceptionally easy to scale up -- enabling developers to build and deploy highly capable, self-evolving scientific agents for arbitrary disciplines in approximately 100 lines of code. Built upon EvoMaster, we incubated the SciMaster ecosystem across domains such as machine learning, physics, and general science. Evaluations on four authoritative benchmarks (Humanity's Last Exam, MLE-Bench Lite, BrowseComp, and FrontierScience) demonstrate that EvoMaster achieves state-of-the-art scores of 41.1%, 75.8%, 73.3%, and 53.3%, respectively. It comprehensively outperforms the general-purpose baseline OpenClaw with relative improvements ranging from +159% to +316%, robustly validating its efficacy and generality as the premier foundational framework for the next generation of autonomous scientific discovery. EvoMaster is available at https://github.com/sjtu-sai-agents/EvoMaster.
Abstract（参考訳）: 大規模言語モデルとエージェントの収束は、新しい科学的発見の時代を触媒している:エージェントサイエンス。科学的手法は本質的に反復的であるが、既存のエージェントフレームワークは主に静的でスコープが狭く、試行錯誤から学ぶ能力がない。このギャップを埋めるために,我々は,大規模エージェント科学に特化した基盤進化型エージェントフレームワークであるEvoMasterを紹介した。 EvoMasterは継続的自己進化の原理に基づいて、エージェントに仮説を反復的に洗練させ、自己批判し、実験サイクルを通じて知識を徐々に蓄積させ、人間の科学的調査を忠実に反映させる。重要なのは、ドメインに依存しないベースハーネスとして、EvoMasterは驚くほど簡単にスケールアップできます。 EvoMasterをベースに構築された私たちは、機械学習、物理学、一般科学といった分野にまたがるSciMasterエコシステムをインキュベーションしました。 4つの権威ベンチマーク(Humanity's Last Exam、MLE-Bench Lite、BrowseComp、FrontierScience)の評価は、EvoMasterがそれぞれ41.1%、75.8%、73.3%、53.3%の最先端スコアを達成したことを示している。汎用ベースラインのOpenClawは、+159%から+316%までの相対的な改善により、次世代の自律科学発見のための基礎となる枠組みとして、その有効性と一般性をしっかりと検証している。 EvoMasterはhttps://github.com/sjtu-sai-agents/EvoMasterで入手できる。

論文の概要: EvoMaster: A Foundational Evolving Agent Framework for Agentic Science at Scale

関連論文リスト