Fugu-MT 論文翻訳(概要): SelfAI: Building a Self-Training AI System with LLM Agents

論文の概要: SelfAI: Building a Self-Training AI System with LLM Agents

arxiv url: http://arxiv.org/abs/2512.00403v1
Date: Sat, 29 Nov 2025 09:18:39 GMT
ステータス: 翻訳完了
システム内更新日: 2025-12-02 19:46:34.223622
Title: SelfAI: Building a Self-Training AI System with LLM Agents
Title（参考訳）: SelfAI: LLMエージェントによる自己学習AIシステムの構築
Authors: Xiao Wu, Ting-Zhu Huang, Liang-Jian Deng, Xiaobing Yu, Yu Zhong, Shangqi Deng, Ufaq Khan, Jianghao Wu, Xiaofeng Liu, Imran Razzak, Xiaojun Chang, Yutong Xie,
Abstract要約: SelfAIは、高レベルの研究目的を標準化された実験構成に変換するためのUser Agentを組み合わせた、一般的なマルチエージェントプラットフォームである。実験マネージャは、連続的なフィードバックのための構造化知識ベースを維持しながら、異種ハードウェアをまたいだ並列かつフォールトトレラントなトレーニングを編成する。回帰、コンピュータビジョン、科学計算、医用画像、薬物発見ベンチマークなどを通じて、SelfAIは一貫して高いパフォーマンスを達成し、冗長な試行を減らしている。
参考スコア（独自算出の注目度）: 79.10991818561907
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Recent work on autonomous scientific discovery has leveraged LLM-based agents to integrate problem specification, experiment planning, and execution into end-to-end systems. However, these frameworks are often confined to narrow application domains, offer limited real-time interaction with researchers, and lack principled mechanisms for determining when to halt exploration, resulting in inefficiencies, reproducibility challenges, and under-utilized human expertise. To address these gaps, we propose \textit{SelfAI}, a general multi-agent platform that combines a User Agent for translating high-level research objectives into standardized experimental configurations, a Cognitive Agent powered by LLMs with optimal stopping criteria to iteratively refine hyperparameter searches, and an Experiment Manager responsible for orchestrating parallel, fault-tolerant training workflows across heterogeneous hardware while maintaining a structured knowledge base for continuous feedback. We further introduce two novel evaluation metrics, Score and $\text{AUP}_D$, to quantify discovery efficiency and search diversity. Across regression, NLP, computer vision, scientific computing, medical imaging, and drug discovery benchmarks, SelfAI consistently achieves strong performance and reduces redundant trials compared to classical Bayesian optimization and LLM-based baselines, while enabling seamless interaction with human researchers.
Abstract（参考訳）: 自律的な科学的発見に関する最近の研究は、LLMベースのエージェントを活用して、問題仕様、実験計画、実行をエンドツーエンドシステムに統合している。しかしながら、これらのフレームワークは、狭いアプリケーションドメインに限定され、研究者とのリアルタイムのやりとりが制限され、探索をいつ停止するかを決めるための原則的なメカニズムが欠如しており、その結果、非効率性、再現性の問題、未使用の人間の専門知識が欠如している。これらのギャップに対処するために,高レベルの研究目的を標準化された実験構成に翻訳するユーザエージェントと,高パラメータ探索を反復的に洗練するための最適な停止基準を持つ認知エージェントと,異種ハードウェア間で並列でフォールトトレラントなトレーニングワークフローを編成する実験マネージャとを組み合わせた,汎用的なマルチエージェントプラットフォームである‘textit{SelfAI} を提案する。さらに、探索効率と探索多様性を定量化するために、Scoreと$\text{AUP}_D$という2つの新しい評価指標を導入する。回帰、NLP、コンピュータビジョン、科学計算、医用画像、薬物発見ベンチマークなどを通じて、SelfAIは一貫して高いパフォーマンスを達成し、古典的ベイズ最適化やLLMベースのベースラインと比較して冗長な試行を減らし、人間の研究者とのシームレスな対話を可能にしている。

論文の概要: SelfAI: Building a Self-Training AI System with LLM Agents

関連論文リスト