Fugu-MT 論文翻訳(概要): AIBuildAI: An AI Agent for Automatically Building AI Models

論文の概要: AIBuildAI: An AI Agent for Automatically Building AI Models

arxiv url: http://arxiv.org/abs/2604.14455v1
Date: Wed, 15 Apr 2026 22:17:05 GMT
ステータス: 翻訳完了
システム内更新日: 2026-04-17 21:29:31.633165
Title: AIBuildAI: An AI Agent for Automatically Building AI Models
Title（参考訳）: AIBuildAI - AIモデルの自動構築のためのAIエージェント
Authors: Ruiyi Zhang, Peijia Qin, Qi Cao, Li Zhang, Pengtao Xie,
Abstract要約: タスク記述とトレーニングデータからAIモデルを自動的に構築するAIエージェントであるAIBuildAIを紹介する。我々は、リアルなKaggleスタイルのAI開発タスクのベンチマークであるMLE-Bench上でAIBuildAIを評価する。
参考スコア（独自算出の注目度）: 31.05147308924019
License: http://creativecommons.org/licenses/by-nc-nd/4.0/
Abstract: AI models underpin modern intelligent systems, driving advances across science, medicine, finance, and technology. Yet developing high-performing AI models remains a labor-intensive process that requires expert practitioners to iteratively design architectures, engineer representations, implement training pipelines and refine approaches through empirical evaluation. Existing AutoML methods partially alleviate this burden but remain limited to narrow aspects such as hyperparameter optimization and model selection within predefined search spaces, leaving the full development lifecycle largely dependent on human expertise. To address this gap, we introduce AIBuildAI, an AI agent that automatically builds AI models from a task description and training data. AIBuildAI adopts a hierarchical agent architecture in which a manager agent coordinates three specialized sub-agents: a designer for modeling strategy, a coder for implementation and debugging, and a tuner for training and performance optimization. Each sub-agent is itself a large language model (LLM) based agent capable of multi-step reasoning and tool use, enabling end-to-end automation of the AI model development process that goes beyond the scope of existing AutoML approaches. We evaluate AIBuildAI on MLE-Bench, a benchmark of realistic Kaggle-style AI development tasks spanning visual, textual, time-series and tabular modalities. AIBuildAI ranks first on MLE-Bench with a medal rate of 63.1%, outperforming all existing baseline methods and matching the capability of highly experienced AI engineers. These results demonstrate that hierarchical agent systems can automate the full AI model development process from task specification to deployable model, suggesting a pathway toward broadly accessible AI development with minimal human intervention.
Abstract（参考訳）: AIモデルは現代のインテリジェントシステムを支えるもので、科学、医学、金融、テクノロジーの進歩を推進している。しかし、ハイパフォーマンスなAIモデルの開発は、専門家が経験的評価を通じて、アーキテクチャ、エンジニア表現、トレーニングパイプラインの実装、アプローチの洗練を反復的に設計することを要求する、労働集約的なプロセスのままである。既存のAutoMLメソッドは、この負担を部分的に軽減するが、事前定義された検索空間内のハイパーパラメータ最適化やモデル選択といった狭い側面に限られており、完全な開発ライフサイクルは人間の専門知識に大きく依存している。このギャップに対処するために、タスク記述とトレーニングデータからAIモデルを自動的に構築するAIエージェントであるAIBuildAIを紹介します。 AIBuildAIは階層的なエージェントアーキテクチャを採用しており、マネージャエージェントが3つの特別なサブエージェントをコーディネートする。各サブエージェントはそれ自体、多段階の推論とツール使用が可能な大規模言語モデル(LLM)ベースのエージェントであり、既存のAutoMLアプローチの範囲を越えて、AIモデル開発プロセスのエンドツーエンドの自動化を可能にする。 MLE-Bench上でAIBuildAIを評価する。これは、ビジュアル、テキスト、時系列、表形式のモダリティにまたがるリアルなKaggleスタイルのAI開発タスクのベンチマークである。 AIBuildAIは、MLE-Benchで63.1%のメダル率でランクインし、既存のすべてのベースラインメソッドを上回り、熟練したAIエンジニアの能力に匹敵する。これらの結果は、階層的なエージェントシステムがタスク仕様からデプロイ可能なモデルまで、完全なAIモデル開発プロセスを自動化できることを示し、人間の介入を最小限に抑えて、広くアクセス可能なAI開発への道筋を示唆している。

論文の概要: AIBuildAI: An AI Agent for Automatically Building AI Models

関連論文リスト