Fugu-MT 論文翻訳(概要): AutoSOTA: An End-to-End Automated Research System for State-of-the-Art AI Model Discovery

論文の概要: AutoSOTA: An End-to-End Automated Research System for State-of-the-Art AI Model Discovery

arxiv url: http://arxiv.org/abs/2604.05550v1
Date: Tue, 07 Apr 2026 07:52:01 GMT
ステータス: 翻訳完了
システム内更新日: 2026-04-08 17:42:09.702513
Title: AutoSOTA: An End-to-End Automated Research System for State-of-the-Art AI Model Discovery
Title（参考訳）: AutoSOTA:最先端AIモデル発見のためのエンドツーエンド自動研究システム
Authors: Yu Li, Chenyang Shao, Xinyang Liu, Ruotong Zhao, Peijie Liu, Hongyuan Su, Zhibin Chen, Qinglong Yang, Anjie Xu, Yi Fang, Qingbin Zeng, Tianxing Li, Jingbo Xu, Fengli Xu, Yong Li, Tie-Yan Liu,
Abstract要約: AutoSOTAは、トップレベルのAI論文で発表された最新のState-Of-The-Art(SOTA)モデルを進化させるエンドツーエンドの自動研究システムである。報告された方法を超える新しい105種類のSOTAモデルを発見し、平均して1紙あたり平均5時間である。
参考スコア（独自算出の注目度）: 49.77172027295716
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Artificial intelligence research increasingly depends on prolonged cycles of reproduction, debugging, and iterative refinement to achieve State-Of-The-Art (SOTA) performance, creating a growing need for systems that can accelerate the full pipeline of empirical model optimization. In this work, we introduce AutoSOTA, an end-to-end automated research system that advances the latest SOTA models published in top-tier AI papers to reproducible and empirically improved new SOTA models. We formulate this problem through three tightly coupled stages: resource preparation and goal setting; experiment evaluation; and reflection and ideation. To tackle this problem, AutoSOTA adopts a multi-agent architecture with eight specialized agents that collaboratively ground papers to code and dependencies, initialize and repair execution environments, track long-horizon experiments, generate and schedule optimization ideas, and supervise validity to avoid spurious gains. We evaluate AutoSOTA on recent research papers collected from eight top-tier AI conferences under filters for code availability and execution cost. Across these papers, AutoSOTA achieves strong end-to-end performance in both automated replication and subsequent optimization. Specifically, it successfully discovers 105 new SOTA models that surpass the original reported methods, averaging approximately five hours per paper. Case studies spanning LLM, NLP, computer vision, time series, and optimization further show that the system can move beyond routine hyperparameter tuning to identify architectural innovation, algorithmic redesigns, and workflow-level improvements. These results suggest that end-to-end research automation can serve not only as a performance optimizer, but also as a new form of research infrastructure that reduces repetitive experimental burden and helps redirect human attention toward higher-level scientific creativity.
Abstract（参考訳）: 人工知能の研究は、ステートオフ・ザ・アート(SOTA)のパフォーマンスを達成するために、再現、デバッグ、反復的な改善のサイクルの長いサイクルにますます依存しており、経験的モデルの最適化の完全なパイプラインを加速できるシステムの必要性が高まっている。本稿では,上位階層のAI論文で発表された最新のSOTAモデルを再現可能かつ実証的に改良した新しいSOTAモデルに進化させる,エンドツーエンドの自動研究システムであるAutoSOTAを紹介する。我々は、資源準備と目標設定、実験評価、リフレクションとアイデアの3つの密結合した段階を通してこの問題を定式化する。この問題に対処するため、AutoSOTAは8つの専門エージェントを備えたマルチエージェントアーキテクチャを採用し、文書をコードと依存関係に共同で構築し、実行環境を初期化し、修復し、長距離実験を追跡し、最適化のアイデアを生成およびスケジュールし、素早い利得を避けるために妥当性を監督する。我々は、コード可用性と実行コストのフィルタの下で、上位8つのAIカンファレンスから収集された最近の研究論文についてAutoSOTAを評価した。これらの論文全体で、AutoSOTAは自動レプリケーションとその後の最適化の両方において、エンドツーエンドのパフォーマンスを強力に達成している。具体的には、最初に報告された方法を超える新しい105種類のSOTAモデルを発見し、平均して1紙あたり平均5時間である。 LLM、NLP、コンピュータビジョン、時系列、最適化にまたがるケーススタディは、システムがアーキテクチャの革新、アルゴリズムの再設計、ワークフローレベルの改善を識別するために、通常のハイパーパラメータチューニングを超えることができることを示している。これらの結果から, エンド・ツー・エンドの研究自動化は, 性能最適化だけでなく, 反復的な実験負荷を低減し, 高いレベルの科学的創造性に人間の注意を向ける新たな研究基盤として機能する可能性が示唆された。

論文の概要: AutoSOTA: An End-to-End Automated Research System for State-of-the-Art AI Model Discovery

関連論文リスト