Fugu-MT Paper Translation (Abstract): AI-Supervisor: Autonomous AI Research Supervision via a Persistent Research World Model

Paper overview: AI-Supervisor: Autonomous AI Research Supervision via a Persistent Research World Model

arxiv url: http://arxiv.org/abs/2603.24402v1
Date: Wed, 25 Mar 2026 15:16:51 GMT
Status: Translation complete
Last updated in system: 2026-03-26 21:06:11.358055
Title: AI-Supervisor: Autonomous AI Research Supervision via a Persistent Research World Model
Title (reference translation): AI Supervisor: Autonomous AI Research Supervision via a Persistent Research World Model
Authors: Yunbo Long
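Abstract summary: Existing automated research systems operate as stateless, linear pipelines, generating outputs without maintaining a persistent understanding of the research landscape. We present AutoProf, a multi-agent orchestration framework in which specialized agents provide end-to-end AI research supervision driven by human interests. Unlike sequential pipelines, AutoProf maintains a continuously evolving Research World Model implemented as a Knowledge Graph.
Reference score (independently calculated attention score): 1.14219428942199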
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Existing automated research systems operate as stateless, linear pipelines, generating outputs without maintaining a persistent understanding of the research landscape. They process papers sequentially, propose ideas without structured gap analysis, and lack mechanisms for agents to verify or refine each other's findings. We present AutoProf (Autonomous Professor), a multi-agent orchestration framework where specialized agents provide end-to-end AI research supervision driven by human interests, from literature review through gap discovery, method development, evaluation, and paper writing, via autonomous exploration and self-correcting updates. Unlike sequential pipelines, AutoProf maintains a continuously evolving Research World Model implemented as a Knowledge Graph, capturing methods, benchmarks, limitations, and unexplored gaps as shared memory across agents. The framework introduces three contributions: first, structured gap discovery that decomposes methods into modules, evaluates them across benchmarks, and identifies module-level gaps; second, self-correcting discovery loops that analyze why modules succeed or fail, detect benchmark biases, and assess evaluation adequacy; third, self-improving development loops using cross-domain mechanism search to iteratively address failing components. All agents operate under a consensus mechanism where findings are validated before being committed to the shared model. The framework is model-agnostic, supports mainstream large language models, and scales elastically with token budget from lightweight exploration to full-scale investigation.
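Abstract (reference translation): Existing automated research systems operate as stateless, linear pipelines and generate outputs without maintaining a persistent understanding of the research landscape. They process papers sequentially, propose ideas without structured gap analysis, and lack mechanisms for agents to verify or refine one another's findings. We present AutoProf (Autonomous Professor), a multi-agent orchestration framework in which specialized agents provide end-to-end AI research supervision driven by human interests, from literature review through gap discovery, method development, evaluation, and paper writing, via autonomous exploration and self-correcting updates. Unlike sequential pipelines, AutoProf maintains a continuously evolving Research World Model implemented as a Knowledge Graph, capturing methods, benchmarks, limitations, and unexplored gaps as memory shared across agents. The framework makes three contributions. First, structured gap discovery decomposes methods into modules, evaluates them across benchmarks, and identifies module-level gaps. Second, self-correcting discovery loops analyze why modules succeed or fail, detect benchmark biases, and assess evaluation adequacy. Third, self-improving development loops use cross-domain mechanism search to iteratively address failing components. All agents operate under a consensus mechanism in which findings are validated before being committed to the shared model. The framework is model-agnostic, supports mainstream large language models, and scales elastically with token budget, from lightweight exploration to full-scale investigation.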
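The abstract describes a shared knowledge graph that agents may only update after their findings are validated. The Python sketch below is one possible reading of that idea, not the authors' implementation. Every name in it (`Finding`, `ResearchWorldModel`, `quorum`, the `fails_on` relation, and the agent and module names) is hypothetical, and the rule that a fixed number of approvals from agents other than the proposer triggers a commit is an assumption, since the abstract does not specify how consensus is reached.

```python
from dataclasses import dataclass, field


@dataclass(frozen=True)
class Finding:
    """A candidate graph edge stored as a (subject, relation, object) triple."""
    subject: str
    relation: str
    obj: str


@dataclass
class ResearchWorldModel:
    """Hypothetical shared memory: committed triples plus findings awaiting review."""
    quorum: int = 2  # assumed: approvals required from agents other than the proposer
    triples: set = field(default_factory=set)
    pending: dict = field(default_factory=dict)  # Finding -> (proposer, approvers)

    def propose(self, agent: str, finding: Finding) -> None:
        # A proposal is recorded but not yet visible in the committed graph.
        if finding not in self.triples:
            self.pending.setdefault(finding, (agent, set()))

    def approve(self, agent: str, finding: Finding) -> bool:
        """Record an approval; return True if this approval commits the finding."""
        if finding not in self.pending:
            return False
        proposer, approvers = self.pending[finding]
        if agent == proposer:
            return False  # self-approval does not count toward consensus
        approvers.add(agent)
        if len(approvers) >= self.quorum:
            self.triples.add(finding)
            del self.pending[finding]
            return True
        return False

    def gaps(self) -> list:
        """Read committed 'fails_on' edges as module-level gaps."""
        return sorted((f.subject, f.obj) for f in self.triples
                      if f.relation == "fails_on")


model = ResearchWorldModel(quorum=2)
finding = Finding("retrieval_module", "fails_on", "benchmark_a")
model.propose("agent_1", finding)
model.approve("agent_2", finding)   # one approval: still pending
model.approve("agent_3", finding)   # second approval: committed
print(model.gaps())
```

In this sketch a finding proposed by one agent stays out of `triples` until two other agents approve it, which is one simple way to keep unverified claims out of the shared model.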
