Fugu-MT 論文翻訳(概要): Alita-G: Self-Evolving Generative Agent for Agent Generation

論文の概要: Alita-G: Self-Evolving Generative Agent for Agent Generation

arxiv url: http://arxiv.org/abs/2510.23601v1
Date: Mon, 27 Oct 2025 17:59:14 GMT
ステータス: 翻訳完了
システム内更新日: 2025-10-28 15:28:15.662951
Title: Alita-G: Self-Evolving Generative Agent for Agent Generation
Title（参考訳）: Alita-G:エージェント生成のための自己進化型生成剤
Authors: Jiahao Qiu, Xuan Qi, Hongru Wang, Xinzhe Juan, Yimin Wang, Zelin Zhao, Jiayi Geng, Jiacheng Guo, Peihang Li, Jingzhe Shi, Shilong Liu, Mengdi Wang,
Abstract要約: 汎用エージェントをドメインエキスパートに変換するフレームワークであるALITA-Gを提案する。このフレームワークでは、ジェネラリストエージェントが対象ドメインタスクのキュレートされたスイートを実行する。計算コストを削減しながら、大きな利益を得ることができます。
参考スコア（独自算出の注目度）: 54.49365835457433
License: http://creativecommons.org/licenses/by-nc-sa/4.0/
Abstract: Large language models (LLMs) have been shown to perform better when scaffolded into agents with memory, tools, and feedback. Beyond this, self-evolving agents have emerged, but current work largely limits adaptation to prompt rewriting or failure retries. Therefore, we present ALITA-G, a self-evolution framework that transforms a general-purpose agent into a domain expert by systematically generating, abstracting, and curating Model Context Protocol (MCP) tools. In this framework, a generalist agent executes a curated suite of target-domain tasks and synthesizes candidate MCPs from successful trajectories. These are then abstracted to parameterized primitives and consolidated into an MCP Box. At inference time, ALITA-G performs retrieval-augmented MCP selection with the help of each tool's descriptions and use cases, before executing an agent equipped with the MCP Executor. Across several benchmarks GAIA, PathVQA, and Humanity's Last Exam, ALITA-G attains strong gains while reducing computation costs. On GAIA validation, it achieves 83.03% pass@1 and 89.09% pass@3, establishing a new state-of-the-art result while reducing mean tokens per example by approximately 15% relative to a strong baseline agent. ALITA-G thus provides a principled pathway from generalist capability to reusable, domain-specific competence, improving both accuracy and efficiency on complex reasoning tasks.
Abstract（参考訳）: 大きな言語モデル(LLM)は、メモリ、ツール、フィードバックを持ったエージェントに足場を組み込むと、パフォーマンスが向上することが示されている。これ以外にも、自己進化エージェントが出現しているが、現在の作業は、書き換えや失敗の再試行を早めるために適応をほとんど制限している。そこで本研究では,汎用エージェントをドメインエキスパートに変換する自己進化フレームワークであるALITA-Gを提案する。このフレームワークでは、汎用エージェントが対象ドメインタスクのキュレートされたスイートを実行し、成功した軌道から候補MPPを合成する。これらはパラメータ化プリミティブに抽象化され、MPPボックスに統合される。推測時に、ALITA-Gは、各ツールの説明やユースケースの助けを借りて検索強化されたMPP選択を行い、MPP実行装置を備えたエージェントを実行する。 GAIA、PathVQA、HumanityのLast Examなどいくつかのベンチマークにおいて、ALITA-Gは計算コストを削減しながら大きく向上した。 GAIA検証では83.03%のpass@1と89.09%のpass@3を達成し、新しい最先端の結果を確立し、例えば平均トークンを強いベースラインエージェントに対して約15%削減する。 ALITA-Gは、一般的な能力から再利用可能なドメイン固有の能力への原則化された経路を提供し、複雑な推論タスクの正確性と効率の両方を改善している。

論文の概要: Alita-G: Self-Evolving Generative Agent for Agent Generation

関連論文リスト