Fugu-MT 論文翻訳(概要): SAGE: A Quantitative Evaluation of Socialized Evolution in Agent Ecosystems

論文の概要: SAGE: A Quantitative Evaluation of Socialized Evolution in Agent Ecosystems

arxiv url: http://arxiv.org/abs/2606.03544v1
Date: Tue, 02 Jun 2026 12:08:38 GMT
ステータス: 翻訳完了
システム内更新日: 2026-06-03 22:00:04.983767
Title: SAGE: A Quantitative Evaluation of Socialized Evolution in Agent Ecosystems
Title（参考訳）: SAGE:エージェント生態系における社会的進化の定量的評価
Authors: Linyue Pan, Yaoming Zhu, Lin Qiu, Xuezhi Cao, Xunliang Cai,
Abstract要約: SAGE(Social Agent Group Evolution)は,2つの計算条件を比較した評価フレームワークである。群の歴史は普遍的な増幅器ではなく、最強のエージェントは自己進化の天井を超えない。競合する環境では、カウンターファクトコントロールは、エージェントが相手固有の戦略を開発するよりも一般的に改善することを明らかにする。
参考スコア（独自算出の注目度）: 23.807355053389273
License: http://creativecommons.org/licenses/by-nc-sa/4.0/
Abstract: Self-improving language agents are typically evaluated in isolation: an agent attempts a task, receives feedback, and iteratively refines its own behavior. Yet agents increasingly operate alongside peers whose strategies and outcomes are publicly visible. This raises an under-studied question: when does shared experience produce improvements that self-improvement alone cannot achieve? We introduce SAGE (Social Agent Group Evolution),an evaluation framework that compares two compute-matched conditions: SocialEvo, where agents from five distinct model families co-evolve with access to all peers' histories; and SelfEvo, where each agent receives the same number of task attempts but sees only its own past, which is conventional in self-improving agent studies. We instantiate SAGE in three arenas: open-ended ML research, long-horizon economic planning, and strategic multiplayer play, evaluated across multiple evolutionary rounds. We find that group history is not a universal amplifier: the strongest agent does not exceed its self-evolution ceiling. However, agents that plateau under self-improvement can achieve significant breakthroughs when peer experience is available. In competitive settings, counterfactual controls reveal that agents improve generally rather than developing opponent-specific strategies. Across different forms of shared history, filtered peer traces and reflective summaries often outperform raw logs, indicating that social gains depend on abstraction rather than exposure volume. These findings reveal that peer-history gains are agent-specific, arena-dependent, and contingent on the capacity to abstract transferable knowledge from public traces.
Abstract（参考訳）: エージェントはタスクを試み、フィードバックを受け取り、自身の振る舞いを反復的に洗練する。しかし、エージェントは、戦略や成果が公然と見える仲間と共に活動するようになっている。共有体験はいつ、自己改善だけでは達成できない改善をもたらすのか? SAGE(Social Agent Group Evolution, 社会エージェントグループ進化)は, 5つの異なるモデルファミリーのエージェントがすべての人物の履歴にアクセスするように進化する,SocialEvo(SocialEvo, 社会エージェントグループ進化)と, エージェントが同じ数のタスク試行を受けるが, 自分自身の過去しか見ることができないセルフエボ(SelfEvo, 自己改善エージェント研究における従来の手法)の2つの条件を比較した評価フレームワークである。 SAGEを3つのアリーナ(オープンエンドML研究、長期経済計画、戦略的マルチプレイヤープレイ)でインスタンス化し、複数の進化ラウンドで評価する。群の歴史は普遍的な増幅器ではなく、最強のエージェントは自己進化の天井を超えない。しかし、自己改善剤は、ピア体験が利用可能であれば、大きなブレークスルーを達成できる。競合する環境では、カウンターファクトコントロールは、エージェントが相手固有の戦略を開発するよりも一般的に改善することを明らかにする。異なる形態の共有履歴、フィルターされたピアトレース、反射的な要約は、しばしば生のログよりも優れており、社会的な利益は露光量よりも抽象に依存することを示している。これらの結果から,ピアヒストリーゲインはエージェント特異的で,アリーナ依存的であり,公的な痕跡から伝達可能な知識を抽象化する能力に依存していることが明らかとなった。

論文の概要: SAGE: A Quantitative Evaluation of Socialized Evolution in Agent Ecosystems

関連論文リスト