Fugu-MT 論文翻訳(概要): Communicate-Predict-Act: Evaluating Social Intelligence of Agents

論文の概要: Communicate-Predict-Act: Evaluating Social Intelligence of Agents

arxiv url: http://arxiv.org/abs/2604.08727v1
Date: Thu, 09 Apr 2026 19:40:15 GMT
ステータス: 翻訳完了
システム内更新日: 2026-04-13 17:57:53.560507
Title: Communicate-Predict-Act: Evaluating Social Intelligence of Agents
Title（参考訳）: コミュニケーション予測法:エージェントの社会的知性を評価する
Authors: David Shoresh, Sarit Kraus, Yonatan Loewenstein,
Abstract要約: 我々は,大規模言語モデル(LLM)のソーシャルインテリジェンスを研究するために,複合協調型および競争型ソーシャルゲームのマルチプレイヤーアリーナを導入する。ゲームプレイのトレースを分析し,行動予測,コミュニケーション的影響,戦略的推論,競合する利益の下でのトレードオフを捉えた社会認知的指標を抽出する。
参考スコア（独自算出の注目度）: 15.575081867336529
License: http://creativecommons.org/licenses/by-nc-sa/4.0/
Abstract: As large language model (LLM) agents become more prevalent in real world social settings, social intelligence will play an increasingly critical role. But social intelligence is still a poorly defined construct, for humans and artificial agents. We introduce a multiplayer arena of mixed cooperative and competitive social games to study LLM social intelligence. The controllability of LLM based agents enables systematic evaluation, which also supports broader inferences about social intelligence per se. We evaluated eight diverse LLMs (24B to 1T parameters) using a Communicate Predict Act (COMPACT) interaction protocol and fine grained probing of social dynamics. Elo style ratings reveal consistent performance differences across models, but this scalar measure provides only a partial characterization of social intelligence. To address this limitation, we analyze gameplay traces to extract sociocognitive metrics capturing action prediction, communicative influence, strategic reasoning, and tradeoffs under conflicting interests. These sociocognitive metrics exhibit strong intramodel consistency and they reliably predict pairwise agent advantage in game outcomes (AUC ROC = 0.82). Feature importance analysis indicates that surprisingly, influence, transparency, and adaptability are more predictive of success than Theory of Mind inference or deep planning. Together, our results advance a testable, multidimensional conception of social intelligence and provide empirical insights into the capacities that underpin it.
Abstract（参考訳）: 大規模言語モデル(LLM)エージェントが現実社会で普及するにつれて、社会知性はますます重要な役割を担うようになる。しかし、ソーシャルインテリジェンスはまだ、人間や人工エージェントにとって、十分に定義されていない構造だ。 LLMソーシャルインテリジェンスを研究するために,複合協調型・競争型ソーシャルゲームのマルチプレイヤーアリーナを導入する。 LLMをベースとしたエージェントの制御性は、体系的な評価を可能にし、社会知性に関するより広範な推論もサポートする。我々は,Communicate Predict Act (COMPACT) のインタラクションプロトコルとソーシャルダイナミクスのきめ細かいきめ細かな探索を用いて,8種類のLCM(24Bから1Tパラメータ)を評価した。エロスタイルの評価は、モデル間で一貫したパフォーマンスの違いを示すが、このスカラー尺度は、社会的知性の部分的な特徴のみを提供する。この制限に対処するために、ゲームプレイトレースを分析し、競合する利益の下で行動予測、コミュニケーション的影響、戦略的推論、トレードオフを捉えた社会認知的指標を抽出する。これらの社会認知的指標は強いモデル内整合性を示し、ゲーム結果(AUC ROC = 0.82)においてペアワイズエージェントの利点を確実に予測する。特徴的重要性の分析は、驚くほど、影響、透明性、適応性が、心の理論や深い計画よりも成功を予測できることを示している。その結果,社会的知能の実証可能な多次元概念を推し進め,その基盤となる能力に関する実証的な洞察を提供することができた。

論文の概要: Communicate-Predict-Act: Evaluating Social Intelligence of Agents

関連論文リスト