Fugu-MT 論文翻訳(概要): Strategic Tradeoffs Between Humans and AI in Multi-Agent Bargaining

論文の概要: Strategic Tradeoffs Between Humans and AI in Multi-Agent Bargaining

arxiv url: http://arxiv.org/abs/2509.09071v2
Date: Fri, 12 Sep 2025 18:52:12 GMT
ステータス: 翻訳完了
システム内更新日: 2025-09-16 11:17:11.572722
Title: Strategic Tradeoffs Between Humans and AI in Multi-Agent Bargaining
Title（参考訳）: マルチエージェント取引における人間とAIの戦略的トレードオフ
Authors: Crystal Qian, Kehang Zhu, John Horton, Benjamin S. Manning, Vivian Tsai, James Wexler, Nithum Thain,
Abstract要約: 人類が伝統的に行ってきた調整作業は、ますます自律的なエージェントに委譲されている。我々は,人間(N = 216),大言語モデル(LLM),ベイズエージェントを動的交渉条件で比較した。エージェント評価の一般的なベンチマークであるパフォーマンスの同等性は、プロセスとアライメントの根本的な違いを隠蔽する可能性がある。
参考スコア（独自算出の注目度）: 6.455342700410145
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Coordination tasks traditionally performed by humans are increasingly being delegated to autonomous agents. As this pattern progresses, it becomes critical to evaluate not only these agents' performance but also the processes through which they negotiate in dynamic, multi-agent environments. Furthermore, different agents exhibit distinct advantages: traditional statistical agents, such as Bayesian models, may excel under well-specified conditions, whereas large language models (LLMs) can generalize across contexts. In this work, we compare humans (N = 216), LLMs (GPT-4o, Gemini 1.5 Pro), and Bayesian agents in a dynamic negotiation setting that enables direct, identical-condition comparisons across populations, capturing both outcomes and behavioral dynamics. Bayesian agents extract the highest surplus through aggressive optimization, at the cost of frequent trade rejections. Humans and LLMs can achieve similar overall surplus, but through distinct behaviors: LLMs favor conservative, concessionary trades with few rejections, while humans employ more strategic, risk-taking, and fairness-oriented behaviors. Thus, we find that performance parity -- a common benchmark in agent evaluation -- can conceal fundamental differences in process and alignment, which are critical for practical deployment in real-world coordination tasks.
Abstract（参考訳）: 人類が伝統的に行ってきた調整作業は、ますます自律的なエージェントに委譲されている。このパターンが進むにつれて、これらのエージェントのパフォーマンスだけでなく、動的でマルチエージェントな環境で交渉するプロセスを評価することが重要になる。さらに、ベイズモデルのような伝統的な統計エージェントは、明確に特定された条件下では優れるが、大きな言語モデル(LLM)は文脈を横断して一般化することができる。本研究では,人間 (N = 216), LLM (GPT-4o, Gemini 1.5 Pro) とベイズエージェント (ベイズエージェント) を,集団間の直接的, 同一条件比較が可能な動的交渉条件で比較し, 結果と行動力学の両方を捉える。ベイジアン・エージェントは、頻繁な貿易拒絶を犠牲にして、積極的な最適化を通じて最高の余剰を抽出する。人間とLLMは同様の全体的な余剰を達成できるが、異なる行動によって、LLMは保守的で譲歩的な取引をほとんど拒否しないが、人間はより戦略的、リスクテイク、公正志向の行動を採用する。したがって、エージェント評価における一般的なベンチマークであるパフォーマンスの同等性は、プロセスとアライメントの根本的な違いを隠蔽する可能性がある。

論文の概要: Strategic Tradeoffs Between Humans and AI in Multi-Agent Bargaining

関連論文リスト