Fugu-MT 論文翻訳(概要): Strategic Tradeoffs Between Humans and AI in Multi-Agent Bargaining

論文の概要: Strategic Tradeoffs Between Humans and AI in Multi-Agent Bargaining

arxiv url: http://arxiv.org/abs/2509.09071v3
Date: Mon, 13 Oct 2025 22:19:28 GMT
ステータス: 翻訳完了
システム内更新日: 2025-10-15 19:02:31.990859
Title: Strategic Tradeoffs Between Humans and AI in Multi-Agent Bargaining
Title（参考訳）: マルチエージェント取引における人間とAIの戦略的トレードオフ
Authors: Crystal Qian, Kehang Zhu, John Horton, Benjamin S. Manning, Vivian Tsai, James Wexler, Nithum Thain,
Abstract要約: 我々は,人間,大言語モデル,ベイズエージェントの動的交渉条件における結果と行動力学を比較した。パフォーマンスの同等性は、プロセスとアライメントの根本的な違いを隠蔽する可能性がある。この研究は、より応用され、変数リッチな環境における将来の研究のベースラインを提供する。
参考スコア（独自算出の注目度）: 6.455342700410145
License: http://creativecommons.org/licenses/by/4.0/
Abstract: As large language models (LLMs) are increasingly embedded in collaborative human activities such as business negotiations and group coordination, it becomes critical to evaluate both the performance gains they can achieve and how they interact in dynamic, multi-agent environments. Unlike traditional statistical agents such as Bayesian models, which may excel under well-specified conditions, large language models (LLMs) can generalize across diverse, real-world scenarios, raising new questions about how their strategies and behaviors compare to those of humans and other agent types. In this work, we compare outcomes and behavioral dynamics across humans (N = 216), LLMs (GPT-4o, Gemini 1.5 Pro), and Bayesian agents in a dynamic negotiation setting under identical conditions. Bayesian agents extract the highest surplus through aggressive optimization, at the cost of frequent trade rejections. Humans and LLMs achieve similar overall surplus, but through distinct behaviors: LLMs favor conservative, concessionary trades with few rejections, while humans employ more strategic, risk-taking, and fairness-oriented behaviors. Thus, we find that performance parity -- a common benchmark in agent evaluation -- can conceal fundamental differences in process and alignment, which are critical for practical deployment in real-world coordination tasks. By establishing foundational behavioral baselines under matched conditions, this work provides a baseline for future studies in more applied, variable-rich environments.
Abstract（参考訳）: 大規模言語モデル(LLM)は、ビジネス交渉やグループ調整といった協調的な人間の活動にますます組み込まれているため、それらが達成できるパフォーマンス向上と、動的でマルチエージェント環境でどのように相互作用するかを評価することが重要になる。ベイズモデルのような特定の条件下では優れる従来の統計エージェントとは異なり、大規模言語モデル(LLM)は多種多様な現実世界のシナリオにまたがって一般化することができ、それらの戦略や行動が人間や他のエージェントとどのように比較されるのかという新たな疑問を提起する。本研究では,人間間(N = 216),LLM(GPT-4o,Gemini 1.5 Pro),ベイズエージェントを同一条件下での動的交渉条件で比較した。ベイジアン・エージェントは、頻繁な貿易拒絶を犠牲にして、積極的な最適化を通じて最高の余剰を抽出する。 LLMは保守的で譲歩的な取引をほとんど拒否しないが、人間はより戦略的、リスクテイク、フェアネス指向の行動を採用する。したがって、エージェント評価における一般的なベンチマークであるパフォーマンスの同等性は、プロセスとアライメントの根本的な違いを隠蔽する可能性がある。一致した条件下での基本的な行動ベースラインを確立することにより、この研究は、より応用され、変数リッチな環境における将来の研究のベースラインを提供する。

論文の概要: Strategic Tradeoffs Between Humans and AI in Multi-Agent Bargaining

関連論文リスト