Fugu-MT 論文翻訳(概要): CollabSim: A CSCW-Grounded Methodology for Investigating Collaborative Competence of LLM Agents through Controlled Multi-Agent Experiments

論文の概要: CollabSim: A CSCW-Grounded Methodology for Investigating Collaborative Competence of LLM Agents through Controlled Multi-Agent Experiments

arxiv url: http://arxiv.org/abs/2606.06399v2
Date: Sat, 06 Jun 2026 00:36:15 GMT
ステータス: 翻訳完了
システム内更新日: 2026-06-09 12:24:31.356113
Title: CollabSim: A CSCW-Grounded Methodology for Investigating Collaborative Competence of LLM Agents through Controlled Multi-Agent Experiments
Title（参考訳）: CollabSim: マルチエージェント実験によるLCMエージェントの協調能力調査のためのCSCW-Grounded Methodology
Authors: Jiaju Chen, Bo Sun, Yuxuan Lu, Yun Wang, Dakuo Wang, Bingsheng Yao,
Abstract要約: CollabSimは、理論に基づく協調能力の定義、相互作用条件の制御、エージェントの内部状態のアクションレベル探索を組み合わせたシミュレーションフレームワークである。 CollabSimは条件効果をキャプチャし、モデルパフォーマンスパターンを分離し、エージェント設計のタスク依存効果を明らかにする。
参考スコア（独自算出の注目度）: 32.043112677406064
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Multi-agent systems (MAS) built on large language models have shown growing promise, with their effectiveness resting on agents' ability to coordinate through text-based channels much as human teams do. Yet recent study suggests that MAS often falter not because agents lack individual task-solving ability, but because they lack collaborative competence: the capacity to establish common ground, maintain shared task understanding, balance individual and collective incentives, and repair misalignment as interaction unfolds. Decades of research in Computer-Supported Cooperative Work have characterized these requirements for human teams coordinating under constrained communication, yet existing MAS evaluations focus mainly on task outcomes or single-agent proficiency in reasoning, planning, and tool use. To enable a systematic analysis of agents' collaborative competence in MAS, we introduce CollabSim, a configurable simulation framework that combines a theory-grounded definition of collaborative capabilities, controlled manipulation of interaction conditions, and action-level probing of agents' internal states. Experiments across four LLMs show that CollabSim can capture condition effects, separate model performance patterns, and reveal task-dependent effects of agent design.
Abstract（参考訳）: 大規模言語モデル上に構築されたマルチエージェントシステム(MAS)は、人間のチームと同じようにテキストベースのチャネルを介して協調するエージェントの能力によって、その効果が向上していることを示している。しかし最近の研究では、エージェントは個別のタスク解決能力を欠いているためではなく、共通の基盤を確立し、共通のタスク理解を維持し、個人のインセンティブと集団のインセンティブをバランスさせ、相互作用が展開するにつれて不整合を修復する能力が欠如していることがしばしば示唆されている。コンピュータ支援協調作業における長年の研究は、制約されたコミュニケーションの下で協調する人間チームに対するこれらの要求を特徴付けてきたが、既存のMAS評価は、主に推論、計画、ツール使用におけるタスク成果または単一エージェントの習熟度に焦点を当てている。我々は,MASにおけるエージェントの協調能力の体系的解析を可能にするために,協調能力の理論的定義と相互作用条件の制御,エージェントの内部状態のアクションレベル探索を組み合わせた,構成可能なシミュレーションフレームワークであるCollabSimを紹介する。 4つのLCM実験により、CollabSimは条件効果を捉え、モデル性能パターンを分離し、エージェント設計のタスク依存効果を明らかにする。

論文の概要: CollabSim: A CSCW-Grounded Methodology for Investigating Collaborative Competence of LLM Agents through Controlled Multi-Agent Experiments

関連論文リスト