Fugu-MT 論文翻訳(概要): Human Values Matter: Investigating How Misalignment Shapes Collective Behaviors in LLM Agent Communities

論文の概要: Human Values Matter: Investigating How Misalignment Shapes Collective Behaviors in LLM Agent Communities

arxiv url: http://arxiv.org/abs/2604.05339v1
Date: Tue, 07 Apr 2026 02:23:48 GMT
ステータス: 翻訳完了
システム内更新日: 2026-04-08 17:42:09.57773
Title: Human Values Matter: Investigating How Misalignment Shapes Collective Behaviors in LLM Agent Communities
Title（参考訳）: 人的価値:LLMエージェントコミュニティにおけるミスアライメント形状の集合行動の解明
Authors: Xiangxu Zhang, Jiamin Wang, Qinlin Zhao, Hanze Guo, Linzhuo Li, Jing Yao, Xiao Zhou, Xiaoyuan Yi, Xing Xie,
Abstract要約: 社会科学理論に基づく制御型マルチエージェント環境CIVAを紹介する。我々は、コミュニティの集団的ダイナミクスを著しく形作るいくつかの構造的に重要な価値を識別する。マイクロレベルでの騙しやパワーセーキングといった創発的な行動を観察します。
参考スコア（独自算出の注目度）: 38.85369941405267
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: As LLMs become increasingly integrated into human society, evaluating their orientations on human values from social science has drawn growing attention. Nevertheless, it is still unclear why human values matter for LLMs, especially in LLM-based multi-agent systems, where group-level failures may accumulate from individually misaligned actions. We ask whether misalignment with human values alters the collective behavior of LLM agents and what changes it induces? In this work, we introduce CIVA, a controlled multi-agent environment grounded in social science theories, where LLM agents form a community and autonomously communicate, explore, and compete for resources, enabling systematic manipulation of value prevalence and behavioral analysis. Through comprehensive simulation experiments, we reveal three key findings. (1) We identify several structurally critical values that substantially shape the community's collective dynamics, including those diverging from LLMs' original orientations. Triggered by the misspecification of these values, we (2) detect system failure modes, e.g., catastrophic collapse, at the macro level, and (3) observe emergent behaviors like deception and power-seeking at the micro level. These results offer quantitative evidence that human values are essential for collective outcomes in LLMs and motivate future multi-agent value alignment.
Abstract（参考訳）: LLMがますます人間社会に統合されるにつれて、社会科学から人的価値への指向性を評価することが注目されている。にもかかわらず、LLMの人的価値が重要な理由、特にLLMベースのマルチエージェントシステムでは、集団レベルの障害が個々の不整合行動から蓄積される可能性がある理由はまだ不明である。人間の価値観との相違がLLMエージェントの集団行動に影響を与え、それがどのような変化をもたらすのかを問う。本研究では, LLMエージェントがコミュニティを形成し, 自律的にコミュニケーションし, 探索し, 資源と競争し, 価値の有病率と行動分析の体系的な操作を可能にする, 社会科学理論に基づくマルチエージェント環境CIVAを紹介する。総合シミュレーション実験により,3つの重要な知見が得られた。 1) LLMの元々の方向性から分岐するものを含む, コミュニティの集団的ダイナミクスを著しく形作るいくつかの構造的重要な価値を同定する。これらの値の誤特定により,(2)マクロレベルでの破滅的崩壊などのシステム障害モードを検出し,(3)マイクロレベルでの誤認やパワーサーキングといった創発的な挙動を観察する。これらの結果は、LLMの集合的な結果に人的価値が不可欠であることを定量的に証明し、将来のマルチエージェント価値アライメントを動機付けている。

論文の概要: Human Values Matter: Investigating How Misalignment Shapes Collective Behaviors in LLM Agent Communities

関連論文リスト