Fugu-MT 論文翻訳(概要): Humans' ALMANAC: A Human Collaboration Dataset of Action-Level Mental Model Annotations for Agent Collaboration

論文の概要: Humans' ALMANAC: A Human Collaboration Dataset of Action-Level Mental Model Annotations for Agent Collaboration

arxiv url: http://arxiv.org/abs/2606.06388v2
Date: Sat, 06 Jun 2026 00:32:51 GMT
ステータス: 翻訳完了
システム内更新日: 2026-06-09 14:42:04.995604
Title: Humans' ALMANAC: A Human Collaboration Dataset of Action-Level Mental Model Annotations for Agent Collaboration
Title（参考訳）: ヒトのALMANAC : エージェントコラボレーションのための行動レベルメンタルモデルアノテーションのヒトコラボレーションデータセット
Authors: Jiaju Chen, Yuxuan Lu, Jiayi Su, Chaoran Chen, Songlin Xiao, Zheng Zhang, Yun Wang, Yunyao Li, Jian Zhao, Tongshuang Wu, Toby Jia-Jun Li, Dakuo Wang, Bingsheng Yao,
Abstract要約: 本稿ではエージェント協調のための行動レベル精神モデルAnnotationsのデータセットであるALMANACについて述べる。 ALMANACには2,987の協調アクションがあり、それぞれに理論インフォームドのメンタルモデルアノテーションが組み合わされている。本研究は、ALMANACが人間の協調動作をシミュレートするモデルの能力を評価する上で有用であることを示す。
参考スコア（独自算出の注目度）: 54.18136393879849
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Recent advances in LLM agents have enabled complex cognitive capabilities, such as multi-step reasoning, planning, and tool use, that increasingly position these agents as human collaborators. Effective collaboration, however, requires collaborators to continuously maintain and align mental models of their own reasoning,partners' intentions, and shared goals during the collaborative process. Today's agents rarely develop such capabilities since they are primarily optimized for task completion, and the community lacks authentic human collaboration data with action-level mental model annotations that could guide agents toward process-level collaborative competence. To bridge this gap, we present ALMANAC, a dataset of Action-Level Mental model ANnotations for Agent Collaboration built from the Map Task, a classic dyadic routing task from social science. ALMANAC contains 2,987 collaboration actions, each paired with theory-informed mental model annotations that record the participants' self-reasoning, perceived partner intent, and perceived team goal. We benchmark six LLMs on predicting humans' next-turn behavior and mental models. Our results demonstrate ALMANAC's utility in evaluating models' ability to simulate human collaborative behaviors and infer their underlying mental models.
Abstract（参考訳）: 近年のLLMエージェントの進歩により、多段階の推論、計画、ツールの使用といった複雑な認知能力が実現し、これらのエージェントを人間の協力者として位置づけるようになっている。しかし、効果的なコラボレーションは、協力者が自身の理性、パートナーの意図、共同作業中の目標の共有を継続的に維持し、調整することを要求する。今日のエージェントは、主にタスク完了に最適化されているため、そのような機能を開発することは滅多にありません。このギャップを埋めるために、社会科学から古典的なダイアドルーティングタスクであるMap Taskから構築されたエージェント協調のための行動レベルメンタルモデルのデータセットであるALMANACを提示する。 ALMANACには2,987の協力行動が含まれており、それぞれに、参加者の自己推論、パートナーの意図、認識されたチームの目標を記録する、理論的なインフォームド・メンタル・モデルアノテーションが組み合わされている。我々は、人間の次のターン行動と精神モデルを予測するために、6つのLSMをベンチマークした。この結果から、ALMANACは、人間の協調行動をシミュレートし、基礎となる精神モデルを予測するモデルの能力を評価できることを示した。

論文の概要: Humans' ALMANAC: A Human Collaboration Dataset of Action-Level Mental Model Annotations for Agent Collaboration

関連論文リスト