Fugu-MT 論文翻訳(概要): Agent Planning with World Knowledge Model

論文の概要: Agent Planning with World Knowledge Model

arxiv url: http://arxiv.org/abs/2405.14205v2
Date: Tue, 15 Oct 2024 13:58:17 GMT
ステータス: 翻訳完了
システム内更新日: 2024-11-28 17:07:32.596014
Title: Agent Planning with World Knowledge Model
Title（参考訳）: 世界知識モデルを用いたエージェントプランニング
Authors: Shuofei Qiao, Runnan Fang, Ningyu Zhang, Yuqi Zhu, Xiang Chen, Shumin Deng, Yong Jiang, Pengjun Xie, Fei Huang, Huajun Chen,
Abstract要約: エージェント計画を容易にするためにパラメトリック世界知識モデル(WKM)を導入する。我々はWKMを開発し,グローバルプランニングと動的状態知識を指導し,地域プランニングを支援する。本手法は, 各種の強靭なベースラインと比較して, 優れた性能が得られる。
参考スコア（独自算出の注目度）: 88.4897773735576
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Recent endeavors towards directly using large language models (LLMs) as agent models to execute interactive planning tasks have shown commendable results. Despite their achievements, however, they still struggle with brainless trial-and-error in global planning and generating hallucinatory actions in local planning due to their poor understanding of the ``real'' physical world. Imitating humans' mental world knowledge model which provides global prior knowledge before the task and maintains local dynamic knowledge during the task, in this paper, we introduce parametric World Knowledge Model (WKM) to facilitate agent planning. Concretely, we steer the agent model to self-synthesize knowledge from both expert and sampled trajectories. Then we develop WKM, providing prior task knowledge to guide the global planning and dynamic state knowledge to assist the local planning. Experimental results on three complex real-world simulated datasets with three state-of-the-art open-source LLMs, Mistral-7B, Gemma-7B, and Llama-3-8B, demonstrate that our method can achieve superior performance compared to various strong baselines. Besides, we analyze to illustrate that our WKM can effectively alleviate the blind trial-and-error and hallucinatory action issues, providing strong support for the agent's understanding of the world. Other interesting findings include: 1) our instance-level task knowledge can generalize better to unseen tasks, 2) weak WKM can guide strong agent model planning, and 3) unified WKM training has promising potential for further development. The code is available at https://github.com/zjunlp/WKM.
Abstract（参考訳）: 対話型計画タスクを実行するエージェントモデルとして,大規模言語モデル(LLM)を直接利用するための最近の取り組みは,賞賛に値する結果を示している。しかし、これらの成果にもかかわらず、「現実」の物理世界に対する理解が乏しいため、グローバルプランニングにおける無脳の試行錯誤や、地域プランニングにおける幻覚的行動の発生に苦慮している。本稿では,タスク前にグローバルな事前知識を提供し,タスク中に局所的なダイナミックな知識を維持する人間のメンタルワールド知識モデル(WKM)について,エージェント計画を容易にするためにパラメトリックワールド知識モデル(Parametric World Knowledge Model,WKM)を紹介する。具体的には、エージェントモデルを用いて、専門家とサンプルの軌跡から知識を自己合成する。そして、WKMを開発し、グローバルプランニングと動的状態知識をガイドし、ローカルプランニングを支援する。最先端のオープンソースLLM3種, Mistral-7B, Gemma-7B, および Llama-3-8B を用いた実世界の3つの複雑なシミュレーションデータセットの実験結果から, 本手法は各種の強力なベースラインと比較して優れた性能が得られることを示した。さらに、我々のWKMが視覚障害者の試行錯誤や幻覚的行動の問題を効果的に緩和し、エージェントの世界の理解を強く支援できることを示すために分析を行った。その他の興味深い発見は以下のとおりである。 1) インスタンスレベルのタスク知識は、見当たらないタスクをより一般化することができる。 2)弱いWKMは強力なエージェントモデルプランニングを導くことができる。 3)統一的なWKM訓練はさらなる発展に有望な可能性を秘めている。コードはhttps://github.com/zjunlp/WKMで入手できる。

論文の概要: Agent Planning with World Knowledge Model

関連論文リスト