Fugu-MT 論文翻訳(概要): Pushing the Limits of LLM Tool Calling via Experiential Knowledge Integration and Activation

論文の概要: Pushing the Limits of LLM Tool Calling via Experiential Knowledge Integration and Activation

arxiv url: http://arxiv.org/abs/2606.10875v1
Date: Tue, 09 Jun 2026 13:51:32 GMT
ステータス: 翻訳完了
システム内更新日: 2026-06-10 15:40:58.530517
Title: Pushing the Limits of LLM Tool Calling via Experiential Knowledge Integration and Activation
Title（参考訳）: 経験的知識統合と活性化によるLCMツール呼び出しの限界を押し上げる
Authors: Yupu Hao, Zhuoran Jin, Huanxuan Liao, Kang Liu, Jun Zhao,
Abstract要約: 大規模言語モデル(LLM)は、自律エージェントとして機能するツールの使用に依存するが、多段階実行では失敗することが多い。本稿では,知識の獲得,アクティベーション,内部化の段階を網羅し,知識がツール利用のパフォーマンスにどのように影響するかを体系的に検討する。本稿では,知識拡張ツール実行フレームワークKATEを提案する。知識拡張ツール実行フレームワークは,経験的知識と推論幅の拡大した推論と知識認識学習を統合する。
参考スコア（独自算出の注目度）: 23.538941671680806
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Large language models (LLMs) rely on tool use to act as autonomous agents, yet often fail in multi-step execution due to insufficient tool-related knowledge and ineffective knowledge activation. Therefore, we present a systematic study on how knowledge influences tool-use performance, covering the stages of knowledge acquisition, activation, and internalization. In the knowledge acquisition stage, we acquire and evaluate various forms of experiential knowledge, and our analysis shows that simple instance-level knowledge can already provide strong and reliable gains, while abstract intent-level knowledge offers limited benefits. At inference time, to activate knowledge, we find that prompting LLM to expand the depth of reasoning yields diminishing returns, whereas expanding the width of reasoning by parallel sampling with aggregation more effectively activates latent experiential knowledge. At training time, for knowledge internalization, post-training with knowledge-augmented data further improves performance, with reinforcement learning outperforming supervised fine-tuning. Based on these insights, we propose the Knowledge-Augmented Tool Execution (KATE), a knowledge-augmented tool execution framework that integrates experiential knowledge with reasoning-width-expanded inference and knowledge-aware training. Experiments on BFCL-V3 and AppWorld demonstrate consistent and substantial improvements over strong baselines across model scales. Our Code is available at https://github.com/hypasd-art/KATE.
Abstract（参考訳）: 大規模言語モデル(LLM)は、自律的なエージェントとして機能するためにツールの使用に依存するが、ツール関連の知識が不足し、知識のアクティベーションが不十分なため、多段階実行では失敗することが多い。そこで本研究では,知識の獲得,活性化,内部化の段階を網羅し,知識が道具の使用実績にどのように影響するかを体系的に検討する。知識獲得の段階では,経験的知識の様々な形態を習得し,評価し,分析したところ,単純なインスタンスレベルの知識がより強力で信頼性の高い知識を提供する一方で,抽象的なインテントレベルの知識は限られた利益をもたらすことがわかった。推論時, 知識を活性化するため, LLMに推論の深さを拡大させると, 推論の幅を増大させる一方, 並列サンプリングによる推論の幅を増大させることにより, 潜時経験知識をより効果的に活性化させることがわかった。学習時には、知識の内在化のために、知識を増強したデータによる後学習により、さらにパフォーマンスが向上し、強化学習は教師付き微調整よりも優れる。これらの知見に基づいて、知識拡張ツール実行(KATE)を提案する。知識拡張ツール実行フレームワークは、経験的知識を推論幅の拡大した推論と知識認識のトレーニングと統合する。 BFCL-V3とAppWorldの実験では、モデルスケールにわたる強力なベースラインよりも一貫性があり、実質的な改善が示されている。私たちのコードはhttps://github.com/hypasd-art/KATE.comで利用可能です。

論文の概要: Pushing the Limits of LLM Tool Calling via Experiential Knowledge Integration and Activation

関連論文リスト