Fugu-MT 論文翻訳(概要): Verification-Guided Context Optimization for Tool Calling via Hierarchical LLMs-as-Editors

論文の概要: Verification-Guided Context Optimization for Tool Calling via Hierarchical LLMs-as-Editors

arxiv url: http://arxiv.org/abs/2512.13860v1
Date: Mon, 15 Dec 2025 19:48:21 GMT
ステータス: 翻訳完了
システム内更新日: 2025-12-17 16:49:26.474998
Title: Verification-Guided Context Optimization for Tool Calling via Hierarchical LLMs-as-Editors
Title（参考訳）: 階層型LLM-as-Editorsによるツール呼び出しのための検証誘導型コンテキスト最適化
Authors: Henger Li, Shuangjie You, Flavio Di Palo, Yiyue Qian, Ayush Jain,
Abstract要約: ツール関連のドキュメンテーションや知識ベースコンテキストを自動的に洗練する,大規模言語モデルをエディタとして使用するフレームワークであるVGCOを提案する。まず、ツール呼び出しワークフローに自然に統合される階層構造を使用する。第二に、状態認識、アクション特化、検証誘導であり、検索スペースを制限し、効率的で目標とする改善を可能にする。
参考スコア（独自算出の注目度）: 6.990045323115733
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Tool calling enables large language models (LLMs) to interact with external environments through tool invocation, providing a practical way to overcome the limitations of pretraining. However, the effectiveness of tool use depends heavily on the quality of the associated documentation and knowledge base context. These materials are usually written for human users and are often misaligned with how LLMs interpret information. This problem is even more pronounced in industrial settings, where hundreds of tools with overlapping functionality create challenges in scalability, variability, and ambiguity. We propose Verification-Guided Context Optimization (VGCO), a framework that uses LLMs as editors to automatically refine tool-related documentation and knowledge base context. VGCO works in two stages. First, Evaluation collects real-world failure cases and identifies mismatches between tools and their context. Second, Optimization performs hierarchical editing through offline learning with structure-aware, in-context optimization. The novelty of our LLM editors has three main aspects. First, they use a hierarchical structure that naturally integrates into the tool-calling workflow. Second, they are state-aware, action-specific, and verification-guided, which constrains the search space and enables efficient, targeted improvements. Third, they enable cost-efficient sub-task specialization, either by prompt engineering large editor models or by post-training smaller editor models. Unlike prior work that emphasizes multi-turn reasoning, VGCO focuses on the single-turn, large-scale tool-calling problem and achieves significant improvements in accuracy, robustness, and generalization across LLMs.
Abstract（参考訳）: ツール呼び出しは、ツール呼び出しを通じて、大きな言語モデル(LLM)が外部環境と対話することを可能にし、事前トレーニングの制限を克服する実用的な方法を提供する。しかし、ツールの使用の有効性は、関連するドキュメントや知識ベースコンテキストの品質に大きく依存する。これらの資料は通常、人間のユーザー向けに書かれており、LLMがどのように情報を解釈するかと不一致であることが多い。産業環境では、重複する機能を備えた何百ものツールがスケーラビリティ、可変性、あいまいさの課題を生み出します。ツール関連文書や知識ベースコンテキストを自動的に洗練するために,LLMをエディタとして使用するフレームワークであるVerification-Guided Context Optimization (VGCO)を提案する。 VGCOは2段階で動作する。まず、評価は現実世界の障害事例を収集し、ツールとそれらのコンテキストのミスマッチを特定する。第二に、最適化はオフライン学習を通じて階層的な編集を行う。 LLMエディタの新規性には3つの主な側面がある。まず、ツール呼び出しワークフローに自然に統合される階層構造を使用する。第二に、状態認識、アクション特化、検証誘導であり、検索スペースを制限し、効率的で目標とする改善を可能にする。第3に、大きなエディタモデルをプロンプトするか、より小さなエディタモデルをトレーニングした後で、コスト効率の低いサブタスクの特殊化を可能にする。マルチターン推論を強調する以前の作業とは異なり、VGCOはシングルターン、大規模ツール呼び出しの問題に重点を置いており、LLM間の精度、堅牢性、一般化の大幅な改善を実現している。

論文の概要: Verification-Guided Context Optimization for Tool Calling via Hierarchical LLMs-as-Editors

関連論文リスト