Fugu-MT 論文翻訳(概要): LoopTool: Closing the Data-Training Loop for Robust LLM Tool Calls

論文の概要: LoopTool: Closing the Data-Training Loop for Robust LLM Tool Calls

arxiv url: http://arxiv.org/abs/2511.09148v2
Date: Tue, 18 Nov 2025 07:03:59 GMT
ステータス: 翻訳完了
システム内更新日: 2025-11-19 13:59:16.5745
Title: LoopTool: Closing the Data-Training Loop for Robust LLM Tool Calls
Title（参考訳）: LoopTool:ロバストなLLMツールコールのためのデータトレーニングループのクローン
Authors: Kangning Zhang, Wenxiang Jiao, Kounianhua Du, Yuan Lu, Weiwen Liu, Weinan Zhang, Yong Yu,
Abstract要約: LoopToolは、完全に自動化され、モデル対応のデータ進化フレームワークである。 3つの相乗的モジュールを通して、データとモデルを反復的に洗練する。実験によると、LoopToolでトレーニングした8Bモデルは、32Bデータジェネレータを大幅に上回っている。
参考スコア（独自算出の注目度）: 46.34510189812439
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Augmenting Large Language Models (LLMs) with external tools enables them to execute complex, multi-step tasks. However, tool learning is hampered by the static synthetic data pipelines where data generation and model training are executed as two separate, non-interactive processes. This approach fails to adaptively focus on a model's specific weaknesses and allows noisy labels to persist, degrading training efficiency. We introduce LoopTool, a fully automated, model-aware data evolution framework that closes this loop by tightly integrating data synthesis and model training. LoopTool iteratively refines both the data and the model through three synergistic modules: (1) Greedy Capability Probing (GCP) diagnoses the model's mastered and failed capabilities; (2) Judgement-Guided Label Verification (JGLV) uses an open-source judge model to find and correct annotation errors, progressively purifying the dataset; and (3) Error-Driven Data Expansion (EDDE) generates new, challenging samples based on identified failures. This closed-loop process operates within a cost-effective, open-source ecosystem, eliminating dependence on expensive closed-source APIs. Experiments show that our 8B model trained with LoopTool significantly surpasses its 32B data generator and achieves new state-of-the-art results on the BFCL-v3 and ACEBench benchmarks for its scale. Our work demonstrates that closed-loop, self-refining data pipelines can dramatically enhance the tool-use capabilities of LLMs.
Abstract（参考訳）: LLM(Large Language Models)を外部ツールで拡張することで、複雑なマルチステップタスクを実行できる。しかし、ツール学習は静的な合成データパイプラインによって妨げられ、データ生成とモデルトレーニングは2つの独立した非対話的プロセスとして実行される。このアプローチは、モデルの特定の弱点に適応的に焦点を合わせることができず、ノイズの多いラベルを持続させ、トレーニング効率を低下させます。 LoopToolは、完全に自動化されたモデル対応のデータ進化フレームワークで、データ合成とモデルトレーニングを緊密に統合することで、このループを閉じます。 LoopToolは3つの相乗的モジュールを通じて、データとモデルを反復的に洗練する。 1) Greedy Capability Probing (GCP)は、モデルのマスターされた機能とフェールした機能を診断する; (2) Judgement-Guided Label Verification (JGLV)は、アノテーションエラーを見つけて修正するためにオープンソースの判断モデルを使用し、データセットを徐々に浄化する; 3) Error-Driven Data Expansion (EDDE)は、識別された障害に基づいて、新しい、挑戦的なサンプルを生成する。このクローズドループプロセスはコスト効率のよいオープンソースエコシステム内で動作し、高価なクローズドソースAPIへの依存を排除している。 LoopToolでトレーニングした8Bモデルは,32Bデータジェネレータを大幅に上回り,BFCL-v3とACEBenchベンチマークの新たな最先端結果を実現している。我々の研究は、LLMのツール使用能力を劇的に向上させることができるクローズドループ、自己精製データパイプラインを実証している。

論文の概要: LoopTool: Closing the Data-Training Loop for Robust LLM Tool Calls

関連論文リスト