Fugu-MT 論文翻訳(概要): GenesisFunc: Multi-Agent Data Generation for Accurate and Generalizable Function-Calling

論文の概要: GenesisFunc: Multi-Agent Data Generation for Accurate and Generalizable Function-Calling

arxiv url: http://arxiv.org/abs/2605.28835v1
Date: Fri, 10 Apr 2026 14:02:03 GMT
ステータス: 翻訳完了
システム内更新日: 2026-06-15 07:09:36.550218
Title: GenesisFunc: Multi-Agent Data Generation for Accurate and Generalizable Function-Calling
Title（参考訳）: GenesisFunc: 正確で一般化可能な関数計算のためのマルチエージェントデータ生成
Authors: Hao-Xiang Xu, Chong Deng, Jiaqing Liu, Wen Wang, Qian Chen, Lujia Bao, Xiangang Li, Zhen-Hua Ling,
Abstract要約: 関数呼び出しデータを生成する自動パイプラインであるGenesisFuncを提案する。提案手法は,下流のツールにまたがって効果的にスケールできる可能性を示し,実世界の応用性を裏付けるものである。
参考スコア（独自算出の注目度）: 48.71974949983576
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Large Language Models (LLMs) extend their capabilities through function-calling (FC), which relies on training data with high quality, diversity, and broad coverage of scenario. However, obtaining and annotating real function-calling data is challenging, while synthetic data from existing pipelines often suffers from unreliable APIs, limited tool scalability, insufficient diversity, and weak quality control. To address these, we present GenesisFunc, an automated pipeline for generating FC training data. Starting from reliable tools in widely used public benchmarks, our GenesisFunc employs a multi-agent framework to support a dialogue generation system that produces conversations spanning diverse scenarios, while maintaining both diversity and quality throughout the process. The accuracy of the data is further reinforced through a multi-stage evaluation system. We fine-tune an 8B LLM on the synthetic dataset and show through extensive experiments that it outperforms similarly sized open-source models in in-domain FC performance and out-of-domain generalization, while reaching FC capabilities comparable to some of the latest API-based models. In addition, our method demonstrates strong potential to scale effectively across downstream tools, underscoring its real-world applicability.
Abstract（参考訳）: 大規模言語モデル(LLM)は、高い品質、多様性、幅広いシナリオのカバレッジを持つトレーニングデータに依存する関数呼び出し(FC)を通じて、その能力を拡張します。しかし、実際の関数呼び出しデータの取得と注釈付けは難しい。一方、既存のパイプラインからの合成データは、信頼性の低いAPI、ツールのスケーラビリティの制限、多様性の欠如、品質管理の弱さに悩まされることが多い。これらの問題に対処するため、FCトレーニングデータを生成するための自動パイプラインであるGenesisFuncを提案する。私たちのGenesisFuncは、広く使われている公開ベンチマークの信頼性の高いツールから始まり、多エージェントフレームワークを使用して、さまざまなシナリオにまたがる会話を生成する対話生成システムをサポートします。多段階評価システムによりデータの精度をさらに高める。合成データセット上で8B LLMを微調整し、最新のAPIベースモデルに匹敵するFC能力を維持しながら、ドメイン内FC性能とドメイン外一般化において、同様の規模のオープンソースモデルを上回る性能を示す広範な実験を行った。さらに,本手法は,下流のツールにまたがって効果的にスケールできる可能性を示し,実世界の応用性を裏付けるものである。

論文の概要: GenesisFunc: Multi-Agent Data Generation for Accurate and Generalizable Function-Calling

関連論文リスト