Fugu-MT Paper Translation (Abstract): Simulating Complex Multi-Turn Tool Calling Interactions in Stateless Execution Environments

Paper overview: Simulating Complex Multi-Turn Tool Calling Interactions in Stateless Execution Environments

arxiv url: http://arxiv.org/abs/2601.19914v1
Date: Tue, 06 Jan 2026 20:04:30 GMT
Status: Translation complete
Last updated in system: 2026-02-02 02:21:38.528037
Title: Simulating Complex Multi-Turn Tool Calling Interactions in Stateless Execution Environments
Authors: Maxwell Crouse, Ibrahim Abdelaziz, Kshitij Fadnis, Siva Sankalp Patel, Kinjal Basu, Chulaka Gunasekara, Sadhana Kumaravel, Asim Munawar, Pavan Kapanipathi
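Abstract summary: DiGiT-TC is designed to produce tool calling conversations that have the characteristics of conversations generated through search in a stateful environment. We validate our approach on standard tool calling benchmarks and demonstrate that, even in stateful problem settings, our approach results in strong performance gains.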
Reference score (independently calculated attention level): 14.539418822648658
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Synthetic data has proven itself to be a valuable resource for tuning smaller, cost-effective language models to handle the complexities of multi-turn tool calling conversations. While many frameworks and systems for producing synthetic multi-turn tool calling data have been proposed, prior works have frequently assumed that any tool calling interactions will take place in an execution environment that maintains state. When such an environment is available, this is advantageous as it allows for the validity of an interaction to be determined by whether or not the state of the execution environment matches some prespecified objective. Unfortunately, this does not hold in many real-world tool use settings, e.g., in enterprise settings where data security is of the utmost importance or in cases where tool specifications are synthesized from multiple sources. In this work, we address this gap by introducing a data generation method, DiGiT-TC, that is designed to produce tool calling conversations that have the characteristics of conversations generated through search in a stateful environment. The key to our technique lies in a novel generation pattern that allows our approach to implicitly represent certain tool calls in the user request. We validate our approach on standard tool calling benchmarks and demonstrate that, even in stateful problem settings, our approach results in strong performance gains.
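As a rough illustration of the state-based validity check that the abstract attributes to stateful execution environments, the sketch below replays a sequence of tool calls in a toy environment and compares the final state with a prespecified goal. This is not the DiGiT-TC method, whose details the abstract does not give; the environment, the tool names (`create_ticket`, `close_ticket`), and the `is_valid` helper are all hypothetical and chosen only for the example.

```python
from dataclasses import dataclass, field


@dataclass
class ToyEnvironment:
    """Hypothetical stateful execution environment (not from the paper)."""
    state: dict = field(default_factory=dict)

    def call(self, tool: str, **args) -> None:
        # Two made-up tools that mutate the environment state.
        if tool == "create_ticket":
            self.state[args["ticket_id"]] = "open"
        elif tool == "close_ticket":
            if args["ticket_id"] not in self.state:
                raise KeyError(f"unknown ticket {args['ticket_id']}")
            self.state[args["ticket_id"]] = "closed"
        else:
            raise ValueError(f"unknown tool {tool}")


def is_valid(tool_calls, goal_state) -> bool:
    """Replay tool calls in a fresh environment and compare the final state to the goal."""
    env = ToyEnvironment()
    try:
        for tool, args in tool_calls:
            env.call(tool, **args)
    except (KeyError, ValueError):
        # A call that the environment rejects makes the interaction invalid.
        return False
    return env.state == goal_state


# Assumed objective: ticket T1 exists and has been closed.
goal = {"T1": "closed"}
good = [("create_ticket", {"ticket_id": "T1"}),
        ("close_ticket", {"ticket_id": "T1"})]
bad = [("create_ticket", {"ticket_id": "T1"})]

print(is_valid(good, goal))  # True
print(is_valid(bad, goal))   # False
```

In a stateless setting no such environment exists to replay the calls against, which is the gap the abstract says DiGiT-TC is meant to address.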
