Fugu-MT 論文翻訳(概要): ToolOrchestra: Elevating Intelligence via Efficient Model and Tool Orchestration

論文の概要: ToolOrchestra: Elevating Intelligence via Efficient Model and Tool Orchestration

arxiv url: http://arxiv.org/abs/2511.21689v1
Date: Wed, 26 Nov 2025 18:59:46 GMT
ステータス: 翻訳完了
システム内更新日: 2025-11-27 18:37:59.263765
Title: ToolOrchestra: Elevating Intelligence via Efficient Model and Tool Orchestration
Title（参考訳）: ToolOrchestra: 効率的なモデルとツールオーケストレーションによるインテリジェンス向上
Authors: Hongjin Su, Shizhe Diao, Ximing Lu, Mingjie Liu, Jiacheng Xu, Xin Dong, Yonggan Fu, Peter Belcak, Hanrong Ye, Hongxu Yin, Yi Dong, Evelina Bakhturina, Tao Yu, Yejin Choi, Jan Kautz, Pavlo Molchanov,
Abstract要約: 私たちは、他のモデルやさまざまなツールを管理する小さなオーケストレータが、どちらもインテリジェンスの上限を押し上げることができることを示しています。インテリジェントなツールをコーディネートする,小さなオーケストレータのトレーニング方法であるToolOrchestraを紹介する。 ToolOrchestraを使うことで、従来のツール使用エージェントよりも低コストで精度の高い8BモデルであるOrchestratorを作成できる。
参考スコア（独自算出の注目度）: 110.24410841004777
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Large language models are powerful generalists, yet solving deep and complex problems such as those of the Humanity's Last Exam (HLE) remains both conceptually challenging and computationally expensive. We show that small orchestrators managing other models and a variety of tools can both push the upper bound of intelligence and improve efficiency in solving difficult agentic tasks. We introduce ToolOrchestra, a method for training small orchestrators that coordinate intelligent tools. ToolOrchestra explicitly uses reinforcement learning with outcome-, efficiency-, and user-preference-aware rewards. Using ToolOrchestra, we produce Orchestrator, an 8B model that achieves higher accuracy at lower cost than previous tool-use agents while aligning with user preferences on which tools are to be used for a given query. On HLE, Orchestrator achieves a score of 37.1%, outperforming GPT-5 (35.1%) while being 2.5x more efficient. On tau2-Bench and FRAMES, Orchestrator surpasses GPT-5 by a wide margin while using only about 30% of the cost. Extensive analysis shows that Orchestrator achieves the best trade-off between performance and cost under multiple metrics, and generalizes robustly to unseen tools. These results demonstrate that composing diverse tools with a lightweight orchestration model is both more efficient and more effective than existing methods, paving the way for practical and scalable tool-augmented reasoning systems.
Abstract（参考訳）: 大規模な言語モデルは強力なジェネラリストであるが、HLE(Humanity's Last Exam)のような深い複雑な問題を解くことは概念的には困難であり、計算コストも高いままである。私たちは、他のモデルやさまざまなツールを管理する小さなオーケストレータが、インテリジェンスの上層部を押し上げ、難しいエージェントタスクを解決するための効率を改善することができることを示しています。インテリジェントなツールをコーディネートする,小さなオーケストレータのトレーニング方法であるToolOrchestraを紹介する。 ToolOrchestraでは、成果、効率性、ユーザ優先の報酬による強化学習を明示的に使用しています。 ToolOrchestraを使うと、Orchestratorは従来のツール使用エージェントよりも低コストで精度の高い8Bモデルになります。 HLEでは、Orchestratorのスコアは37.1%で、GPT-5(35.1%)を上回っ、効率は2.5倍である。 tau2-Bench と FRAMES では、Orchestrator は GPT-5 をはるかに上回り、コストの 30% しか使用していない。大規模な分析によると、Orchestratorは複数のメトリクスでパフォーマンスとコストの最良のトレードオフを実現し、目に見えないツールにしっかりと一般化している。これらの結果は、軽量なオーケストレーションモデルで多様なツールを構成することは、既存の方法よりも効率的かつ効果的であることを示し、実用的でスケーラブルなツール拡張推論システムへの道を開いた。

論文の概要: ToolOrchestra: Elevating Intelligence via Efficient Model and Tool Orchestration

関連論文リスト