Fugu-MT Paper Translation (Abstract): SearchSwarm: Towards Delegation Intelligence in Agentic LLMs for Long-Horizon Deep Research

Paper overview: SearchSwarm: Towards Delegation Intelligence in Agentic LLMs for Long-Horizon Deep Research

arxiv url: http://arxiv.org/abs/2606.09730v1
Date: Mon, 08 Jun 2026 16:52:26 GMT
Status: Translation complete
System update date: 2026-06-09 14:42:07.573389
Title: SearchSwarm: Towards Delegation Intelligence in Agentic LLMs for Long-Horizon Deep Research
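Title (reference translation): SearchSwarm: Toward Delegation Intelligence in Agentic LLMs for Long-Horizon Deep Research
Authors: Pu Ning, Quan Chen, Kun Tao, Xinyu Tang, Tianshu Wang, Qianggang Cao, Xinyu Kong, Zujie Wen, Zhiqiang Zhang, Jun Zhou
Abstract summary: Large language models are increasingly expected to handle complex, long-horizon real-world tasks. This paper presents a preliminary exploration targeting deep research as a representative long-horizon agent task. The authors design a harness that guides the model toward high-quality task decomposition and delegation and constrains subagents to return results properly. Their model, SearchSwarm-30B-A3B, achieves 68.1 on BrowseComp and 73.3 on BrowseComp-ZH.
Reference score (attention score computed independently by this site): 17.956691114919987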
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Large language models are increasingly expected to handle complex, long-horizon real-world tasks whose context demands can grow without bound, yet model context windows remain inherently finite. Recent work explores a paradigm where a main agent decomposes tasks and dispatches subtasks to subagents, which execute and return only summarized results, conserving the main agent's context budget. However, performing this well requires delegation intelligence: the ability to decompose complex tasks, determine when and what to delegate, and integrate returned results into the ongoing workflow. Training data for this capability is scarce in naturally occurring text, and to our knowledge, how to synthesize such data and train models to acquire this capability remains largely unexplored in the open-source community. To bridge this gap, we present a preliminary exploration targeting deep research, a representative long-horizon agent task. Specifically, we design a harness that guides the model toward high-quality task decomposition and delegation, while constraining subagents to return results properly to support the main agent's workflow. The harness-guided trajectories naturally encode correct delegation decisions, which we use as supervised fine-tuning data to internalize delegation intelligence into model weights. Our resulting model, SearchSwarm-30B-A3B, achieves 68.1 on BrowseComp and 73.3 on BrowseComp-ZH, the best results among all models of comparable scale. We will release our harness, model weights, and training data to facilitate future research.
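Abstract (reference translation): Large language models are increasingly expected to handle complex, long-horizon real-world tasks whose context demands can grow without bound, yet model context windows are inherently finite. Recent work explores a paradigm in which a main agent decomposes tasks and dispatches subtasks to subagents. Doing this well, however, requires delegation intelligence: the ability to decompose complex tasks, decide when and what to delegate, and integrate returned results into the ongoing workflow. Training data for this capability is scarce in naturally occurring text, and to the authors' knowledge, how to synthesize such data and train models to acquire this capability remains largely unexplored in the open-source community. To bridge this gap, the paper presents a preliminary exploration targeting deep research, a representative long-horizon agent task. Specifically, the authors design a harness that guides the model toward high-quality task decomposition and delegation, while constraining subagents to return results that support the main agent's workflow. The harness-guided trajectories naturally encode correct delegation decisions and are used as supervised fine-tuning data to internalize delegation intelligence into the model weights. The resulting model, SearchSwarm-30B-A3B, achieves 68.1 on BrowseComp and 73.3 on BrowseComp-ZH. The harness, model weights, and training data will be released to facilitate future research.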
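
The delegation paradigm described in the abstract (a main agent decomposes a task, dispatches subtasks to subagents, and keeps only their summarized results in its own context) can be illustrated with a toy sketch. This is not the authors' harness: every name here (`MainAgent`, `run_subagent`, `toy_decompose`, `toy_work`, `max_summary_chars`) is hypothetical, and truncation stands in for whatever summarization a real subagent would perform.

```python
from dataclasses import dataclass, field
from typing import Callable, List


@dataclass
class SubagentResult:
    subtask: str
    summary: str  # only this short summary reaches the main agent


def run_subagent(subtask: str, work: Callable[[str], str],
                 max_summary_chars: int = 80) -> SubagentResult:
    """Run a subtask in its own context and return only a short summary.

    Truncation is a placeholder assumption for real summarization.
    """
    full_trace = work(subtask)  # potentially long; never enters the main context
    return SubagentResult(subtask, full_trace[:max_summary_chars])


@dataclass
class MainAgent:
    context: List[str] = field(default_factory=list)

    def solve(self, task: str,
              decompose: Callable[[str], List[str]],
              work: Callable[[str], str]) -> List[str]:
        """Decompose the task, delegate each subtask, and integrate summaries."""
        self.context.append(f"task: {task}")
        for subtask in decompose(task):
            result = run_subagent(subtask, work)
            self.context.append(f"{result.subtask} -> {result.summary}")
        return self.context


def toy_decompose(task: str) -> List[str]:
    # Hypothetical decomposition: split every task into two parts.
    return [f"{task} / part {i}" for i in (1, 2)]


def toy_work(subtask: str) -> str:
    # Hypothetical subagent work that produces a long trace.
    return f"findings for {subtask}: " + "x" * 500


if __name__ == "__main__":
    for line in MainAgent().solve("survey topic", toy_decompose, toy_work):
        print(line)
```

The point of the sketch is the context budget: each subagent's 500-plus-character trace stays in its own scope, and the main agent's context grows by only one short line per subtask.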

Related papers