Fugu-MT 論文翻訳(概要): SearchSkill: Teaching LLMs to Use Search Tools with Evolving Skill Banks

論文の概要: SearchSkill: Teaching LLMs to Use Search Tools with Evolving Skill Banks

arxiv url: http://arxiv.org/abs/2605.09038v2
Date: Thu, 14 May 2026 10:12:06 GMT
ステータス: 翻訳完了
システム内更新日: 2026-05-15 21:45:34.372317
Title: SearchSkill: Teaching LLMs to Use Search Tools with Evolving Skill Banks
Title（参考訳）: 検索スキル:LLMにスキルバンクを進化させる検索ツールを教える
Authors: Jinchao Hu, Meizhi Zhong, Kehai Chen, Min Zhang,
Abstract要約: 本稿では,再利用可能な検索スキルによってクエリ計画を明確にするフレームワークを提案する。オープンソースモデルとクローズドソースモデル全体で、SearchSkillは知識集約型QAベンチマークの正確なマッチを改善している。これらの結果は、明示的なスキル条件付きクエリプランニングが、未分化のアクションとして検索を扱うための軽量な代替手段であることを示唆している。
参考スコア（独自算出の注目度）: 32.240602087886494
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Teaching language models to use search tools is not only a question of whether they search, but also of whether they issue good queries. This is especially important in open-domain question answering, where broad or copied queries often waste retrieval budget and derail later reasoning. We propose \Ours, a framework that makes query planning explicit through reusable search skills. At each step, the model first selects a skill, then generates a search or answer action conditioned on the selected skill card. The skill inventory itself is not fixed: SearchSkill maintains an evolving SkillBank, expands or refines it from recurrent failure patterns, and reconstructs affected trajectories before supervised training. The resulting two-stage SFT recipe aligns training with the inference-time protocol of skill selection followed by skill-grounded execution. Across open-source and closed-source models, SearchSkill improves exact match on knowledge-intensive QA benchmarks and yields better retrieval behavior, including fewer copied first queries, more atomic hop-focused queries, and more correct answers within a small search budget. These results suggest that explicit skill-conditioned query planning is a lightweight alternative to treating search as an undifferentiated action.
Abstract（参考訳）: 検索ツールを使うための言語モデルを教えることは、検索するかどうかだけでなく、良いクエリを発行するかどうかという問題でもある。これは、広範またはコピーされたクエリが検索予算を浪費し、後続の推論を遅らせる、オープンドメインの質問応答において特に重要である。再利用可能な検索スキルによってクエリ計画を明確にするフレームワークである‘Ours’を提案する。各ステップで、まず、モデルがスキルを選択し、選択されたスキルカードに条件付き検索または回答アクションを生成する。 SearchSkillは進化を続けるSkillBankを維持し、繰り返し発生する障害パターンからそれを拡張または洗練し、教師付きトレーニングの前に影響のあるトラジェクトリを再構築する。得られた2段階のSFTレシピは、スキル選択の推論時プロトコルとトレーニングを整合させ、その後、スキル接地実行を行う。オープンソースモデルとクローズドソースモデル全体で、SearchSkillは知識集約型QAベンチマークの正確なマッチングを改善し、コピーされた最初のクエリの少ない、アトミックなホップ中心のクエリ、小さな検索予算内でのより正確な回答など、より良い検索動作を提供する。これらの結果は、明示的なスキル条件付きクエリプランニングが、未分化のアクションとして検索を扱うための軽量な代替手段であることを示唆している。

論文の概要: SearchSkill: Teaching LLMs to Use Search Tools with Evolving Skill Banks

関連論文リスト