Fugu-MT 論文翻訳(概要): From Raw Experience to Skill Consumption: A Systematic Study of Model-Generated Agent Skills

論文の概要: From Raw Experience to Skill Consumption: A Systematic Study of Model-Generated Agent Skills

arxiv url: http://arxiv.org/abs/2605.23899v1
Date: Fri, 22 May 2026 17:59:12 GMT
ステータス: 翻訳完了
システム内更新日: 2026-05-25 17:29:20.463938
Title: From Raw Experience to Skill Consumption: A Systematic Study of Model-Generated Agent Skills
Title（参考訳）: 経験からスキル消費へ:モデル生成エージェントスキルの体系的研究
Authors: Zisu Huang, Jingwen Xu, Yifan Yang, Ziyang Gong, Qihao Yang, Muzhao Tian, Xiaohua Wang, Changze Lv, Xuemei Gao, Qi Dai, Bei Liu, Kai Qiu, Xue Yang, Dongdong Chen, Xiaoqing Zheng, Chong Luo,
Abstract要約: モデル生成スキルは平均的に有益であるが,非自明な負の伝達を示す。モデルは強力な抽出器でありながら、弱い消費者、あるいはその逆もまた、モデルスケールやベースラインタスク強度とは無関係に、スキルユーティリティを持つことができる。本研究は,これらの知見を,スキル抽出のガイドとなる具体的なエフェメタスキルに翻訳する。
参考スコア（独自算出の注目度）: 46.76651106969111
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Language agents increasingly improve by reusing \emph{skills} -- structured procedural artifacts distilled from past experience. In particular, \emph{domain-level} and \emph{model-generated} skills are especially promising. They offer fast adaptation within a domain by encoding domain-specific recurring procedures, and they scale beyond labor-intensive hand-crafting. However, while extraction methods continue to proliferate, understanding remains limited, with no comprehensive study spanning the full skill lifecycle -- \textbf{experience generation}, \textbf{skill extraction}, and \textbf{skill consumption} -- to ask whether such skills actually work, when they work, and what makes them succeed or fail. To close this gap, we build a utility-grounded evaluation framework that provides systematic experimental results across extractors and target agents, covering five diverse agentic task domains. We find that model-generated skills are beneficial on average but exhibit non-trivial negative transfer, and that neither extractors nor targets behave uniformly. A model can be a strong extractor yet a weak consumer, or vice versa, with skill utility independent of model scale or baseline task strength. To explain these patterns, we then dissect each lifecycle stage in depth, analyzing how experience composition shapes skill quality, what properties characterize useful skills, and how the same skill transfers across different consumers. Finally, we translate these findings into a concrete \emph{meta-skill} that guides skill extraction toward the features tied to actual utility, which consistently improves skill quality across domains and substantially reduces negative transfer.
Abstract（参考訳）: 言語エージェントは、過去の経験から抽出された構造化手続き的アーティファクトを再利用することで、ますます改善される。特に、 \emph{domain-level} と \emph{model- generated} スキルは特に有望である。ドメイン固有の反復手順をコーディングすることで、ドメイン内での迅速な適応を提供し、労働集約的な手作りを超えてスケールする。しかしながら、抽出方法が拡大を続ける一方で、理解は限定的であり、完全なスキルライフサイクル – \textbf{experience generation}、 \textbf{skill extract}、 \textbf{skill consumption} – にまたがる包括的な研究は行われていない。このギャップを埋めるために,5つのエージェントタスクドメインをカバーする抽出器と対象エージェントを対象とする,系統的な実験結果を提供するユーティリティグラウンド評価フレームワークを構築した。モデル生成スキルは平均的に有益であるが,非自明な負の伝達を示し,抽出者も対象者も一様に振る舞わないことがわかった。モデルは強力な抽出器でありながら、弱い消費者、あるいはその逆もまた、モデルスケールやベースラインタスク強度とは無関係に、スキルユーティリティを持つことができる。これらのパターンを説明するために、我々は各ライフサイクルステージを深く分析し、経験的構成がいかにスキルの質を形作るか、どのような特性が有用なスキルを特徴付けるか、そして異なる消費者間で同じスキルが伝達されるかを分析した。最後に、これらの知見を具体的「emph{meta-skill」に翻訳し、実際のユーティリティに関連付けられた特徴に対してスキル抽出を誘導し、ドメイン間のスキル品質を一貫して改善し、負の転送を大幅に削減する。

論文の概要: From Raw Experience to Skill Consumption: A Systematic Study of Model-Generated Agent Skills

関連論文リスト