Fugu-MT 論文翻訳(概要): LatentSkill: From In-Context Textual Skills to In-Weight Latent Skills for LLM Agents

論文の概要: LatentSkill: From In-Context Textual Skills to In-Weight Latent Skills for LLM Agents

arxiv url: http://arxiv.org/abs/2606.06087v1
Date: Thu, 04 Jun 2026 12:26:09 GMT
ステータス: 翻訳完了
システム内更新日: 2026-06-05 22:39:44.783783
Title: LatentSkill: From In-Context Textual Skills to In-Weight Latent Skills for LLM Agents
Title（参考訳）: LatentSkill: LLMエージェントのインテクストスキルからインウェイト潜在スキルへ
Authors: Aofan Yu, Chenyu Zhou, Tianyi Xu, Zihan Guo, Rong Shan, Zhihui Fu, Jun Wang, Weiwen Liu, Yong Yu, Weinan Zhang, Jianghao Lin,
Abstract要約: テキストスキルをプラグアンドプレイのLoRAアダプタに変換するフレームワークであるLatentSkillを紹介する。 LatentSkillは、コンテキスト空間よりも重み空間にスキル知識を格納し、ステップ単位のスキルトークンを削除する。 ALFWorldとSearch-QAでは、LatentSkillは、かなり少ないプリフィルトークンを使用しながら、対応するインコンテキストスキルベースラインよりもパフォーマンスが高い。
参考スコア（独自算出の注目度）: 40.33528208274459
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Agent systems increasingly use textual skills to encode reusable task procedures, but injecting these skills into the prompt at every step incurs substantial context overhead and exposes skill content as plaintext. We present LatentSkill, a framework that converts textual skills into plug-and-play LoRA adapters through a pretrained hypernetwork. LatentSkill stores skill knowledge in weight space rather than context space, removing per-step skill tokens while preserving modular loading, scaling, and composition. On ALFWorld and Search-QA, LatentSkill outperforms the corresponding in-context skill baseline while using substantially fewer prefill tokens: it improves ALFWorld success by 21.4 and 13.4 points on the seen and unseen splits with 64.1% fewer prefill tokens, and improves Search-QA exact match by 3.0 points with 72.2% lower skill-token overhead. Further analysis shows that generated skill LoRAs form a structured semantic geometry, can be precisely controlled via the LoRA scaling coefficient, and can be composed through parameter-space arithmetic when skill components are aligned. These findings suggest that weight-space skills provide an efficient, modular, and less exposed substrate for extending LLM agents.
Abstract（参考訳）: エージェントシステムは、再利用可能なタスクプロシージャをエンコードするためにテキストスキルを使用することが多いが、各ステップでプロンプトにこれらのスキルを注入すると、かなりのコンテキストオーバーヘッドが発生し、スキルコンテンツをプレーンテキストとして公開する。テキストスキルをプラグイン・アンド・プレイのLoRAアダプタに変換するフレームワークであるLatentSkillを,トレーニング済みのハイパーネットワークを通じて紹介する。 LatentSkillは、コンテキスト空間ではなく、スキル知識を重み空間に格納し、モジュールローディング、スケーリング、コンポジションを維持しながら、ステップ単位のスキルトークンを削除する。 ALFWorld と Search-QA では、LatentSkill は、対応するコンテキスト内スキルベースラインよりもはるかに少ないプリフィルトークンを使用し、ALFWorld の成功率を 21.4 と 13.4 ポイント改善し、64.1% のプリフィルトークンで、未確認のスプリットを 64.1% 削減し、サーチ-QA の正確なマッチを 3.0 ポイント改善し、スキルトーケンオーバーヘッドを 72.2% 下げた。さらに分析したところ、生成したスキルLoRAが構造化セマンティックジオメトリを形成し、LoRAスケーリング係数を介して正確に制御でき、スキルコンポーネントのアライメント時にパラメータ空間演算によって構成できることがわかった。これらの結果から, 重量空間技術はLLM剤の伸長に有効で, モジュール性があり, 露出が少ないことが示唆された。

論文の概要: LatentSkill: From In-Context Textual Skills to In-Weight Latent Skills for LLM Agents

関連論文リスト