Fugu-MT 論文翻訳(概要): Skill Neologisms: Towards Skill-based Continual Learning

論文の概要: Skill Neologisms: Towards Skill-based Continual Learning

arxiv url: http://arxiv.org/abs/2605.04970v1
Date: Wed, 06 May 2026 14:27:12 GMT
ステータス: 翻訳完了
システム内更新日: 2026-05-07 18:41:07.874431
Title: Skill Neologisms: Towards Skill-based Continual Learning
Title（参考訳）: スキル・ネオロジズム : スキル・ベースの継続的な学習を目指して
Authors: Antonin Berthon, Nicolas Astorga, Mihaela van der Schaar,
Abstract要約: スキル・ネオロジズムは、アウト・オブ・ディストリビューション・スキルによって構成されながら、特定のスキルのモデル能力を改善するために学習できることが示される。これらの結果から,スキル・ネオロジズムは,スキル・ベースの継続的学習へのスケーラブルな道筋を提供する可能性が示唆された。
参考スコア（独自算出の注目度）: 46.44104273347049
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Modern LLMs show mastery over an ever-growing range of skills, as well as the ability to compose them flexibly. However, extending model capabilities to new skills in a scalable manner is an open-problem: fine-tuning and parameter-efficient variants risk catastrophic forgetting, while context-based approaches have limited expressiveness and are constrained by the model's effective context. We explore skill neologisms--i.e., soft tokens integrated in the model's vocabulary and optimized to improve capabilities over a specific skill--as a way to selectively extend model capabilities to new skills without weight updates. We first observe that off-the-shelf pre-trained LLMs already demonstrate tokens associated with procedural knowledge. We then show that skill neologisms can be learned to improve model capabilities on specific skills while being composable with out-of-distribution skills, and that independently trained skill neologisms can be composed zero-shot. These results suggest that skill neologisms may provide a scalable path towards skill-based continual learning.
Abstract（参考訳）: 現代のLLMは、成長を続けるスキルに対する熟達と、それらを柔軟に構成する能力を示している。しかし、モデル能力をスケーラブルな方法で新しいスキルに拡張することはオープン・プロブレムである: 微調整とパラメータ効率の良い変種は破滅的な忘れをし、コンテキストベースのアプローチは表現力に制限があり、モデルの効果的なコンテキストによって制約される。モデルのボキャブラリに統合され、特定のスキルよりも機能を改善するために最適化されたソフトトークンを、重み付けなしでモデル能力を新しいスキルに選択的に拡張する方法として、スキル・ネオロジズムを探求する。まず、既成のLLMが、手続き的知識に関連するトークンを既に示していることを観察する。次に、スキル・ネオロジズムが、アウト・オブ・ディストリビューション・スキルによって構成されながら、特定のスキルのモデル能力を改善するために学習できることを示し、独立に訓練されたスキル・ネオロジズムをゼロショットで構成できることを示す。これらの結果から,スキル・ネオロジズムは,スキル・ベースの継続的学習へのスケーラブルな道筋を提供する可能性が示唆された。

論文の概要: Skill Neologisms: Towards Skill-based Continual Learning

関連論文リスト