Fugu-MT 論文翻訳(概要): SoftSkill: Behavioral Compression for Contextual Adaptation

論文の概要: SoftSkill: Behavioral Compression for Contextual Adaptation

arxiv url: http://arxiv.org/abs/2606.20333v1
Date: Thu, 18 Jun 2026 15:04:47 GMT
ステータス: 翻訳完了
システム内更新日: 2026-06-19 18:23:39.93641
Title: SoftSkill: Behavioral Compression for Contextual Adaptation
Title（参考訳）: SoftSkill: コンテキスト適応のための行動圧縮
Authors: Xijia Tao, Yihua Teng, Xinyu Fu, Ziru Liu, Kecheng Chen, Yuzhi Zhao, Suiyun Zhang, Rui Liu, Lingpeng Kong,
Abstract要約: 本稿では、自然言語スキルがコンパクトな連続コンテキストオブジェクトを初期化できるかどうかを問う。そこで本研究では,凍結したバックボーン法であるSoftSkillを提案する。 Skillとは対照的に、SoftSkillはSearchQAでは5.2ポイント、LiveMathでは12.5ポイントの精度向上を実現している。
参考スコア（独自算出の注目度）: 45.10778084241114
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Agent skills are commonly deployed as natural-language Markdown files that encode answer policies, evidence-use habits, and task procedures. These files are readable and portable, but they are consumed indirectly: for each task instance, a frozen language model must translate a long textual artifact into generation-time behavior. This paper asks whether a natural-language skill can instead initialize a compact continuous context object, refined by a trainable soft delta while the base model remains frozen. We propose SoftSkill, a frozen-backbone method that tunes such soft skills with next-token prediction and deploys them as latent behavioral priors at inference time. In our main single-round setting, a length-32 SoftSkill prefix on Qwen3.5-4B improves over no-skill prompting by 8.3 points on SearchQA, 42.1 points on LiveMath, and 1.3 points on DocVQA. Relative to SkillOpt, SoftSkill improves accuracy by 5.2 points on SearchQA and 12.5 points on LiveMath, while replacing hundreds to thousands of Markdown skill tokens with a few virtual tokens. We further study agentic execution as a harder boundary case, where sparse trajectory imitation provides useful signal but does not yet robustly compress long-horizon procedural behavior. More broadly, the results suggest that some task skills are better treated not as additional Markdown to be reinterpreted at inference time, but as compact latent controls over how a frozen model enters the task.
Abstract（参考訳）: エージェントスキルは一般的に、回答ポリシー、エビデンス利用習慣、タスク手順をエンコードする自然言語のMarkdownファイルとしてデプロイされる。これらのファイルは読みやすくポータブルだが、間接的に消費される。各タスクインスタンスに対して、凍結された言語モデルは長いテキストのアーティファクトを生成時の振る舞いに変換する必要がある。本稿では,学習可能なソフトデルタによって改良されたコンパクトな連続文脈オブジェクトを,ベースモデルが凍結状態のままに初期化できるかどうかを問う。そこで本研究では,そのソフトスキルを次世代の予測で調整し,予測時に潜時行動前処理として展開する,フリーズバックボーン方式のSoftSkillを提案する。メインのシングルラウンド設定では、Qwen3.5-4Bの32のSoftSkillプレフィックスは、検索QAの8.3ポイント、LiveMathの42.1ポイント、DocVQAの1.3ポイントよりも向上する。 SkillOptとは対照的に、SoftSkillは検索QAで5.2ポイント、LiveMathで12.5ポイント、Markdownスキルトークンで数百から数千の仮想トークンを置き換える。我々はさらに,スパース軌跡模倣が有用な信号を提供するが,長期の手続き動作を頑健に圧縮しない,という,より難しい境界条件としてのエージェント実行について検討した。より広義には、いくつかのタスクスキルは、推論時にMarkdownを追加して再解釈するよりも、凍結モデルがどのようにタスクに入るかに関するコンパクトな潜在性制御として、より良く扱われることを示している。

論文の概要: SoftSkill: Behavioral Compression for Contextual Adaptation

関連論文リスト