Fugu-MT Paper Translation (Abstract): Adaptive Task Vectors for Large Language Models

Paper overview: Adaptive Task Vectors for Large Language Models

arxiv url: http://arxiv.org/abs/2506.03426v1
Date: Tue, 03 Jun 2025 22:12:28 GMT
Status: Translation complete
Last updated in system: 2025-06-05 21:20:14.070692
Title: Adaptive Task Vectors for Large Language Models
Authors: Joonseong Kang, Soojeong Lee, Subeen Park, Sumin Park, Taero Kim, Jihee Kim, Ryunyi Lee, Kyungwoo Song
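Abstract summary: Adaptive Task Vectors (ATV) is a simple and effective framework that dynamically generates task vectors conditioned on each input query. ATV demonstrates strong performance and generalization capabilities, even on unseen tasks.
Reference score (attention metric computed independently by this site): 14.108866468832623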
License: http://creativecommons.org/licenses/by/4.0/
Abstract: In-Context Learning (ICL) enables Large Language Models (LLMs) to perform tasks without parameter updates by conditioning on a few demonstrations provided in the prompt. Despite its success, ICL suffers from several limitations, including sensitivity to demonstration order, context length constraints, and computational inefficiency. To address these challenges, task vector-based approaches compress task information into a single vector. However, these methods typically construct task vectors from fixed sets of demonstrations and reuse them across input queries, without conditioning on the specific input. This limitation can lead models to struggle with effective adaptation when the input query is not well aligned with the underlying demonstrations, consequently degrading their generalization performance on unseen tasks. To overcome this limitation, we propose Adaptive Task Vectors (ATV), a simple and effective framework that dynamically generates task vectors conditioned on each input query. ATV employs a small language model to generate task vectors, which are then transformed to match the target LLM's architecture and applied to guide its output generation. In contrast to ICL and previous vector-based approaches, which rely on fixed demonstration sets and their corresponding vectors, ATV dynamically generates task vectors tailored to each specific input query and task. Consequently, ATV demonstrates strong performance and generalization capabilities, even for unseen tasks. Furthermore, we provide a theoretical analysis indicating that ATV is expressively equivalent to LoRA under equal rank budgets and more expressive than Prefix-Tuning, thereby offering formal support for its representational advantage.
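Illustrative sketch (not from the paper): the abstract describes a three-step flow in which a small model produces a query-conditioned vector, that vector is transformed to fit the target LLM, and the result guides generation. The toy Python below mirrors only that flow. Everything specific in it is an assumption made for illustration: the function names, the toy dimensions, the seeded-random stand-in for the small language model, the per-layer linear expansion, and the choice to add the vector to a layer's last-token hidden state. The abstract does not specify any of these details.

```python
import random

# Toy sizes, chosen only for illustration (assumptions, not from the paper).
D_SMALL, D_TARGET, N_LAYERS = 4, 6, 2


def small_model_encode(query: str) -> list[float]:
    """Hypothetical stand-in for the small language model.

    Maps a query string to a D_SMALL-dimensional vector, deterministically
    per query, by seeding a private random generator with the query text.
    """
    rng = random.Random(query)
    return [rng.uniform(-1.0, 1.0) for _ in range(D_SMALL)]


def make_expansion(seed: int = 0) -> list[list[list[float]]]:
    """Hypothetical expansion: one (D_TARGET x D_SMALL) matrix per target layer."""
    rng = random.Random(seed)
    return [
        [[rng.gauss(0.0, 0.1) for _ in range(D_SMALL)] for _ in range(D_TARGET)]
        for _ in range(N_LAYERS)
    ]


def generate_task_vectors(query: str, expansion) -> list[list[float]]:
    """Build one D_TARGET-dimensional task vector per layer for this query."""
    z = small_model_encode(query)
    return [
        [sum(w * x for w, x in zip(row, z)) for row in layer]
        for layer in expansion
    ]


def apply_task_vectors(hidden, task_vectors, scale: float = 1.0):
    """Add each layer's task vector to that layer's last-token hidden state.

    Additive steering is an assumption of this sketch.
    """
    return [
        [h + scale * v for h, v in zip(h_layer, v_layer)]
        for h_layer, v_layer in zip(hidden, task_vectors)
    ]


expansion = make_expansion()
tv_a = generate_task_vectors("Translate 'cat' to French", expansion)
tv_b = generate_task_vectors("Is 17 prime?", expansion)

# Placeholder hidden states (all zeros) standing in for the target LLM.
hidden = [[0.0] * D_TARGET for _ in range(N_LAYERS)]
steered = apply_task_vectors(hidden, tv_a)
```

The point of the sketch is the contrast drawn in the abstract: a fixed task vector would be computed once from a demonstration set and reused, whereas here `tv_a` and `tv_b` differ because each is derived from its own query.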
