Fugu-MT 論文翻訳(概要): Memory Limitations of Prompt Tuning in Transformers

論文の概要: Memory Limitations of Prompt Tuning in Transformers

arxiv url: http://arxiv.org/abs/2509.00421v1
Date: Sat, 30 Aug 2025 09:08:00 GMT
ステータス: 翻訳完了
システム内更新日: 2025-09-04 15:17:03.225069
Title: Memory Limitations of Prompt Tuning in Transformers
Title（参考訳）: 変圧器のプロンプトチューニングのメモリ制限
Authors: Maxime Meyer, Mario Michelessa, Caroline Chaux, Vincent Y. F. Tan,
Abstract要約: 本研究では, 変圧器が記憶する情報量は, 即時長よりも高速に拡張できないことを示す。また,大規模言語モデルで経験的に観察された現象,すなわち性能劣化の最初の公式な証明も提示する。この発見は、トランスフォーマーアーキテクチャの本質的な制限に関する根本的な理解を提供する。
参考スコア（独自算出の注目度）: 45.158621811869466
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Despite the empirical success of prompt tuning in adapting pretrained language models to new tasks, theoretical analyses of its capabilities remain limited. Existing theoretical work primarily addresses universal approximation properties, demonstrating results comparable to standard weight tuning. In this paper, we explore a different aspect of the theory of transformers: the memorization capability of prompt tuning. We provide two principal theoretical contributions. First, we prove that the amount of information memorized by a transformer cannot scale faster than linearly with the prompt length. Second, and more importantly, we present the first formal proof of a phenomenon empirically observed in large language models: performance degradation in transformers with extended contexts. We rigorously demonstrate that transformers inherently have limited memory, constraining the amount of information they can retain, regardless of the context size. This finding offers a fundamental understanding of the intrinsic limitations of transformer architectures, particularly their ability to handle long sequences.
Abstract（参考訳）: 事前訓練された言語モデルを新しいタスクに適用する際の即時チューニングの実証的な成功にもかかわらず、その能力に関する理論的分析は限定的のままである。既存の理論的な研究は主に普遍近似特性を扱い、標準ウェイトチューニングに匹敵する結果を示す。本稿では,変圧器理論の異なる側面,即時チューニングの記憶能力について考察する。主な理論的貢献は2つある。まず, 変圧器が記憶する情報量が, 即時長で線形に拡張できないことを証明する。第2に,大言語モデルで経験的に観察された現象の初めての公式な証明として,拡張文脈をもつ変圧器の性能劣化を示す。コンテクストのサイズに関わらず、変換器は本質的に限られたメモリを持ち、保持できる情報の量を制限することを厳格に実証する。この発見は、トランスフォーマーアーキテクチャの本質的な制限、特に長いシーケンスを扱う能力に関する根本的な理解を提供する。

論文の概要: Memory Limitations of Prompt Tuning in Transformers

関連論文リスト