Fugu-MT 論文翻訳(概要): Higher-order interactions of multi-layer prompt

論文の概要: Higher-order interactions of multi-layer prompt

arxiv url: http://arxiv.org/abs/2510.09394v2
Date: Thu, 16 Oct 2025 15:48:27 GMT
ステータス: 翻訳完了
システム内更新日: 2025-10-17 14:17:28.079227
Title: Higher-order interactions of multi-layer prompt
Title（参考訳）: 多層プロンプトの高次相互作用
Authors: Ziyu Zheng, Yaming Yang, Ziyu Guan, Wei Zhao, Xinyan Huang, Weigang Lu,
Abstract要約: 「プレトレイン・プロンプト」パラダイムは表現学習において成功している。現在のプロンプトチューニング手法は、プロンプトを異なるネットワーク層にまたがる独立した独立したコンポーネントとして扱う。本稿では,多層プロンプトの高次相互作用を明示的にモデル化する新しいフレームワークを提案する。
参考スコア（独自算出の注目度）: 22.205298192935313
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: The "pre-train, prompt" paradigm has successfully evolved in representation learning. While current prompt-tuning methods often introduce learnable prompts, they predominantly treat prompts as isolated, independent components across different network layers. This overlooks the complex and synergistic higher-order interactions that exist between prompts at various hierarchical depths, consequently limiting the expressive power and semantic richness of the prompted model. To address this fundamental gap, we propose a novel framework that explicitly models the Higher-order Interactions of Multi-layer Prompt. Our approach conceptualizes prompts from different layers not as separate entities, but as a cohesive system where their inter-relationships are critical. We design an innovative interaction module that captures these sophisticated, non-linear correlations among multi-layer prompts, effectively modeling their cooperative effects. This allows the model to dynamically aggregate and refine prompt information across the network's depth, leading to a more integrated and powerful prompting strategy. Extensive experiments on eight benchmark datasets demonstrate that our method, by leveraging these higher-order interactions, consistently surpasses state-of-the-art prompt-tuning baselines. The performance advantage is particularly pronounced in few-shot scenarios, validating that capturing the intricate interplay between multi-layer prompts is key to unlocking more robust and generalizable representation learning.
Abstract（参考訳）: プレトレイン、プロンプト」パラダイムは表現学習においてうまく進化してきた。現在のプロンプトチューニング手法は学習可能なプロンプトを導入することが多いが、主に異なるネットワーク層にまたがる独立した独立したコンポーネントとしてプロンプトを扱う。これは、様々な階層的な深さのプロンプトの間に存在する複雑でシナジスティックな高次相互作用を見落とし、結果として、引き起こされるモデルの表現力と意味的なリッチさを制限する。この基本的なギャップに対処するため、我々は多層プロンプトの高次相互作用を明示的にモデル化する新しいフレームワークを提案する。本手法は, 異なる層からのプロンプトを個別の実体ではなく, 相互関係が重要となる凝集系として概念化する。我々は、これらの高度で非線形な関係を多層プロンプト間で捉え、協調効果を効果的にモデル化する革新的な相互作用モジュールを設計する。これにより、モデルはネットワークの深さをまたいだプロンプト情報を動的に集約し、洗練し、より統合され強力なプロンプト戦略へと導くことができる。 8つのベンチマークデータセットの大規模な実験により、これらの高次相互作用を活用することで、我々の手法が最先端のプロンプトチューニングベースラインを一貫して超越していることが示される。パフォーマンス上の優位性は特に、多層プロンプト間の複雑な相互作用をキャプチャすることが、より堅牢で一般化可能な表現学習の鍵であることを証明している。

論文の概要: Higher-order interactions of multi-layer prompt

関連論文リスト