Fugu-MT 論文翻訳(概要): Beyond Interleaving: Causal Attention Reformulations for Generative Recommender Systems

論文の概要: Beyond Interleaving: Causal Attention Reformulations for Generative Recommender Systems

arxiv url: http://arxiv.org/abs/2603.10369v1
Date: Wed, 11 Mar 2026 03:30:49 GMT
ステータス: 翻訳完了
システム内更新日: 2026-03-12 16:22:32.762957
Title: Beyond Interleaving: Causal Attention Reformulations for Generative Recommender Systems
Title（参考訳）: インターリービングを超えて:ジェネレーティブレコメンダシステムのための因果注意改革
Authors: Hailing Cheng,
Abstract要約: AttnLFA(Late Fusion for Actions)とAttnMVP(Mixed Value Pooling)を紹介する。これらのモデルは、Transformerベースのシーケンスモデリングの表現力を維持しながら、$i_n rightarrow a_n$因果依存性を明示的にエンコードする。我々は,大手ソーシャルネットワークからの大規模製品レコメンデーションデータに関するフレームワークの評価を行った。
参考スコア（独自算出の注目度）: 0.14930644306169805
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Generative Recommender Systems (GR) increasingly model user behavior as a sequence generation task by interleaving item and action tokens. While effective, this formulation introduces significant structural and computational inefficiencies: it doubles sequence length, incurs quadratic overhead, and relies on implicit attention to recover the causal relationship between an item and its associated action. Furthermore, interleaving heterogeneous tokens forces the Transformer to disentangle semantically incompatible signals, leading to increased attention noise and reduced representation efficiency.In this work, we propose a principled reformulation of generative recommendation that aligns sequence modeling with underlying causal structures and attention theory. We demonstrate that current interleaving mechanisms act as inefficient proxies for similarity-weighted action pooling. To address this, we introduce two novel architectures that eliminate interleaved dependencies to reduce sequence complexity by 50%: Attention-based Late Fusion for Actions (AttnLFA) and Attention-based Mixed Value Pooling (AttnMVP). These models explicitly encode the $i_n \rightarrow a_n$ causal dependency while preserving the expressive power of Transformer-based sequence modeling.We evaluate our framework on large-scale product recommendation data from a major social network. Experimental results show that AttnLFA and AttnMVP consistently outperform interleaved baselines, achieving evaluation loss improvements of 0.29% and 0.80%, and significant gains in Normalized Entropy (NE). Crucially, these performance gains are accompanied by training time reductions of 23% and 12%, respectively. Our findings suggest that explicitly modeling item-action causality provides a superior design paradigm for scalable and efficient generative ranking.
Abstract（参考訳）: Generative Recommender Systems (GR)は、アイテムとアクショントークンをインターリーブすることで、シーケンス生成タスクとしてユーザー動作をモデル化する。シーケンスの長さを2倍にし、二次的オーバーヘッドを発生させ、アイテムと関連するアクションの間の因果関係を回復するために暗黙の注意を払っている。さらに,異種トークンを介在させることにより,トランスフォーマーは意味的に不整合な信号を切り離し,注意ノイズの増大と表現効率の低下につながる。本研究では, 現在のインターリーブ機構が類似性重み付け動作プールの非効率なプロキシとして機能することを実証する。これを解決するために,AttnLFA (Retent-based Late Fusion for Actions) とAttnMVP (Mixed Value Pooling) の2つの新しいアーキテクチャを導入する。これらのモデルは、Transformerベースのシーケンスモデリングの表現力を維持しながら、$i_n \rightarrow a_n$因果依存性を明示的に符号化する。実験結果から,AttnLFAとAttnMVPはインターリーブベースラインを一貫して上回り,0.29%と0.80%の評価損失改善を実現し,正規化エントロピー(NE)において有意な向上を示した。これらのパフォーマンス向上には,それぞれ23%と12%のトレーニング時間短縮が伴う。この結果から,アイテムアクション因果関係を明示的にモデル化することは,スケーラブルで効率的な生成的ランク付けに優れた設計パラダイムをもたらすことが示唆された。

論文の概要: Beyond Interleaving: Causal Attention Reformulations for Generative Recommender Systems

関連論文リスト