Fugu-MT Paper Translation (Abstract): The Pitfall of Scaling Up: Uncovering and Mitigating Popularity Bias Amplification in Scaling Transformer-based Recommenders

Paper overview: The Pitfall of Scaling Up: Uncovering and Mitigating Popularity Bias Amplification in Scaling Transformer-based Recommenders

arxiv url: http://arxiv.org/abs/2606.21911v1
Date: Sat, 20 Jun 2026 07:13:49 GMT
Status: Translation complete
Last updated in system: 2026-06-25 23:57:37.306647
Title: The Pitfall of Scaling Up: Uncovering and Mitigating Popularity Bias Amplification in Scaling Transformer-based Recommenders
Title (reference translation): The Pitfall of Scaling Up: Uncovering and Mitigating Popularity Bias Amplification in Scaling Transformer-based Recommenders
Authors: Weiqin Yang, Yue Pan, Chongming Gao, Sheng Zhou, Xiang Wang, Can Wang, Jiawei Chen
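Abstract summary: We identify a critical pitfall in scaling transformer-based sequential recommenders: increasing model size improves recommendation accuracy, but it simultaneously amplifies popularity bias. This bias drives systems to over-recommend popular items at the expense of niche ones.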
Reference score (independently computed attention score): 26.25851138178879
License: http://creativecommons.org/licenses/by-nc-nd/4.0/
Abstract: We identify a critical pitfall in scaling transformer-based sequential recommenders: while increasing model size improves recommendation accuracy, it simultaneously amplifies popularity bias. This bias drives systems to over-recommend popular items at the expense of niche ones, which not only undermines fairness but also degrades the broader ecosystem by reinforcing the Matthew effect and filter bubbles. Consequently, this bias amplification emerges as a fundamental obstacle to sustainable model scaling. Through comprehensive theoretical and empirical analyses, we uncover the root cause of this amplification. Our findings reveal that as model depth increases, the two core components of the transformer architecture, i.e., attention aggregation and feed-forward projections, synergistically induce severe spectral collapse in model predictions, which directly translates to the amplification of popularity bias. To address this challenge, we propose SPRINT (Scalable Popularity Regularization IN Transformers), which mitigates spectral collapse during scaling by constraining (i) the maximum column-sums of the attention score matrices and (ii) the spectral norms of the feed-forward parameters. Extensive experiments demonstrate that SPRINT significantly improves both accuracy and long-tail fairness. Crucially, it yields more favorable scaling behaviors when expanding model sizes from 0.05M to 0.34B parameters. The code is available at https://github.com/Tiny-Snow/GenRec.
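Abstract (reference translation): We identify a critical pitfall in scaling transformer-based sequential recommenders: increasing model size improves recommendation accuracy, but it simultaneously amplifies popularity bias. This bias leads systems to over-recommend popular items at the expense of niche ones, which not only undermines fairness but also degrades the broader ecosystem by reinforcing the Matthew effect and filter bubbles. As a result, this bias amplification emerges as a fundamental obstacle to sustainable model scaling. Through comprehensive theoretical and empirical analyses, we uncover the root cause of this amplification. We find that as model depth increases, the two core components of the transformer architecture, attention aggregation and feed-forward projections, synergistically induce severe spectral collapse in model predictions, which translates directly into the amplification of popularity bias. To address this challenge, we propose SPRINT (Scalable Popularity Regularization IN Transformers), which mitigates spectral collapse during scaling by constraining (i) the maximum column-sums of the attention score matrices and (ii) the spectral norms of the feed-forward parameters. Extensive experiments show that SPRINT significantly improves both accuracy and long-tail fairness. Importantly, it yields more favorable scaling behavior when model size is expanded from 0.05M to 0.34B parameters. The code is available at https://github.com/Tiny-Snow/GenRec.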
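The two quantities that the abstract says SPRINT constrains can be computed in a few lines. The sketch below is only an illustration and is not taken from the paper or its repository. It measures the maximum column-sum of a row-softmax attention matrix and estimates the spectral norm of a weight matrix by power iteration. The function names, the hinge-style penalty, and the thresholds `max_col` and `max_sigma` are all hypothetical, since the abstract does not say how the constraints are enforced.

```python
import math
import random


def softmax_rows(scores):
    """Row-wise softmax, so every row of the result sums to 1."""
    out = []
    for row in scores:
        m = max(row)  # subtract the row maximum for numerical stability
        exps = [math.exp(x - m) for x in row]
        total = sum(exps)
        out.append([e / total for e in exps])
    return out


def max_column_sum(matrix):
    """Largest column sum; for non-negative entries this is the induced 1-norm."""
    return max(sum(col) for col in zip(*matrix))


def spectral_norm(weight, iters=200, seed=0):
    """Estimate the largest singular value of `weight` by power iteration."""
    rng = random.Random(seed)
    n_rows, n_cols = len(weight), len(weight[0])
    # Random start vector, so it is almost surely not orthogonal
    # to the top right singular vector.
    v = [rng.gauss(0.0, 1.0) for _ in range(n_cols)]
    norm = math.sqrt(sum(x * x for x in v))
    v = [x / norm for x in v]
    sigma = 0.0
    for _ in range(iters):
        u = [sum(w * x for w, x in zip(row, v)) for row in weight]  # W v
        sigma = math.sqrt(sum(x * x for x in u))  # ||W v|| with ||v|| = 1
        z = [sum(weight[i][j] * u[i] for i in range(n_rows))
             for j in range(n_cols)]  # W^T (W v)
        norm = math.sqrt(sum(x * x for x in z))
        if norm == 0.0:
            return 0.0  # W v vanished, e.g. for an all-zero matrix
        v = [x / norm for x in z]
    return sigma


def constraint_penalty(attn, weight, max_col=1.5, max_sigma=1.0):
    """Hypothetical hinge penalty: zero while both quantities stay under
    their (assumed) thresholds, growing linearly once they exceed them."""
    col_excess = max(0.0, max_column_sum(attn) - max_col)
    sigma_excess = max(0.0, spectral_norm(weight) - max_sigma)
    return col_excess + sigma_excess
```

As a usage example, uniform attention over three positions has a maximum column-sum of 1, whereas attention in which every query concentrates on the same key pushes that column's sum toward the sequence length. That concentration is the kind of pattern a column-sum constraint would limit.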


Related papers