Fugu-MT 論文翻訳(概要): Architecture is All You Need: Improving LLM Recommenders by Dropping the Text

論文の概要: Architecture is All You Need: Improving LLM Recommenders by Dropping the Text

arxiv url: http://arxiv.org/abs/2506.15833v1
Date: Wed, 18 Jun 2025 19:18:49 GMT
ステータス: 翻訳完了
システム内更新日: 2025-06-23 19:00:04.819772
Title: Architecture is All You Need: Improving LLM Recommenders by Dropping the Text
Title（参考訳）: LLMレコメンダをテキストをドロップすることで改善するアーキテクチャ
Authors: Kevin Foley, Shaghayegh Agah, Kavya Priyanka Kakinada,
Abstract要約: 本稿では,大規模言語モデル (LLM) のアーキテクチャを用いて,層数と次元を削減したレコメンデータモデルを提案する。この単純化されたアプローチは、従来のシーケンシャルレコメンデータモデルとPLMベースのレコメンデータモデルの両方を大幅に上回っていることがわかった。
参考スコア（独自算出の注目度）: 0.40964539027092917
License: http://creativecommons.org/licenses/by/4.0/
Abstract: In recent years, there has been an explosion of interest in the applications of large pre-trained language models (PLMs) to recommender systems, with many studies showing strong performance of PLMs on common benchmark datasets. PLM-based recommender models benefit from flexible and customizable prompting, an unlimited vocabulary of recommendable items, and general ``world knowledge'' acquired through pre-training on massive text corpora. While PLM-based recommenders show promise in settings where data is limited, they are hard to implement in practice due to their large size and computational cost. Additionally, fine-tuning PLMs to improve performance on collaborative signals may degrade the model's capacity for world knowledge and generalizability. We propose a recommender model that uses the architecture of large language models (LLMs) while reducing layer count and dimensions and replacing the text-based subword tokenization of a typical LLM with discrete tokens that uniquely represent individual content items. We find that this simplified approach substantially outperforms both traditional sequential recommender models and PLM-based recommender models at a tiny fraction of the size and computational complexity of PLM-based models. Our results suggest that the principal benefit of LLMs in recommender systems is their architecture, rather than the world knowledge acquired during extensive pre-training.
Abstract（参考訳）: 近年,大規模事前学習型言語モデル (PLM) を推薦システムに適用することへの関心が高まっている。 PLMベースのレコメンデータモデルは、柔軟でカスタマイズ可能なプロンプト、推奨項目の無制限語彙、および大量のテキストコーパスの事前学習を通じて取得される一般的な 'world knowledge' の恩恵を受ける。 PLMベースのレコメンデータは、データが制限されている設定でpromiseを示すが、大きなサイズと計算コストのために実際に実装するのは困難である。さらに、協調信号の性能を向上させるための微調整 PLM は、世界の知識と一般化可能性に対するモデルの能力を低下させる可能性がある。本稿では,ラージ言語モデル(LLM)のアーキテクチャを用いて,レイヤ数と次元を削減し,典型的なLCMのテキストベースのサブワードトークン化を,個別のコンテンツ項目を一意に表現する離散トークンに置き換えるレコメンデータモデルを提案する。この単純化されたアプローチは、従来のシーケンシャルレコメンダモデルとPLMベースのレコメンダモデルの両方を、PLMベースのモデルのサイズと計算複雑性のごく一部で大幅に上回っていることがわかった。この結果から,LLMのアーキテクチャは,広範囲な事前学習において得られる知識よりも,LLMの持つメリットが重要であることが示唆された。

論文の概要: Architecture is All You Need: Improving LLM Recommenders by Dropping the Text

関連論文リスト