Fugu-MT 論文翻訳(概要): Fine-grained Semantics Integration for Large Language Model-based Recommendation

論文の概要: Fine-grained Semantics Integration for Large Language Model-based Recommendation

arxiv url: http://arxiv.org/abs/2602.22632v1
Date: Thu, 26 Feb 2026 05:17:24 GMT
ステータス: 翻訳完了
システム内更新日: 2026-02-27 18:41:22.539006
Title: Fine-grained Semantics Integration for Large Language Model-based Recommendation
Title（参考訳）: 大規模言語モデルに基づくレコメンデーションのためのきめ細かいセマンティックス統合
Authors: Jiawen Feng, Xiaoyu Kong, Leheng Sheng, Bin Wu, Chao Yi, Feifang Yang, Xiang-Rong Sheng, Han Zhu, Xiang Wang, Jiancan Wu, Xiangnan He,
Abstract要約: token-level Semantics を LLM ベースの Recommender に組み込む TS-Rec を提案する。具体的には、(1)教師モデルによって抽出されたキーワードの事前学習された埋め込みに平均プーリングを適用してSIDトークンの埋め込みを初期化するSemantic-Aware Embedding Initialization (SA-Init)、(2)SIDシーケンス内の個々のトークンと対応するアイテムクラスタの共有セマンティックセマンティックアライメントを整列するToken-level Semantic Alignment (TS-Align)の2つのキーコンポーネントを含む。
参考スコア（独自算出の注目度）: 32.85855885749811
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Recent advances in Large Language Models (LLMs) have shifted in recommendation systems from the discriminative paradigm to the LLM-based generative paradigm, where the recommender autoregressively generates sequences of semantic identifiers (SIDs) for target items conditioned on historical interaction. While prevalent LLM-based recommenders have demonstrated performance gains by aligning pretrained LLMs between the language space and the SID space, modeling the SID space still faces two fundamental challenges: (1) Semantically Meaningless Initialization: SID tokens are randomly initialized, severing the semantic linkage between the SID space and the pretrained language space at start point, and (2) Coarse-grained Alignment: existing SFT-based alignment tasks primarily focus on item-level optimization, while overlooking the semantics of individual tokens within SID sequences.To address these challenges, we propose TS-Rec, which can integrate Token-level Semantics into LLM-based Recommenders. Specifically, TS-Rec comprises two key components: (1) Semantic-Aware embedding Initialization (SA-Init), which initializes SID token embeddings by applying mean pooling to the pretrained embeddings of keywords extracted by a teacher model; and (2) Token-level Semantic Alignment (TS-Align), which aligns individual tokens within the SID sequence with the shared semantics of the corresponding item clusters. Extensive experiments on two real-world benchmarks demonstrate that TS-Rec consistently outperforms traditional and generative baselines across all standard metrics. The results demonstrate that integrating fine-grained semantic information significantly enhances the performance of LLM-based generative recommenders.
Abstract（参考訳）: 近年のLLM(Large Language Models)の進歩は、識別パラダイムからLLMに基づく生成パラダイムに移行し、リコメンダは歴史的相互作用を前提としたターゲット項目のセマンティック識別子(SID)のシーケンスを自動回帰的に生成する。 1) 意味のない初期化: SIDトークンはランダムに初期化され、開始時点でSID空間と事前訓練された言語空間とのセマンティックリンクが切断される。(2) 粗粒調整: 既存のSFTベースのアライメントタスクは、主にSIDシーケンス内の個々のトークンのセマンティックスを見越しながら、アイテムレベルの最適化に重点を置いている。これらの課題に対処するため、TS-Recを提案する。具体的には、(1)教師モデルによって抽出されたキーワードの事前学習された埋め込みに平均プーリングを適用してSIDトークンの埋め込みを初期化するSemantic-Aware Embedding Initialization (SA-Init)、(2)SIDシーケンス内の個々のトークンと対応するアイテムクラスタの共有セマンティックセマンティックアライメントを整列するToken-level Semantic Alignment (TS-Align)の2つのキーコンポーネントを含む。 2つの実世界のベンチマークに関する大規模な実験は、TS-Recがすべての標準メトリクスで従来的および生成的ベースラインを一貫して上回っていることを示している。その結果, 微粒な意味情報の統合は, LLMに基づく生成レコメンデータの性能を著しく向上させることがわかった。

論文の概要: Fine-grained Semantics Integration for Large Language Model-based Recommendation

関連論文リスト