Fugu-MT 論文翻訳(概要): TokenMinds: Pretrained User Tokens and Embeddings for User Understanding in Large Recommender Systems

論文の概要: TokenMinds: Pretrained User Tokens and Embeddings for User Understanding in Large Recommender Systems

arxiv url: http://arxiv.org/abs/2606.25147v1
Date: Tue, 23 Jun 2026 20:20:37 GMT
ステータス: 翻訳完了
システム内更新日: 2026-06-25 17:05:30.138284
Title: TokenMinds: Pretrained User Tokens and Embeddings for User Understanding in Large Recommender Systems
Title（参考訳）: TokenMinds: 大規模レコメンダシステムにおけるユーザ理解のための事前トレーニングされたユーザトークンと埋め込み
Authors: Qingyun Liu, Bo Yan, Yang Liu, Yuji Roh, Ekansh Sharma, Likang Yin, Emma Olowo, Min-hsuan Tsai, Yuxuan Li, Diego Uribe, Saksham Aggarwal, Siqi Wu, Yuan Hao, Vikas Kedigehalli, Lukasz Heldt, Lichan Hong, Li Wei, Xinyang Yi,
Abstract要約: TokenMindsはPLUMフレームワークをアイテム検索からユーザモデリングまで拡張する産業規模のシステムである。個別のSIDベースのユーザトークンと、事前訓練されたLLMから適応したエンコーダデコーダアーキテクチャにより、密なユーザ埋め込みの両方を生成する。
参考スコア（独自算出の注目度）: 22.086361560657593
License: http://creativecommons.org/licenses/by/4.0/
Abstract: User modeling in industrial recommender systems typically produces dense embeddings, which suffer from representational constraints inherent to fixed-dimensional vectors. An emerging alternative for discrete user representation -- using LLMs to generate text-based user tokens -- captures topical co-occurrences rather than deep sequential behavior dynamics and produces outputs that are difficult to ground to item attributes. Meanwhile, Semantic ID (SID) based item tokenization has proven effective for improving generalization in generative recommendation, yet discrete SID-based representations for users remain largely unexplored. We propose TokenMinds, an industrial-scale system that extends the PLUM framework from item retrieval to user modeling, generating both discrete SID-based user tokens and dense user embeddings via an encoder-decoder architecture adapted from pre-trained LLMs. This dual-output design provides the complementary benefits of discrete, semantically grounded user representations while maintaining compatibility with existing downstream models that rely on dense embeddings. Additionally, the shared SID vocabulary naturally extends to cross-scenario modeling: by unifying long-form and short-form video behaviors into a single model, we substantially reduce training and serving costs. We validate TokenMinds through extensive offline experiments and live launches on multiple YouTube surfaces, served on full user traffic (billions of users) via an asynchronous infrastructure that decouples representation generation from downstream scoring. Focusing on ranking as the primary downstream use case, our results confirm the practical viability of SID-based user tokens at industrial scale and demonstrate that tokens and dense embeddings provide complementary value across different production ranking systems.
Abstract（参考訳）: 産業レコメンデータシステムにおけるユーザモデリングは、一般に、固定次元ベクトルに固有の表現的制約に悩まされる密着な埋め込みを生成する。 LLMを使ってテキストベースのユーザトークンを生成する、離散的なユーザ表現の新たな代替手段は、深いシーケンシャルな振る舞いのダイナミクスではなく、トピックの共起をキャプチャし、アイテム属性をグラウンディングするのが難しいアウトプットを生成する。一方、セマンティックID(SID)に基づくアイテムトークン化は、ジェネレーティブレコメンデーションにおける一般化を改善するのに有効であることが証明されている。 TokenMindsは、PLUMフレームワークをアイテム検索からユーザモデリングまで拡張し、事前学習されたLLMから適応したエンコーダ・デコーダアーキテクチャを用いて、離散SIDベースのユーザトークンと高密度ユーザ埋め込みの両方を生成する産業規模のシステムである。このデュアルアウトプット設計は、密接な埋め込みに依存する既存の下流モデルとの互換性を維持しながら、個別でセマンティックな基底を持つユーザ表現の相補的な利点を提供する。さらに,共有SID語彙は,長文・短文の動画行動を単一モデルに統一することにより,トレーニングやサービスコストを大幅に削減する。 TokenMindsを大規模なオフライン実験を通じて検証し、複数のYouTubeサーフェスでライブローンチし、ダウンストリームスコアリングから表現生成を分離する非同期インフラストラクチャを介して、完全なユーザトラフィック(ユーザ数億)を提供する。本研究は,産業規模でのSIDベースのユーザトークンの実用性を確認するとともに,各種製品ランキングシステムにおいて,トークンと密着型埋め込みが相補的価値を提供することを示す。

論文の概要: TokenMinds: Pretrained User Tokens and Embeddings for User Understanding in Large Recommender Systems

関連論文リスト