Fugu-MT Paper Translation (Abstract): FLUID: From Ephemeral IDs to Multimodal Semantic Codes for Industrial-Scale Livestreaming Recommendation

Paper overview: FLUID: From Ephemeral IDs to Multimodal Semantic Codes for Industrial-Scale Livestreaming Recommendation

arxiv url: http://arxiv.org/abs/2605.21832v2
Date: Tue, 26 May 2026 21:00:35 GMT
Status: Translation complete
Last updated in system: 2026-05-28 17:38:54.860836
Title: FLUID: From Ephemeral IDs to Multimodal Semantic Codes for Industrial-Scale Livestreaming Recommendation
Authors: Xinhang Yuan, Zexi Huang, Anjia Cao, Xudong Lu, Zikai Wang, Penghao Zhou, Chang Liu, Wentao Guo, Qinglei Wang,
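Abstract summary: FLUID is a framework for retiring the candidate-side item ID from a production-scale livestreaming ranker. It produces discrete hierarchical semantic codes, called LUCID, for content-based item characterization. It is deployed on the authors' industrial livestreaming recommenders, which have a combined user base of over one billion globally.
Reference score (attention score computed independently by this site): 18.833195310715126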
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Modern recommender systems rely heavily on ID-based collaborative filtering: each item is represented by a unique ID embedding that accumulates collaborative signals from user interactions. Livestreaming recommendation, however, faces a unique challenge in this paradigm: a live room typically broadcasts for only tens of minutes, so its item ID remains poorly learned in a persistent cold-start state and ID-centric ranking models fail to generalize. We present FLUID, the first framework to fully retire the candidate-side item ID from a production-scale livestreaming ranker. FLUID introduces a cross-domain multimodal encoder, jointly trained on short videos and livestreams, to produce discrete hierarchical semantic codes, called LUCID, for content-based item characterization. To adapt the ranker to LUCID, FLUID further employs a staged warmup scheme: it first incorporates cold, slice-level LUCID as an independent token alongside the ID embedding, and then replaces the ID embedding with warm, room-level LUCID before online incremental training. Deployed on our industrial livestreaming recommenders with a cross-platform combined user base of over one billion globally, FLUID delivers significant online gains of +0.55% Quality Watch Duration, +2.05% Cold-Start Room Views, and +0.05% Active Hours.
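The abstract does not say how the discrete hierarchical codes are derived from the encoder output. As a minimal sketch only, assuming residual quantization (a common way to build coarse-to-fine semantic codes, not confirmed as the authors' method), the idea can be illustrated as follows. The codebooks, embeddings, and function names here are hypothetical toy values, not taken from the paper.

```python
def nearest(codebook, vec):
    """Return the index of the codebook entry closest to vec (squared L2)."""
    best_idx, best_dist = 0, float("inf")
    for idx, entry in enumerate(codebook):
        dist = sum((a - b) ** 2 for a, b in zip(vec, entry))
        if dist < best_dist:
            best_idx, best_dist = idx, dist
    return best_idx


def residual_quantize(embedding, codebooks):
    """Map a content embedding to one code index per level, coarse to fine."""
    residual = list(embedding)
    codes = []
    for codebook in codebooks:
        idx = nearest(codebook, residual)
        codes.append(idx)
        # Subtract the chosen entry so the next level encodes what is left.
        residual = [r - c for r, c in zip(residual, codebook[idx])]
    return codes


# Hypothetical toy codebooks: 3 levels, 4 entries each, 2-dim embeddings.
codebooks = [
    [[s, 0.0], [0.0, s], [-s, 0.0], [0.0, -s]]
    for s in (1.0, 0.5, 0.25)
]

# Two hypothetical live rooms with similar content embeddings.
codes_a = residual_quantize([0.9, 0.3], codebooks)
codes_b = residual_quantize([0.8, 0.45], codebooks)
print(codes_a, codes_b)  # [0, 1, 3] [0, 1, 2]
```

In this toy setup the two rooms share the coarse prefix `(0, 1)` and differ only at the finest level. That shared prefix is what would let a newly started room reuse parameters learned from similar content, instead of depending on a freshly created ID embedding.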


Related papers