Fugu-MT 論文翻訳(概要): High Fidelity Textual User Representation over Heterogeneous Sources via Reinforcement Learning

論文の概要: High Fidelity Textual User Representation over Heterogeneous Sources via Reinforcement Learning

arxiv url: http://arxiv.org/abs/2602.07333v1
Date: Sat, 07 Feb 2026 03:27:55 GMT
ステータス: 翻訳完了
システム内更新日: 2026-02-10 20:26:24.572275
Title: High Fidelity Textual User Representation over Heterogeneous Sources via Reinforcement Learning
Title（参考訳）: 強化学習による異種音源上の高忠実度テキストユーザ表現
Authors: Rajat Arora, Ye Tao, Jianqiang Shen, Ping Liu, Muchen Wu, Qianqi Shen, Benjamin Le, Fedor Borisyuk, Jingwei Wu, Wenjing Zhang,
Abstract要約: 本稿では,各メンバーに統一されたテキスト表現を合成するための新しい強化学習(RL)フレームワークを提案する。我々のアプローチは、暗黙のユーザエンゲージメント信号(例えば、クリック、適用)を、有能な情報を抽出する主要な報酬として活用する。この作業は、解釈可能なユーザ表現を構築するための実用的でラベリングフリーでスケーラブルなソリューションを提供する。
参考スコア（独自算出の注目度）: 7.091225956788446
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Effective personalization on large-scale job platforms requires modeling members based on heterogeneous textual sources, including profiles, professional data, and search activity logs. As recommender systems increasingly adopt Large Language Models (LLMs), creating unified, interpretable, and concise representations from heterogeneous sources becomes critical, especially for latency-sensitive online environments. In this work, we propose a novel Reinforcement Learning (RL) framework to synthesize a unified textual representation for each member. Our approach leverages implicit user engagement signals (e.g., clicks, applies) as the primary reward to distill salient information. Additionally, the framework is complemented by rule-based rewards that enforce formatting and length constraints. Extensive offline experiments across multiple LinkedIn products, one of the world's largest job platforms, demonstrate significant improvements in key downstream business metrics. This work provides a practical, labeling-free, and scalable solution for constructing interpretable user representations that are directly compatible with LLM-based systems.
Abstract（参考訳）: 大規模ジョブプラットフォーム上での効果的なパーソナライズには、プロファイル、専門データ、検索アクティビティログを含む異種テキストソースに基づくモデリングメンバーが必要である。推薦システムがLarge Language Models (LLMs) を採用するにつれて、特に遅延に敏感なオンライン環境において、異種ソースからの統一的、解釈可能、簡潔な表現が重要となる。本研究では,各メンバーに統一されたテキスト表現を合成するための新しい強化学習(RL)フレームワークを提案する。当社のアプローチでは、暗黙のユーザエンゲージメント信号(例えば、クリック、適用)を、有能な情報を抽出する主要な報酬として活用しています。さらに、このフレームワークは、フォーマットと長さの制約を強制するルールベースの報酬によって補完される。世界最大のジョブプラットフォームのひとつである複数のLinkedIn製品に対する大規模なオフライン実験は、重要なダウンストリームビジネスメトリクスを大幅に改善したことを示している。この作業は、LCMベースのシステムと直接互換性のある解釈可能なユーザ表現を構築するための実用的でラベリングフリーでスケーラブルなソリューションを提供する。

論文の概要: High Fidelity Textual User Representation over Heterogeneous Sources via Reinforcement Learning

関連論文リスト