Fugu-MT 論文翻訳(概要): ENCODE: Breaking the Trade-Off Between Performance and Efficiency in Long-Term User Behavior Modeling

論文の概要: ENCODE: Breaking the Trade-Off Between Performance and Efficiency in Long-Term User Behavior Modeling

arxiv url: http://arxiv.org/abs/2508.13567v1
Date: Tue, 19 Aug 2025 06:58:21 GMT
ステータス: 翻訳完了
システム内更新日: 2025-08-20 15:36:31.825683
Title: ENCODE: Breaking the Trade-Off Between Performance and Efficiency in Long-Term User Behavior Modeling
Title（参考訳）: ENCODE: 長期ユーザー行動モデリングにおけるパフォーマンスと効率のトレードオフを打破する
Authors: Wenji Zhou, Yuhang Zheng, Yinfu Feng, Yunan Ye, Rong Xiao, Long Chen, Xiaosong Yang, Jun Xiao,
Abstract要約: EfficieNt Clustering を用いた twO-stage interest moDEling (ENCODE) と呼ばれる2段階の効率的な長期配列モデリング手法を提案する。オフラインの抽出段階では、ENCODEは全動作シーケンスをクラスタリングし、正確な関心を抽出する。オンライン推論の段階では、ECNODEは、ターゲットアイテムとの関連性を予測するために、既成のユーザ関心を生かしている。
参考スコア（独自算出の注目度）: 12.963611514800656
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Long-term user behavior sequences are a goldmine for businesses to explore users' interests to improve Click-Through Rate. However, it is very challenging to accurately capture users' long-term interests from their long-term behavior sequences and give quick responses from the online serving systems. To meet such requirements, existing methods "inadvertently" destroy two basic requirements in long-term sequence modeling: R1) make full use of the entire sequence to keep the information as much as possible; R2) extract information from the most relevant behaviors to keep high relevance between learned interests and current target items. The performance of online serving systems is significantly affected by incomplete and inaccurate user interest information obtained by existing methods. To this end, we propose an efficient two-stage long-term sequence modeling approach, named as EfficieNt Clustering based twO-stage interest moDEling (ENCODE), consisting of offline extraction stage and online inference stage. It not only meets the aforementioned two basic requirements but also achieves a desirable balance between online service efficiency and precision. Specifically, in the offline extraction stage, ENCODE clusters the entire behavior sequence and extracts accurate interests. To reduce the overhead of the clustering process, we design a metric learning-based dimension reduction algorithm that preserves the relative pairwise distances of behaviors in the new feature space. While in the online inference stage, ENCODE takes the off-the-shelf user interests to predict the associations with target items. Besides, to further ensure the relevance between user interests and target items, we adopt the same relevance metric throughout the whole pipeline of ENCODE. The extensive experiment and comparison with SOTA have demonstrated the effectiveness and efficiency of our proposed ENCODE.
Abstract（参考訳）: 長期ユーザー行動シーケンスは、Click-Through Rateを改善するためにユーザーの興味を探求するビジネスにとって、金鉱である。しかし、ユーザの長期的関心を長期的行動シーケンスから正確に把握し、オンラインサービスシステムから素早く応答することは非常に困難である。このような要件を満たすために、既存の手法は、長期的シーケンスモデリングにおける2つの基本的な要件を「不注意に」破壊する: R1) シーケンス全体を最大限に活用して情報を維持する; R2) 学習対象項目と学習対象項目の関連性を維持するために、最も関連性の高い行動から情報を抽出する。オンラインサービスシステムの性能は,既存手法による不完全かつ不正確なユーザ関心情報の影響が大きい。そこで本研究では, オフライン抽出段階とオンライン推論段階からなる, EfficieNt Clustering に基づく twO-stage interest moDEling (ENCODE) と呼ばれる2段階の効率的な長期シーケンスモデリング手法を提案する。上記の2つの基本的な要件を満たすだけでなく、オンラインサービスの効率性と精度の望ましいバランスも達成します。具体的には、オフラインの抽出段階では、ENCODEは全動作シーケンスをクラスタリングし、正確な関心を抽出する。クラスタリングプロセスのオーバーヘッドを軽減するため,新しい特徴空間における相対的な対の行動距離を保存するメトリック学習に基づく次元削減アルゴリズムを設計する。オンライン推論の段階では、ECNODEは、ターゲットアイテムとの関連性を予測するために、既成のユーザ関心を生かしている。さらに,ユーザの関心事と対象項目の関連性をさらに確保するために,ENCODEのパイプライン全体を通して,同じ関連度基準を採用する。提案手法の有効性と有効性を実証し,SOTAとの比較を行った。

論文の概要: ENCODE: Breaking the Trade-Off Between Performance and Efficiency in Long-Term User Behavior Modeling

関連論文リスト