Fugu-MT 論文翻訳(概要): Request-Only Optimization for Recommendation Systems

論文の概要: Request-Only Optimization for Recommendation Systems

arxiv url: http://arxiv.org/abs/2508.05640v2
Date: Wed, 13 Aug 2025 16:13:09 GMT
ステータス: 翻訳完了
システム内更新日: 2025-08-14 16:17:42.784016
Title: Request-Only Optimization for Recommendation Systems
Title（参考訳）: レコメンデーションシステムのリクエストオンリー最適化
Authors: Liang Guo, Wei Li, Lucy Liao, Huihui Cheng, Rui Zhang, Yu Shi, Yueming Wang, Yanzun Huang, Keke Zhai, Pengchao Wang, Timothy Shi, Xuan Cao, Shengzhi Wang, Renqin Cai, Zhaojie Gong, Omkar Vichare, Rui Jian, Leon Gao, Shiyan Deng, Xingyu Liu, Xiong Zhang, Fu Li, Wenlei Xie, Bin Wen, Rui Li, Xing Liu, Jiaqi Zhai,
Abstract要約: 産業規模のDeep Learning Recommendation Models (DLRMs) は毎日何十億ものユーザにサービスを提供している。 DLRMは、例えば1兆の浮動小数点演算(TFLOP)など、前例のない複雑さまで拡張されている。このスケールは、大量のトレーニングデータとともに、これらの複雑なレコメンデーションシステムの品質を効率的に改善するために、新しいストレージとトレーニングアルゴリズムを必要とします。
参考スコア（独自算出の注目度）: 31.503856495990977
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Deep Learning Recommendation Models (DLRMs) represent one of the largest machine learning applications on the planet. Industry-scale DLRMs are trained with petabytes of recommendation data to serve billions of users every day. To utilize the rich user signals in the long user history, DLRMs have been scaled up to unprecedented complexity, up to trillions of floating-point operations (TFLOPs) per example. This scale, coupled with the huge amount of training data, necessitates new storage and training algorithms to efficiently improve the quality of these complex recommendation systems. In this paper, we present a Request-Only Optimizations (ROO) training and modeling paradigm. ROO simultaneously improves the storage and training efficiency as well as the model quality of recommendation systems. We holistically approach this challenge through co-designing data (i.e., request-only data), infrastructure (i.e., request-only based data processing pipeline), and model architecture (i.e., request-only neural architectures). Our ROO training and modeling paradigm treats a user request as a unit of the training data. Compared with the established practice of treating a user impression as a unit, our new design achieves native feature deduplication in data logging, consequently saving data storage. Second, by de-duplicating computations and communications across multiple impressions in a request, this new paradigm enables highly scaled-up neural network architectures to better capture user interest signals, such as Generative Recommenders (GRs) and other request-only friendly architectures.
Abstract（参考訳）: Deep Learning Recommendation Models (DLRM)は、地球上で最大の機械学習アプリケーションの一つである。産業規模のDLRMは、毎日数十億のユーザにサービスを提供するために、ペタバイトのレコメンデーションデータでトレーニングされています。長いユーザ履歴の中でリッチなユーザ信号を利用するために、DLRMは前例のないほど複雑になり、例えば1兆の浮動小数点演算(TFLOP)まで拡張されてきた。このスケールは、大量のトレーニングデータとともに、これらの複雑なレコメンデーションシステムの品質を効率的に改善するために、新しいストレージとトレーニングアルゴリズムを必要とします。本稿では、リクエストオンリー最適化(ROO)トレーニングおよびモデリングパラダイムを提案する。 ROOは、ストレージとトレーニングの効率とレコメンデーションシステムのモデル品質を同時に改善する。データ(すなわち、要求のみのデータ)、インフラストラクチャ(すなわち、要求のみのデータ処理パイプライン)、モデルアーキテクチャ(すなわち、要求のみのデータアーキテクチャ)を共同設計することで、この課題に全力で取り組みます。我々のROOトレーニングおよびモデリングパラダイムは、ユーザ要求をトレーニングデータの単位として扱います。ユーザインプレッションを単位として扱うという確立した慣行と比較すると,新しい設計はデータロギングにおけるネイティブな特徴重複を実現し,データストレージを節約します。第二に、リクエスト内の複数のインプレッションにまたがる計算と通信の非重複化によって、この新しいパラダイムは、高スケールのニューラルネットワークアーキテクチャによって、ジェネレーティブレコメンダ(GR)やその他のリクエスト専用のフレンドリなアーキテクチャといった、ユーザの関心信号をよりよくキャプチャすることを可能にする。

論文の概要: Request-Only Optimization for Recommendation Systems

関連論文リスト