Fugu-MT 論文翻訳(概要): GReF: A Unified Generative Framework for Efficient Reranking via Ordered Multi-token Prediction

論文の概要: GReF: A Unified Generative Framework for Efficient Reranking via Ordered Multi-token Prediction

arxiv url: http://arxiv.org/abs/2510.25220v1
Date: Wed, 29 Oct 2025 06:54:42 GMT
ステータス: 翻訳完了
システム内更新日: 2025-10-30 15:50:45.18981
Title: GReF: A Unified Generative Framework for Efficient Reranking via Ordered Multi-token Prediction
Title（参考訳）: GReF: 順序付きマルチトークン予測による効率向上のための統合生成フレームワーク
Authors: Zhijie Lin, Zhuofeng Li, Chenglei Dai, Wentian Bao, Shuai Lin, Enyun Yu, Haoxiang Zhang, Liang Zhao,
Abstract要約: ランク付けは、項目間のリスト内相関をモデル化する上で重要な役割を果たす。最近の研究は2段階(ジェネレータ・評価器)パラダイムを踏襲している。本稿では,2つの主要な課題に対処するため,GReF(Unified Generative Efficient Re rank Framework)を提案する。
参考スコア（独自算出の注目度）: 12.254397628788647
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: In a multi-stage recommendation system, reranking plays a crucial role in modeling intra-list correlations among items. A key challenge lies in exploring optimal sequences within the combinatorial space of permutations. Recent research follows a two-stage (generator-evaluator) paradigm, where a generator produces multiple feasible sequences, and an evaluator selects the best one. In practice, the generator is typically implemented as an autoregressive model. However, these two-stage methods face two main challenges. First, the separation of the generator and evaluator hinders end-to-end training. Second, autoregressive generators suffer from inference efficiency. In this work, we propose a Unified Generative Efficient Reranking Framework (GReF) to address the two primary challenges. Specifically, we introduce Gen-Reranker, an autoregressive generator featuring a bidirectional encoder and a dynamic autoregressive decoder to generate causal reranking sequences. Subsequently, we pre-train Gen-Reranker on the item exposure order for high-quality parameter initialization. To eliminate the need for the evaluator while integrating sequence-level evaluation during training for end-to-end optimization, we propose post-training the model through Rerank-DPO. Moreover, for efficient autoregressive inference, we introduce ordered multi-token prediction (OMTP), which trains Gen-Reranker to simultaneously generate multiple future items while preserving their order, ensuring practical deployment in real-time recommender systems. Extensive offline experiments demonstrate that GReF outperforms state-of-the-art reranking methods while achieving latency that is nearly comparable to non-autoregressive models. Additionally, GReF has also been deployed in a real-world video app Kuaishou with over 300 million daily active users, significantly improving online recommendation quality.
Abstract（参考訳）: 多段階レコメンデーションシステムでは、項目間のリスト内相関をモデル化する上で、リランクが重要な役割を果たす。鍵となる課題は、置換の組合せ空間内で最適な列を探索することである。最近の研究は、2段階(ジェネレータ-評価器)パラダイムに従っており、ジェネレータは複数の実行可能なシーケンスを生成し、評価器は最適なシーケンスを選択する。実際には、ジェネレータは一般的に自己回帰モデルとして実装される。しかし、これらの2段階の手法は2つの大きな課題に直面している。まず、ジェネレータと評価器の分離はエンドツーエンドのトレーニングを妨げる。第2に、自己回帰発電機は推論効率に悩まされる。本稿では,2つの主要な課題に対処するため,統一生成効率向上フレームワーク(GReF)を提案する。具体的には、双方向エンコーダと動的自己回帰デコーダを備えた自己回帰生成器であるGen-Rerankerを導入し、因果リランクシーケンスを生成する。その後、高品質なパラメータ初期化のためのアイテム露光順序でGen-Rerankerを事前訓練する。エンドツーエンド最適化のためのトレーニング中にシーケンスレベルの評価を統合しながら評価器の必要性を解消するため,Rerank-DPOを用いてモデルを後学習する手法を提案する。さらに,効率的な自己回帰推論を行うために,Gen-Rerankerを訓練し,その順序を保ちながら複数の将来の項目を同時に生成し,リアルタイムレコメンデータシステムに実用的な展開を確実にする,順序付きマルチトークン予測(OMTP)を導入する。大規模なオフライン実験により、GReFは、非自己回帰モデルにほぼ匹敵するレイテンシを達成しながら、最先端のリグレードメソッドより優れていることが示された。さらにGReFは、毎日3億人以上のアクティブユーザーがいる現実世界のビデオアプリKuaishouにも展開され、オンラインレコメンデーションの品質が大幅に向上した。

論文の概要: GReF: A Unified Generative Framework for Efficient Reranking via Ordered Multi-token Prediction

関連論文リスト