Fugu-MT 論文翻訳(概要): GRank: Towards Target-Aware and Streamlined Industrial Retrieval with a Generate-Rank Framework

論文の概要: GRank: Towards Target-Aware and Streamlined Industrial Retrieval with a Generate-Rank Framework

arxiv url: http://arxiv.org/abs/2510.15299v1
Date: Fri, 17 Oct 2025 04:15:09 GMT
ステータス: 翻訳完了
システム内更新日: 2025-10-20 20:17:34.467573
Title: GRank: Towards Target-Aware and Streamlined Industrial Retrieval with a Generate-Rank Framework
Title（参考訳）: GRank:Generate-Rankフレームワークによるターゲット認識と合理化産業検索を目指して
Authors: Yijia Sun, Shanshan Huang, Zhiyuan Guan, Qiang Luo, Ruiming Tang, Kun Gai, Guorui Zhou,
Abstract要約: 産業規模のレコメンデータシステムはカスケードパイプラインに依存しており、検索段階では何十億ものアイテムから設定されたハイリコール候補を厳格なレイテンシで返さなければならない。本稿では,ユーザ中心の検索とターゲット認識学習をシームレスに統一する新しい構造化インデックスフリー検索パラダイムであるGRankを提案する。
参考スコア（独自算出の注目度）: 47.25361445845229
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Industrial-scale recommender systems rely on a cascade pipeline in which the retrieval stage must return a high-recall candidate set from billions of items under tight latency. Existing solutions ei- ther (i) suffer from limited expressiveness in capturing fine-grained user-item interactions, as seen in decoupled dual-tower architectures that rely on separate encoders, or generative models that lack precise target-aware matching capabilities, or (ii) build structured indices (tree, graph, quantization) whose item-centric topologies struggle to incorporate dynamic user preferences and incur prohibitive construction and maintenance costs. We present GRank, a novel structured-index-free retrieval paradigm that seamlessly unifies target-aware learning with user-centric retrieval. Our key innovations include: (1) A target-aware Generator trained to perform personalized candidate generation via GPU-accelerated MIPS, eliminating semantic drift and maintenance costs of structured indexing; (2) A lightweight but powerful Ranker that performs fine-grained, candidate-specific inference on small subsets; (3) An end-to-end multi-task learning framework that ensures semantic consistency between generation and ranking objectives. Extensive experiments on two public benchmarks and a billion-item production corpus demonstrate that GRank improves Recall@500 by over 30% and 1.7$\times$ the P99 QPS of state-of-the-art tree- and graph-based retrievers. GRank has been fully deployed in production in our recommendation platform since Q2 2025, serving 400 million active users with 99.95% service availability. Online A/B tests confirm significant improvements in core engagement metrics, with Total App Usage Time increasing by 0.160% in the main app and 0.165% in the Lite version.
Abstract（参考訳）: 産業規模のレコメンデータシステムはカスケードパイプラインに依存しており、検索段階では何十億ものアイテムから設定されたハイリコール候補を厳格なレイテンシで返さなければならない。既存の解 ei-ther i) 個別のエンコーダに依存する分離された二重塔アーキテクチャや、正確な目標対応能力に欠ける生成モデルに見られるような、きめ細かなユーザ・イテム相互作用の捕捉において、限定的な表現性に苦しむこと。 (i) アイテム中心のトポロジが動的ユーザの好みを取り入れたり、禁忌な建設や保守コストを発生させるのに苦労する構造化インデックス(ツリー、グラフ、量子化)を構築する。本稿では,ユーザ中心の検索とターゲット認識学習をシームレスに統一する新しい構造化インデックスフリー検索パラダイムであるGRankを提案する。キーとなるイノベーションは,(1)GPUアクセラレーションされたMIPSを用いてパーソナライズされた候補生成を訓練し,構造化インデックスのセマンティックドリフトとメンテナンスコストを低減させるターゲット・アウェア・ジェネレータ,(2)小さなサブセットに対してきめ細かな候補固有推論を行う軽量で強力なランサー,(3)生成とランキングのセマンティック一貫性を保証するエンドツーエンドのマルチタスク学習フレームワークである。 2つの公開ベンチマークと10億のプロダクションコーパスに関する大規模な実験は、GRankがRecall@500を30%以上改善し、1.7$\times$P99 QPSで最先端のツリーとグラフベースのレトリバーを開発したことを示している。 GRankは2025年の第2四半期以降、当社のレコメンデーションプラットフォームで完全に運用されている。オンラインA/Bテストでは、コアエンゲージメント指標が大幅に改善され、Total App Usage Timeがメインアプリで0.160%、Liteバージョンで0.165%増加した。

論文の概要: GRank: Towards Target-Aware and Streamlined Industrial Retrieval with a Generate-Rank Framework

関連論文リスト