Fugu-MT 論文翻訳(概要): Efficient Generative Retrieval for E-commerce Search with Semantic Cluster IDs and Expert-Guided RL

論文の概要: Efficient Generative Retrieval for E-commerce Search with Semantic Cluster IDs and Expert-Guided RL

arxiv url: http://arxiv.org/abs/2605.14434v1
Date: Thu, 14 May 2026 06:27:46 GMT
ステータス: 翻訳完了
システム内更新日: 2026-05-15 21:45:34.661266
Title: Efficient Generative Retrieval for E-commerce Search with Semantic Cluster IDs and Expert-Guided RL
Title（参考訳）: セマンティッククラスタIDとエキスパートガイドRLを用いたEコマース検索の効率的な生成検索
Authors: Jianbo Zhu, Xing Fang, Jing Wang, Mingmin Jin, Bokang Wang, Guangxin Song, Zhenyu Xie, Junjie Bai,
Abstract要約: 生成検索は、断片化された多段階検索プロセスを単一のエンドツーエンドモデルに統一することで、有望な代替手段を提供する。本稿では,実世界のリコールシナリオに適した検索フレームワークを提案し,生成的検索をエンド・ツー・エンドの置き換えではなく,リコール・ステージ・サプリメントとして位置付ける。
参考スコア（独自算出の注目度）: 8.046510111008638
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Generative retrieval offers a promising alternative by unifying the fragmented multi-stage retrieval process into a single end-to-end model. However, its practical adoption in industrial e-commerce search remains challenging, given the massive and dynamic product catalogs, strict latency requirements, and the need to align retrieval with downstream ranking goals. In this work, we propose a retrieval framework tailored for real-world recall scenarios, positioning generative retrieval as a recall-stage supplement rather than an end-to-end replacement. Our method, CQ-SID (Category-and-Query constrained Semantic ID), employs category-aware and query-item contrastive learning along with Residual Quantized VAEs to encode items into hierarchical semantic cluster identifiers, significantly reducing beam search complexity. Additionally, we develop EG-GRPO (Expert-Guided Group Relative Policy Optimization), a reinforcement learning approach that aligns generative recall with downstream ranking under sparse rewards by injecting ground-truth samples to stabilize training. Offline experiments on TmallAPP search logs show that CQ-SID achieves up to 26.76% and 11.11% relative gains in semantic and personalized click hitrate over RQ-VAE baselines, while halving beam search size. EG-GRPO further improves multi-objective performance. Online A/B tests confirm gains in GMV (+1.15%) and UCTCVR (+0.40%). The generative recall channel now contributes substantially in production, accounting for over 50.25% of exposures, 58.96% of clicks, and 72.63% of purchases, demonstrating a viable path for deploying generative retrieval in real-world e-commerce systems.
Abstract（参考訳）: 生成検索は、断片化された多段階検索プロセスを単一のエンドツーエンドモデルに統一することで、有望な代替手段を提供する。しかし、大規模でダイナミックな製品カタログ、厳格なレイテンシ要件、検索を下流ランキングの目標と整合させる必要性を考えれば、産業用eコマースサーチへの実践的な採用は依然として困難である。本研究では,実世界のリコールシナリオに適した検索フレームワークを提案し,生成的検索をエンド・ツー・エンドの置き換えではなく,リコール・ステージ・サプリメントとして位置づける。我々の手法であるCQ-SID(Category-and-Query constrained Semantic ID)では、カテゴリ認識とクエリイテムの対照的な学習とResidual Quantized VAEを用いて、アイテムを階層的なセマンティッククラスタ識別子にエンコードし、ビーム探索の複雑さを大幅に低減する。また, 提案手法は, 学習を安定させるために, 接地トラスサンプルを注入することにより, 遺伝的リコールと低報酬下での下流ランキングを整合させる強化学習手法であるEG-GRPO(Expert-Guided Group Relative Policy Optimization)を開発した。 TmallAPPサーチログのオフライン実験により、CQ-SIDはRQ-VAEベースラインよりもセマンティックでパーソナライズされたクリックヒット率で最大26.76%、11.11%向上し、ビームサーチサイズは半減した。 EG-GRPOは多目的性能をさらに向上させる。オンラインA/Bテストでは、GMV(+1.15%)とUCTCVR(+0.40%)の上昇が確認されている。再生リコールチャネルは現在、50.25%以上の露出、58.96%のクリック、72.63%の購入で生産に大きく貢献しており、実際のeコマースシステムに生成検索を展開するための有効な道筋を示している。

論文の概要: Efficient Generative Retrieval for E-commerce Search with Semantic Cluster IDs and Expert-Guided RL

関連論文リスト