Fugu-MT 論文翻訳(概要): Scaling Dense Retrieval with LLM-Annotated Training Data: Structured Mining and Progressive Curriculum for E-Commerce Sponsored Search

論文の概要: Scaling Dense Retrieval with LLM-Annotated Training Data: Structured Mining and Progressive Curriculum for E-Commerce Sponsored Search

arxiv url: http://arxiv.org/abs/2606.23911v1
Date: Mon, 22 Jun 2026 20:19:41 GMT
ステータス: 翻訳完了
システム内更新日: 2026-06-24 22:16:48.648941
Title: Scaling Dense Retrieval with LLM-Annotated Training Data: Structured Mining and Progressive Curriculum for E-Commerce Sponsored Search
Title（参考訳）: LLMアノテートトレーニングデータによるDense Retrievalのスケーリング:Eコマーススポンサー検索のための構造化マイニングとプログレッシブカリキュラム
Authors: Md Omar Faruk Rokon, Shasvat Desai, Jhalak Nilesh Acharya, Isha Shah, Kumar Priyam, Brahanyaa Somasundaram, Vamsee Tangirala, Minuteresa Thomas, Vivek Arora, Vijay Manchi, Hong Yao, Kuang-chih Lee,
Abstract要約: クリック信号や手動のアノテーションに頼ることなく、高密度検索モデルの高品質なトレーニングデータを生成する方法を示す。トレーニングされた2tower BERTモデルをWalmartのスポンサー付き検索にデプロイし、トレーニングされたサードパーティのアノテータによってラベル付けされた30Kクエリに対して評価する。このシステムは,クリック訓練された生産ベースラインに対して+5.1%のNDCG@10を達成し,テールクエリの最大増加率を示す。
参考スコア（独自算出の注目度）: 4.369711604105306
License: http://creativecommons.org/licenses/by/4.0/
Abstract: How can we generate high-quality training data for dense retrieval models at production scale, without relying on click signals or manual annotation? This question is critical for e-commerce sponsored search, where click-based training suffers from position bias and tail-query sparsity, and manual labeling at the scale of hundreds of millions of query-item pairs is economically infeasible. Our work is driven by the following insight: heterogeneous retrieval systems disagree on most items they retrieve, and this disagreement creates a natural source of structured training signal -- easy positives where all systems agree, hard positives that only lexical systems find, and hard negatives that fool exactly one system. As our key novelty, we combine three ideas into an end-to-end pipeline: (a) multi-channel retrieval mining with rank metadata from three production systems, (b) graded-relevance annotation by a calibrated three-model cascade ) that reaches 89.1% agreement with trained human annotators, and (c) three-stage progressive curriculum training that organizes 240M+ training examples across five difficulty levels. We deploy the trained two-tower BERT model on Walmart's sponsored search and evaluate it against 30K queries labeled by trained third-party human annotators. First, we show that the system achieves +5.1% NDCG@10 over the click-trained production baseline, with the largest gain on tail queries . Second, we show that embarrassing retrievals (rating 0) drop from 8.7% to 3.5%. Third, a two-week online A/B test with tens of millions of ad requests per arm confirms +2.80% ad spend, +1.4% CTR, +2.8% eCPM, and +2.9% click conversion rate. Overall, our work provides a practical and scalable blueprint for replacing click-based training with structured LLM-annotated supervision in production retrieval systems.
Abstract（参考訳）: クリック信号や手動のアノテーションを使わずに、どうやって高密度検索モデルの高品質なトレーニングデータを生成することができるのか? この問題は、クリックベースのトレーニングが位置バイアスやテールクェリの間隔に悩まされているEコマースによる検索にとって重要であり、数億のクエリ-イムペアの規模での手動ラベリングは経済的に不可能である。異質な検索システムは、取得するほとんどの項目に異を唱え、この不一致は、構造化されたトレーニング信号の自然なソースを生み出します。重要なノベルティとして、私たちは3つのアイデアをエンドツーエンドのパイプラインにまとめています。 (a)3つの生産システムからのランクメタデータによるマルチチャネル検索マイニング b) 訓練されたヒトアノテータとの89.1%の合意に達した校正3モデルカスケードによる格付け関連アノテーション (c)5つの難易度で240万以上の研修例を編成する3段階の累進カリキュラム研修。トレーニングされた2tower BERTモデルをWalmartのスポンサー付き検索にデプロイし、トレーニングされたサードパーティのアノテータによってラベル付けされた30Kクエリに対して評価する。まず,5.1%のNDCG@10をクリック訓練した生産ベースラインで達成し,テールクエリの最大増加率を示す。第2に、恥ずかしい検索(0)は8.7%から3.5%に減少した。第3に、腕に数千万の広告要求がある2週間のオンラインA/Bテストでは、+2.80%の広告支出、+1.4%のCTR、+2.8%のeCPM、+2.9%のクリックコンバージョン率が確認されている。全体として、本研究は、クリックベースのトレーニングを実運用システムにおけるLLMアノテートされた監視システムに置き換えるための実用的でスケーラブルな青写真を提供する。

論文の概要: Scaling Dense Retrieval with LLM-Annotated Training Data: Structured Mining and Progressive Curriculum for E-Commerce Sponsored Search

関連論文リスト