Fugu-MT 論文翻訳(概要): CardRewriter: Leveraging Knowledge Cards for Long-Tail Query Rewriting on Short-Video Platforms

論文の概要: CardRewriter: Leveraging Knowledge Cards for Long-Tail Query Rewriting on Short-Video Platforms

arxiv url: http://arxiv.org/abs/2510.10095v1
Date: Sat, 11 Oct 2025 08:09:14 GMT
ステータス: 翻訳完了
システム内更新日: 2025-10-14 18:06:29.778044
Title: CardRewriter: Leveraging Knowledge Cards for Long-Tail Query Rewriting on Short-Video Platforms
Title（参考訳）: CardRewriter:ショートビデオプラットフォーム上でのロングテールクエリ書き換えに知識カードを活用する
Authors: Peiyuan Gong, Feiran Zhu, Yaqi Yin, Chenglei Dai, Chao Zhang, Kai Zheng, Wentian Bao, Jiaxin Mao, Yi Zhang,
Abstract要約: textbfCardRewriterは、ドメイン固有の知識を取り入れたフレームワークで、ロングテールクエリの書き直しを強化します。 CardRewriterは、プロプライエタリなコンテンツをターゲットにしたクエリの書き換え品質を大幅に改善する。中国最大のショートビデオプラットフォームであるKuaishouに毎日数億人のユーザーがサービスを提供している。
参考スコア（独自算出の注目度）: 22.46507227200999
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Short-video platforms have rapidly become a new generation of information retrieval systems, where users formulate queries to access desired videos. However, user queries, especially long-tail ones, often suffer from spelling errors, incomplete phrasing, and ambiguous intent, resulting in mismatches between user expectations and retrieved results. While large language models (LLMs) have shown success in long-tail query rewriting within e-commerce, they struggle on short-video platforms, where proprietary content such as short videos, live streams, micro dramas, and user social networks falls outside their training distribution. To address this challenge, we introduce \textbf{CardRewriter}, an LLM-based framework that incorporates domain-specific knowledge to enhance long-tail query rewriting. For each query, our method aggregates multi-source knowledge relevant to the query and summarizes it into an informative and query-relevant knowledge card. This card then guides the LLM to better capture user intent and produce more effective query rewrites. We optimize CardRewriter using a two-stage training pipeline: supervised fine-tuning followed by group relative policy optimization, with a tailored reward system balancing query relevance and retrieval effectiveness. Offline experiments show that CardRewriter substantially improves rewriting quality for queries targeting proprietary content. Online A/B testing further confirms significant gains in long-view rate (LVR) and click-through rate (CTR), along with a notable reduction in initiative query reformulation rate (IQRR). Since September 2025, CardRewriter has been deployed on Kuaishou, one of China's largest short-video platforms, serving hundreds of millions of users daily.
Abstract（参考訳）: ショートビデオプラットフォームは、ユーザーが望むビデオにアクセスするためのクエリを定式化する情報検索システムとして、急速に新しい世代のものになりつつある。しかし、ユーザクエリ、特にロングテールクエリは、スペルエラー、不完全なフレーズ、曖昧な意図に悩まされ、ユーザの期待と結果のミスマッチが発生します。大規模言語モデル(LLM)は、eコマース内でのロングテールクエリ書き換えに成功しているが、ショートビデオ、ライブストリーム、マイクロドラマ、ユーザーソーシャルネットワークといったプロプライエタリなコンテンツがトレーニング配信外にあるショートビデオプラットフォームでは苦労している。この課題に対処するために,LLM ベースのフレームワークである \textbf{CardRewriter} を紹介した。クエリ毎に,クエリに関連する複数のソース知識を集約し,それを情報およびクエリ関連知識カードに要約する。このカードはLLMを誘導し、ユーザーの意図をよりよく捉え、より効果的なクエリ書き換えを生成する。 CardRewriterを2段階の訓練パイプラインで最適化する: 教師付き微調整とグループ相対的なポリシー最適化、クエリの妥当性と検索効率のバランスを調整した報酬システム。オフライン実験は、CardRewriterがプロプライエタリなコンテンツをターゲットにしたクエリの書き換え品質を大幅に改善していることを示している。オンラインA/Bテストはさらに、LVR(Long-view rate)とCTR(Click-through rate)の顕著な増加と、イニシアチブクエリ改革率(IQRR)の顕著な低下を確認している。 2025年9月以降、CardRewriterは中国最大のショートビデオプラットフォームであるKuaishouに展開され、毎日数億人のユーザーにサービスを提供している。

論文の概要: CardRewriter: Leveraging Knowledge Cards for Long-Tail Query Rewriting on Short-Video Platforms

関連論文リスト