Fugu-MT 論文翻訳(概要): RRCM: Ranking-Driven Retrieval over Collaborative and Meta Memories for LLM Recommendation

論文の概要: RRCM: Ranking-Driven Retrieval over Collaborative and Meta Memories for LLM Recommendation

arxiv url: http://arxiv.org/abs/2605.07129v1
Date: Fri, 08 May 2026 02:07:30 GMT
ステータス: 翻訳完了
システム内更新日: 2026-05-11 19:43:38.737717
Title: RRCM: Ranking-Driven Retrieval over Collaborative and Meta Memories for LLM Recommendation
Title（参考訳）: RRCM:LLMレコメンデーションのためのコラボレーションとメタメモリのランキング駆動検索
Authors: Shijun Li, Wooseong Yang, Yu Wang, Tianxin Wei, Joydeep Ghosh,
Abstract要約: 大規模言語モデル(LLM)は、次世代のレコメンデーションシステムにとって有望なパラダイムとして登場した。 LLMに基づくエージェントレコメンデーションのためのランキング駆動型検索・推論フレームワークRRCMを提案する。
参考スコア（独自算出の注目度）: 14.301653810918713
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Large Language Models (LLMs) have emerged as a promising paradigm for next-generation recommender systems, offering strong semantic understanding and natural-language reasoning abilities. Despite recent progress, current LLM-based recommenders still face key challenges in constructing decision-relevant contexts from heterogeneous evidence. First, existing methods often rely on fixed context construction strategies: collaborative behavioral evidence and item-side metadata are typically incorporated through predefined prompts, static retrieval pipelines, or handcrafted injection mechanisms, making it difficult to determine what information is truly beneficial for each instance. Second, heterogeneous evidence introduces a severe context-efficiency bottleneck. Rich metadata and collaborative interaction records can quickly overwhelm the context window, while aggressive compression or heuristic filtering may discard fine-grained evidence critical for accurate recommendation. To address these challenges, we propose RRCM, a ranking-driven retrieval-and-reasoning framework over collaborative and metadata memories for LLM-based agentic recommendation. RRCM starts from a lightweight user-history context and learns whether to recommend directly, retrieve collaborative evidence, retrieve item metadata, or interleave both through reasoning. Both memories are represented in natural language and accessed through a unified retrieval interface, enabling flexible evidence acquisition without handcrafted CF injection or fixed retrieval rules. We optimize this memory-reading policy with an outcome-only ranking reward, instantiated using group relative policy optimization, so that retrieval decisions are directly driven by final top-k recommendation quality. Extensive experiments show that RRCM significantly outperforms traditional baselines and diverse LLM-based recommendation approaches.
Abstract（参考訳）: 大規模言語モデル(LLM)は、強力な意味理解と自然言語推論能力を提供する次世代レコメンデータシステムにとって有望なパラダイムとして登場した。近年の進歩にもかかわらず、現在のLLMベースのレコメンデータは、異種証拠から意思決定関連コンテキストを構築する上で重要な課題に直面している。協調的な振る舞いのエビデンスとアイテム側のメタデータは通常、事前に定義されたプロンプトや静的なパイプライン、手作りのインジェクションメカニズムを通じて組み込まれます。第二に、異質な証拠は厳しい文脈効果のボトルネックをもたらす。リッチなメタデータと協調的な相互作用記録は、文脈の窓を素早く圧倒し、攻撃的な圧縮やヒューリスティックなフィルタリングは、正確な推奨のために重要なきめ細かい証拠を捨てる可能性がある。これらの課題に対処するために,LLMに基づくエージェント推奨のための協調記憶とメタデータ記憶を用いたランキング駆動型検索・推論フレームワークRRCMを提案する。 RRCMは、軽量なユーザ履歴コンテキストから始まり、直接推奨するか、協力的なエビデンスを検索するか、アイテムメタデータを検索するか、推論を通じて両方をインターリーブするかを学ぶ。どちらの記憶も自然言語で表現され、統一された検索インタフェースを通じてアクセスされ、手作りのCF注入や固定された検索規則なしで柔軟なエビデンスを取得することができる。我々は、グループ相対ポリシー最適化を用いて、結果のみのランキング報酬で、このメモリ読み込みポリシーを最適化し、検索決定が最終トップk推奨品質によって直接駆動されるようにした。大規模な実験により、RRCMは従来のベースラインと多様なLLMベースのレコメンデーションアプローチを著しく上回っていることが示された。

論文の概要: RRCM: Ranking-Driven Retrieval over Collaborative and Meta Memories for LLM Recommendation

関連論文リスト