Fugu-MT 論文翻訳(概要): URecJPQ: Memory-efficient Multimodal Recommendation Models through RecJPQ in Large-Scale Scenarios

論文の概要: URecJPQ: Memory-efficient Multimodal Recommendation Models through RecJPQ in Large-Scale Scenarios

arxiv url: http://arxiv.org/abs/2606.23291v1
Date: Mon, 22 Jun 2026 13:05:03 GMT
ステータス: 翻訳完了
システム内更新日: 2026-06-24 22:52:25.415134
Title: URecJPQ: Memory-efficient Multimodal Recommendation Models through RecJPQ in Large-Scale Scenarios
Title（参考訳）: URecJPQ:大規模シナリオにおけるRecJPQによるメモリ効率のよいマルチモーダルレコメンデーションモデル
Authors: Giuseppe Spillo, Zixuan Yi, Aleksandr Petrov, Cataldo Musto, Craig Macdonald, Iadh Ounis,
Abstract要約: 本稿では,大規模かつマルチモーダルなトップkレコメンデーションタスクに特化して設計された共同製品量子化手法URecJPQを紹介する。中心となる考え方は、各ユーザ/イテムを、完全に学習された独自の埋め込みではなく、共有学習されたサブ埋め込みの結合として表現することである。広く使われている3つのデータセットの実験では、URecJPQがマルチモーダルなレコメンデーション設定に効果的に適用できることが示されている。
参考スコア（独自算出の注目度）: 52.726756906540636
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Training state-of-the-art recommendation models on large-scale industrial datasets can be a challenging task due to the high number of users and items which are typically represented through ID embeddings. Such embeddings typically require a large amount of memory resources, which are not always available. This problem is further exacerbated in multimodal recommendation, in which multimodal item features generally improve recommendation performance, but require more resources to encode. In this paper, we introduce URecJPQ, a Joint Product Quantization method specifically designed for large-scale and multimodal top-k recommendation tasks, in which the vast number of users and items, combined with the available modalities, further increases the memory demands for the computation. The core idea is to represent each user/item not as a fully learned, unique embedding, but rather as a concatenation of shared learned sub-embeddings, thereby significantly reducing the total number of trainable parameters. Our experiments on three widely-used datasets across different domains (movies, baby and sports products) show that URecJPQ can be effectively applied to multimodal recommendation settings. In large scale scenarios, we observe a substantial reduction in checkpoint sizes and the number of trainable parameters (ranging from 86% to 98%, and 98% to 99%, respectively), with only a marginal decrease in accuracy (8.5% on recall and 16% on NDCG, on average), and, in some cases, even performance improvements (up to 85%), as in the baby products domain. Our codebase is available at https://anonymous.4open.science/r/large_mmrecjpq-839B/README.md.
Abstract（参考訳）: 大規模産業データセット上で最先端のレコメンデーションモデルをトレーニングすることは、ID埋め込みを通じて一般的に表現される多数のユーザやアイテムのために、難しい作業になる可能性がある。このような埋め込みは一般的に大量のメモリリソースを必要とするが、必ずしも利用できない。この問題は、マルチモーダルな項目の特徴が一般的にレコメンデーション性能を改善するが、エンコードするためにより多くのリソースを必要とする、マルチモーダルなレコメンデーションにおいてさらに悪化している。本稿では,大規模かつマルチモーダルなトップkレコメンデーションタスクに特化して設計された共同製品量子化手法であるURecJPQを紹介する。中心となる考え方は、各ユーザ/イテムを、完全に学習されたユニークな埋め込みではなく、共有された学習サブ埋め込みの結合として表現し、トレーニング可能なパラメータの総数を著しく削減することである。異なるドメイン(移動,ベビー,スポーツ製品)で広く利用されている3つのデータセットについて実験したところ,URecJPQはマルチモーダルなレコメンデーション設定に効果的に適用できることがわかった。大規模シナリオでは,チェックポイントサイズとトレーニング可能なパラメータ(それぞれ86%から98%,98%から99%)の大幅な削減が観察され,精度は限界的に低下した(リコールでは8.5%,NDCGでは16%)。私たちのコードベースはhttps://anonymous.4open.science/r/large_mmrecjpq-839B/README.mdで公開されています。

論文の概要: URecJPQ: Memory-efficient Multimodal Recommendation Models through RecJPQ in Large-Scale Scenarios

関連論文リスト