Fugu-MT 論文翻訳(概要): Balanced Co-Clustering of Users and Items for Embedding Table Compression in Recommender Systems

論文の概要: Balanced Co-Clustering of Users and Items for Embedding Table Compression in Recommender Systems

arxiv url: http://arxiv.org/abs/2604.18351v1
Date: Mon, 20 Apr 2026 14:45:01 GMT
ステータス: 翻訳完了
システム内更新日: 2026-04-21 21:52:52.943311
Title: Balanced Co-Clustering of Users and Items for Embedding Table Compression in Recommender Systems
Title（参考訳）: レコメンダシステムにおけるテーブル圧縮を埋め込むためのユーザとアイテムのバランスの取れた協調クラスタリング
Authors: Runhao Jiang, Renchi Yang, Donghao Wu,
Abstract要約: BACOは、ユーザとアイテムのグルーピングのためのユーザとイテムのインタラクションにおいて、協調的なシグナルを利用するというアイデアに基づいて構築されている。 BACOは,最大346倍の速度で,最大1.85%のリコール率で,埋め込みパラメータを75%以上削減し,最強のベースラインを超えることを示す。
参考スコア（独自算出の注目度）: 11.506483851165656
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Recommender systems have advanced markedly over the past decade by transforming each user/item into a dense embedding vector with deep learning models. At industrial scale, embedding tables constituted by such vectors of all users/items demand a vast amount of parameters and impose heavy compute and memory overhead during training and inference, hindering model deployment under resource constraints. Existing solutions towards embedding compression either suffer from severely compromised recommendation accuracy or incur considerable computational costs. To mitigate these issues, this paper presents BACO, a fast and effective framework for compressing embedding tables. Unlike traditional ID hashing, BACO is built on the idea of exploiting collaborative signals in user-item interactions for user and item groupings, such that similar users/items share the same embeddings in the codebook. Specifically, we formulate a balanced co-clustering objective that maximizes intra-cluster connectivity while enforcing cluster-volume balance, and unify canonical graph clustering techniques into the framework through rigorous theoretical analyses. To produce effective groupings while averting codebook collapse, BACO instantiates this framework with a principled weighting scheme for users and items, an efficient label propagation solver, as well as secondary user clusters. Our extensive experiments comparing BACO against full models and 18 baselines over benchmark datasets demonstrate that BACO cuts embedding parameters by over 75% with a drop of at most 1.85% in recall, while surpassing the strongest baselines by being up to 346X faster.
Abstract（参考訳）: 過去10年間で、レコメンダシステムは、各ユーザ/イテムをディープラーニングモデルで密着した埋め込みベクタに変換することで、著しく進歩しました。産業規模では、すべてのユーザ/イテムのこのようなベクトルで構成された埋め込みテーブルは、大量のパラメータを必要とし、トレーニングと推論中に大量の計算とメモリオーバーヘッドを課し、リソース制約下でのモデルデプロイメントを妨げる。圧縮を埋め込むための既存の解決策は、非常に妥協された推奨精度に悩まされるか、かなりの計算コストがかかる。これらの問題を緩和するために,組み込みテーブルを高速かつ効果的に圧縮する BACO を提案する。従来のIDハッシュとは異なり、BACOはユーザとアイテムのグルーピングにおける協調的なシグナルを利用するという考え方に基づいている。具体的には、クラスタ・ボリュームのバランスを保ちながらクラスタ内接続を最大化し、厳密な理論的解析により標準グラフクラスタリング技術をフレームワークに統一する、バランスの取れたクラスタリングの目的を定式化する。 BACOは、コードブックの崩壊を回避しつつ効果的なグルーピングを作成するために、ユーザとアイテムの原則的な重み付けスキーム、効率的なラベル伝搬解決器、セカンダリユーザクラスタを用いて、このフレームワークをインスタンス化する。ベンチマークデータセットに対するBACOと18のベースラインを比較した大規模な実験は、BACOが埋め込みパラメータを75%以上削減し、最大1.85%のリコールを達成し、最大346倍高速で最強のベースラインを超えたことを示している。

論文の概要: Balanced Co-Clustering of Users and Items for Embedding Table Compression in Recommender Systems

関連論文リスト