Robust Collaborative Filtering to Popularity Distribution Shift
- URL: http://arxiv.org/abs/2310.10696v1
- Date: Mon, 16 Oct 2023 04:20:52 GMT
- Title: Robust Collaborative Filtering to Popularity Distribution Shift
- Authors: An Zhang, Wenchang Ma, Jingnan Zheng, Xiang Wang, Tat-seng Chua
- Abstract summary: We present a simple yet effective debiasing strategy, PopGo, which quantifies and reduces the interaction-wise popularity shortcut without assumptions on the test data.
On both ID and OOD test sets, PopGo achieves significant gains over the state-of-the-art debiasing strategies.
- Score: 56.78171423428719
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: In leading collaborative filtering (CF) models, representations of users and
items are prone to learn popularity bias in the training data as shortcuts. These
popularity shortcuts help in-distribution (ID) performance but generalize
poorly to out-of-distribution (OOD) data, i.e., when the popularity
distribution of the test data shifts w.r.t. the training one. To close the gap,
debiasing strategies try to assess the shortcut degrees and remove them from
the representations. However, two deficiencies remain: (1) when measuring
the shortcut degrees, most strategies only use statistical metrics on a single
aspect (i.e., item frequency on the item aspect and user frequency on the user
aspect), failing to accommodate the compositional degree of a user-item pair; (2) when
mitigating shortcuts, many strategies assume that the test distribution is
known in advance. This results in low-quality debiased representations. Worse
still, these strategies achieve OOD generalizability at the cost of ID
performance. In this work, we present a simple yet effective debiasing
strategy, PopGo, which quantifies and reduces the interaction-wise popularity
shortcut without any assumptions on the test data. It first learns a shortcut
model, which yields a shortcut degree of a user-item pair based on their
popularity representations. Then, it trains the CF model by adjusting the
predictions with the interaction-wise shortcut degrees. Viewing PopGo from both
causal and information-theoretic perspectives, we can justify why it encourages
the CF model to capture the critical popularity-agnostic features while leaving
the spurious popularity-relevant patterns out. We use PopGo to debias two
high-performing CF models (MF, LightGCN) on four benchmark datasets. On both ID
and OOD test sets, PopGo achieves significant gains over the state-of-the-art
debiasing strategies (e.g., DICE, MACR).
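The two-stage idea in the abstract (a shortcut model that scores a user-item pair from popularity representations, then a CF model whose predictions are adjusted by that score) can be illustrated with a minimal NumPy sketch. The abstract gives no formulas, so everything specific here is an assumption made for illustration: the function names are hypothetical, popularity embeddings are indexed by raw interaction count, the shortcut degree is a sigmoid of a dot product, and the adjustment is multiplicative. The embeddings are random rather than trained, so this shows only the forward computation, not the authors' training procedure.

```python
import numpy as np

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def popularity_counts(interactions, n_users, n_items):
    """Count how often each user and each item appears in the training pairs."""
    user_pop = np.bincount(interactions[:, 0], minlength=n_users)
    item_pop = np.bincount(interactions[:, 1], minlength=n_items)
    return user_pop, item_pop

def shortcut_degree(user_pop, item_pop, user_pop_emb, item_pop_emb, u, i):
    """Interaction-wise shortcut degree computed from popularity only.

    Assumption: embeddings are looked up by popularity count, not by ID,
    so users (or items) with equal counts share one representation.
    """
    pu = user_pop_emb[user_pop[u]]
    pi = item_pop_emb[item_pop[i]]
    return sigmoid(np.sum(pu * pi, axis=-1))

def adjusted_score(user_emb, item_emb, u, i, degree):
    """CF prediction rescaled by the shortcut degree (assumed multiplicative)."""
    cf = sigmoid(np.sum(user_emb[u] * item_emb[i], axis=-1))
    return cf * degree

# Toy training data: rows are (user, item) pairs.
interactions = np.array([[0, 0], [1, 0], [2, 0], [0, 1], [1, 2]])
n_users, n_items, dim = 3, 3, 4
rng = np.random.default_rng(0)

user_pop, item_pop = popularity_counts(interactions, n_users, n_items)

# Stage 1 stand-in: popularity embedding tables (random here, learned in practice).
user_pop_emb = rng.normal(size=(user_pop.max() + 1, dim))
item_pop_emb = rng.normal(size=(item_pop.max() + 1, dim))

# Stage 2 stand-in: ID-based CF embeddings, as in matrix factorization.
user_emb = rng.normal(size=(n_users, dim))
item_emb = rng.normal(size=(n_items, dim))

u = np.array([0, 1])
i = np.array([1, 2])
degree = shortcut_degree(user_pop, item_pop, user_pop_emb, item_pop_emb, u, i)
scores = adjusted_score(user_emb, item_emb, u, i, degree)
```

In this toy data, the pairs (0, 1) and (1, 2) involve users and items with identical popularity counts, so they receive the same shortcut degree even though their ID-based CF scores differ. That is the sense in which the degree depends on the pair's popularity composition rather than on who the user or item is.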
Related papers
- Popularity-Aware Alignment and Contrast for Mitigating Popularity Bias [34.006766098392525]
Collaborative Filtering (CF) typically suffers from the challenge of popularity bias due to the uneven distribution of items in real-world datasets.
This bias leads to a significant accuracy gap between popular and unpopular items.
We propose Popularity-Aware Alignment and Contrast (PAAC) to address two challenges.
arXiv Detail & Related papers (2024-05-31T09:14:48Z)
- Debiasing Multimodal Models via Causal Information Minimization [65.23982806840182]
We study bias arising from confounders in a causal graph for multimodal data.
Robust predictive features contain diverse information that helps a model generalize to out-of-distribution data.
We use these features as confounder representations and use them via methods motivated by causal theory to remove bias from models.
arXiv Detail & Related papers (2023-11-28T16:46:14Z)
- Test Time Embedding Normalization for Popularity Bias Mitigation [6.145760252113906]
Popularity bias is a widespread problem in the field of recommender systems.
We propose 'Test Time Embedding Normalization' as a simple yet effective strategy for mitigating popularity bias.
arXiv Detail & Related papers (2023-08-22T08:57:44Z)
- uCTRL: Unbiased Contrastive Representation Learning via Alignment and Uniformity for Collaborative Filtering [6.663503238373593]
Collaborative filtering (CF) models tend to yield recommendation lists with popularity bias.
We propose Unbiased ConTrastive Representation Learning (uCTRL) to mitigate this problem.
We also devise a novel IPW estimation method that removes the bias of both users and items.
arXiv Detail & Related papers (2023-05-22T06:55:38Z)
- Look Beyond Bias with Entropic Adversarial Data Augmentation [4.893694715581673]
Deep neural networks do not discriminate between spurious and causal patterns, and will only learn the most predictive ones while ignoring the others.
Debiasing methods were developed to make networks robust to such spurious biases but require knowing in advance whether a dataset is biased.
In this paper, we argue that such samples are not necessarily needed because the "hidden" causal information is often also contained in biased images.
arXiv Detail & Related papers (2023-01-10T08:25:24Z)
- A Graph-Enhanced Click Model for Web Search [67.27218481132185]
We propose a novel graph-enhanced click model (GraphCM) for web search.
We exploit both intra-session and inter-session information for the sparsity and cold-start problems.
arXiv Detail & Related papers (2022-06-17T08:32:43Z)
- Cross Pairwise Ranking for Unbiased Item Recommendation [57.71258289870123]
We develop a new learning paradigm named Cross Pairwise Ranking (CPR).
CPR achieves unbiased recommendation without knowing the exposure mechanism.
We prove in theory that this approach offsets the influence of user/item propensity on the learning.
arXiv Detail & Related papers (2022-04-26T09:20:27Z)
- A Closer Look at Debiased Temporal Sentence Grounding in Videos: Dataset, Metric, and Approach [53.727460222955266]
Temporal Sentence Grounding in Videos (TSGV) aims to ground a natural language sentence in an untrimmed video.
Recent studies have found that current benchmark datasets may have obvious moment annotation biases.
We introduce a new evaluation metric "dR@n,IoU@m" that discounts the basic recall scores to alleviate the inflating evaluation caused by biased datasets.
arXiv Detail & Related papers (2022-03-10T08:58:18Z)
- Diverse Knowledge Distillation for End-to-End Person Search [81.4926655119318]
Person search aims to localize and identify a specific person from a gallery of images.
Recent methods can be categorized into two groups, i.e., two-step and end-to-end approaches.
We propose a simple yet strong end-to-end network with diverse knowledge distillation to break the bottleneck.
arXiv Detail & Related papers (2020-12-21T09:04:27Z)