Learning to Recommend Using Non-Uniform Data
- URL: http://arxiv.org/abs/2110.11248v1
- Date: Thu, 21 Oct 2021 16:17:40 GMT
- Title: Learning to Recommend Using Non-Uniform Data
- Authors: Wanning Chen and Mohsen Bayati
- Abstract summary: Learning user preferences for products based on past purchases or reviews is at the cornerstone of modern recommendation engines.
Some users are more likely to purchase products or review them, and some products are more likely to be purchased or reviewed by the users.
This non-uniform pattern degrades the power of many existing recommendation algorithms.
- Score: 7.005458308454873
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Learning user preferences for products based on their past purchases or
reviews is at the cornerstone of modern recommendation engines. One
complication in this learning task is that some users are more likely to
purchase products or review them, and some products are more likely to be
purchased or reviewed by the users. This non-uniform pattern degrades the power
of many existing recommendation algorithms, as they assume that the observed
data is sampled uniformly at random among user-product pairs. In addition,
existing literature on modeling non-uniformity either assume user interests are
independent of the products, or lack theoretical understanding. In this paper,
we first model the user-product preferences as a partially observed matrix with
non-uniform observation pattern. Next, building on the literature about
low-rank matrix estimation, we introduce a new weighted trace-norm penalized
regression to predict unobserved values of the matrix. We then prove an upper
bound for the prediction error of our proposed approach. Our upper bound is a
function of a number of parameters that are based on a certain weight matrix
that depends on the joint distribution of users and products. Utilizing this
observation, we introduce a new optimization problem to select a weight matrix
that minimizes the upper bound on the prediction error. The final product is a
new estimator, NU-Recommend, that outperforms existing methods in both
synthetic and real datasets.
Related papers
- Attribute-to-Delete: Machine Unlearning via Datamodel Matching [65.13151619119782]
Machine unlearning -- efficiently removing a small "forget set" training data on a pre-divertrained machine learning model -- has recently attracted interest.
Recent research shows that machine unlearning techniques do not hold up in such a challenging setting.
arXiv Detail & Related papers (2024-10-30T17:20:10Z) - Error Reduction from Stacked Regressions [12.657895453939298]
Stacking regressions is an ensemble technique that forms linear combinations of different regression estimators to enhance predictive accuracy.
In this paper, we learn these weights analogously by minimizing a regularized version of the empirical risk subject to a nonnegativity constraint.
Thanks to an adaptive shrinkage effect, the resulting stacked estimator has strictly smaller population risk than best single estimator among them.
arXiv Detail & Related papers (2023-09-18T15:42:12Z) - Rethinking Missing Data: Aleatoric Uncertainty-Aware Recommendation [59.500347564280204]
We propose a new Aleatoric Uncertainty-aware Recommendation (AUR) framework.
AUR consists of a new uncertainty estimator along with a normal recommender model.
As the chance of mislabeling reflects the potential of a pair, AUR makes recommendations according to the uncertainty.
arXiv Detail & Related papers (2022-09-22T04:32:51Z) - MRCLens: an MRC Dataset Bias Detection Toolkit [82.44296974850639]
We introduce MRCLens, a toolkit that detects whether biases exist before users train the full model.
For the convenience of introducing the toolkit, we also provide a categorization of common biases in MRC.
arXiv Detail & Related papers (2022-07-18T21:05:39Z) - Sequential Recommendation via Stochastic Self-Attention [68.52192964559829]
Transformer-based approaches embed items as vectors and use dot-product self-attention to measure the relationship between items.
We propose a novel textbfSTOchastic textbfSelf-textbfAttention(STOSA) to overcome these issues.
We devise a novel Wasserstein Self-Attention module to characterize item-item position-wise relationships in sequences.
arXiv Detail & Related papers (2022-01-16T12:38:45Z) - Top-N Recommendation with Counterfactual User Preference Simulation [26.597102553608348]
Top-N recommendation, which aims to learn user ranking-based preference, has long been a fundamental problem in a wide range of applications.
In this paper, we propose to reformulate the recommendation task within the causal inference framework to handle the data scarce problem.
arXiv Detail & Related papers (2021-09-02T14:28:46Z) - PURS: Personalized Unexpected Recommender System for Improving User
Satisfaction [76.98616102965023]
We describe a novel Personalized Unexpected Recommender System (PURS) model that incorporates unexpectedness into the recommendation process.
Extensive offline experiments on three real-world datasets illustrate that the proposed PURS model significantly outperforms the state-of-the-art baseline approaches.
arXiv Detail & Related papers (2021-06-05T01:33:21Z) - Towards Open-World Recommendation: An Inductive Model-based
Collaborative Filtering Approach [115.76667128325361]
Recommendation models can effectively estimate underlying user interests and predict one's future behaviors.
We propose an inductive collaborative filtering framework that contains two representation models.
Our model achieves promising results for recommendation on few-shot users with limited training ratings and new unseen users.
arXiv Detail & Related papers (2020-07-09T14:31:25Z) - Adversarial learning for product recommendation [0.0]
This work proposes a conditional, coupled generative adversarial network (RecommenderGAN) that learns to produce samples from a joint distribution between (view, buy) behaviors.
Our results are preliminary, however they suggest that the recommendations produced by the model may provide utility for consumers and digital retailers.
arXiv Detail & Related papers (2020-07-07T23:35:36Z) - Discrete-Valued Latent Preference Matrix Estimation with Graph Side
Information [12.836994708337144]
We develop an algorithm that matches the optimal sample complexity.
Our algorithm is robust to model errors and outperforms the existing algorithms in terms of prediction performance.
arXiv Detail & Related papers (2020-03-16T06:29:24Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.