Theoretical Modeling of the Iterative Properties of User Discovery in a
Collaborative Filtering Recommender System
- URL: http://arxiv.org/abs/2008.13526v1
- Date: Fri, 21 Aug 2020 20:30:39 GMT
- Authors: Sami Khenissi and Mariem Boujelbene and Olfa Nasraoui
- Abstract summary: The closed feedback loop in recommender systems is a common setting that can lead to different types of biases.
We present a theoretical framework to model the evolution of the different components of a recommender system operating within a feedback loop setting.
Our findings lay the theoretical basis for quantifying the effect of feedback loops and for designing Artificial Intelligence and machine learning algorithms.
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: The closed feedback loop in recommender systems is a common setting that can
lead to different types of biases. Several studies have dealt with these biases
by designing methods to mitigate their effect on the recommendations. However,
most existing studies do not consider the iterative behavior of the system
where the closed feedback loop plays a crucial role in incorporating different
biases into several parts of the recommendation steps.
We present a theoretical framework to model the asymptotic evolution of the
different components of a recommender system operating within a feedback loop
setting, and derive theoretical bounds and convergence properties on
quantifiable measures of the user discovery and blind spots. We also validate
our theoretical findings empirically using a real-life dataset and empirically
test the efficiency of a basic exploration strategy within our theoretical
framework.
Our findings lay the theoretical basis for quantifying the effect of feedback
loops and for designing Artificial Intelligence and machine learning algorithms
that explicitly incorporate the iterative nature of feedback loops in the
machine learning and recommendation process.
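To make the feedback-loop setting in the abstract concrete, here is a minimal toy simulation. It is not the authors' model: the function name `simulate_feedback_loop`, the single-user setup, the smoothed like-rate scorer, and the epsilon-style random exploration are all assumptions chosen only for illustration. It tracks the fraction of the catalog the user has been shown (a rough stand-in for "user discovery"); items never shown form the blind spot.

```python
import random


def simulate_feedback_loop(n_items=50, n_rounds=30, top_k=5, epsilon=0.0, seed=0):
    """Toy closed feedback loop for one user (illustrative assumptions only).

    Returns a list with the fraction of items shown so far after each round.
    """
    rng = random.Random(seed)
    # Hypothetical ground truth: probability that the user likes each item.
    true_pref = [rng.random() for _ in range(n_items)]
    # The model only scores items it has feedback on; unseen items score 0.
    est = [0.0] * n_items
    counts = [0] * n_items
    likes = [0] * n_items
    shown = set()
    discovery = []

    for _ in range(n_rounds):
        # Recommend the top-k items by current estimated score.
        ranked = sorted(range(n_items), key=lambda i: est[i], reverse=True)
        recs = ranked[:top_k]
        # Basic exploration: each slot is replaced by a random item
        # with probability epsilon.
        for slot in range(top_k):
            if rng.random() < epsilon:
                recs[slot] = rng.randrange(n_items)
        # The user reacts only to what was shown; the model retrains on that.
        for i in set(recs):
            shown.add(i)
            counts[i] += 1
            if rng.random() < true_pref[i]:
                likes[i] += 1
            # Smoothed like rate, always > 0 once an item has been shown.
            est[i] = (likes[i] + 1) / (counts[i] + 2)
        discovery.append(len(shown) / n_items)

    return discovery


if __name__ == "__main__":
    greedy = simulate_feedback_loop(epsilon=0.0)
    explore = simulate_feedback_loop(epsilon=0.2)
    print("final discovery without exploration:", greedy[-1])
    print("final discovery with exploration:   ", explore[-1])
```

In this sketch, with `epsilon=0.0` the system keeps recommending the first `top_k` items it ever showed, so discovery stalls at `top_k / n_items`. Any positive `epsilon` lets unseen items enter the loop. The paper's actual framework and bounds are more general than this toy setup.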
Related papers
- Learning Recommender Systems with Soft Target: A Decoupled Perspective [49.83787742587449]
We propose a novel decoupled soft label optimization framework to consider the objectives as two aspects by leveraging soft labels.
We present a sensible soft-label generation algorithm that models a label propagation algorithm to explore users' latent interests in unobserved feedback via neighbors.
(2024-10-09T04:20:15Z)
- Optimizing Circuit Reusing and its Application in Randomized Benchmarking [5.783105931700547]
Quantum learning tasks often leverage randomly sampled quantum circuits to characterize unknown systems.
An efficient approach known as "circuit reusing," where each circuit is executed multiple times, reduces the cost compared to implementing new circuits.
This work investigates the optimal reusing parameter that minimizes the variance of measurement outcomes for a given experimental cost.
(2024-07-22T12:18:12Z)
- Top-K Pairwise Ranking: Bridging the Gap Among Ranking-Based Measures for Multi-Label Classification [120.37051160567277]
This paper proposes a novel measure named Top-K Pairwise Ranking (TKPR).
A series of analyses show that TKPR is compatible with existing ranking-based measures.
On the other hand, we establish a sharp generalization bound for the proposed framework based on a novel technique named data-dependent contraction.
(2024-07-09T09:36:37Z)
- Iterative Preference Learning from Human Feedback: Bridging Theory and Practice for RLHF under KL-Constraint [56.74058752955209]
This paper studies the alignment process of generative models with Reinforcement Learning from Human Feedback (RLHF).
We first identify the primary challenge of existing popular methods like offline PPO and offline DPO as a lack of strategic exploration of the environment.
We propose efficient algorithms with finite-sample theoretical guarantees.
(2023-12-18T18:58:42Z)
- Exploring Popularity Bias in Session-based Recommendation [0.6798775532273751]
We extend the analysis to the session-based setup and adapt the propensity calculation to the unique characteristics of session-based recommendation tasks.
We study the distributions of propensity and different stratification techniques on different datasets and find that propensity-related traits are actually dataset-specific.
(2023-12-13T02:48:35Z)
- A PAC-Bayesian Perspective on the Interpolating Information Criterion [54.548058449535155]
We show how a PAC-Bayes bound is obtained for a general class of models, characterizing factors which influence performance in the interpolating regime.
We quantify how the test error for overparameterized models achieving effectively zero training error depends on the quality of the implicit regularization imposed by e.g. the combination of model, parameter-initialization scheme.
(2023-11-13T01:48:08Z)
- Provable Reward-Agnostic Preference-Based Reinforcement Learning [61.39541986848391]
Preference-based Reinforcement Learning (PbRL) is a paradigm in which an RL agent learns to optimize a task using pair-wise preference-based feedback over trajectories.
We propose a theoretical reward-agnostic PbRL framework where exploratory trajectories that enable accurate learning of hidden reward functions are acquired.
(2023-05-29T15:00:09Z)
- CausPref: Causal Preference Learning for Out-of-Distribution Recommendation [36.22965012642248]
The current recommender system is still vulnerable to the distribution shift of users and items in realistic scenarios.
We propose to incorporate the recommendation-specific DAG learner into a novel causal preference-based recommendation framework named CausPref.
Our approach surpasses the benchmark models significantly under types of out-of-distribution settings.
(2022-02-08T16:42:03Z)
- Top-N Recommendation with Counterfactual User Preference Simulation [26.597102553608348]
Top-N recommendation, which aims to learn user ranking-based preference, has long been a fundamental problem in a wide range of applications.
In this paper, we propose to reformulate the recommendation task within the causal inference framework to handle the data scarcity problem.
(2021-09-02T14:28:46Z)
- Learning the Truth From Only One Side of the Story [58.65439277460011]
We focus on generalized linear models and show that without adjusting for this sampling bias, the model may converge suboptimally or even fail to converge to the optimal solution.
We propose an adaptive approach that comes with theoretical guarantees and show that it outperforms several existing methods empirically.
(2020-06-08T18:20:28Z)