Counterfactual Multi-player Bandits for Explainable Recommendation Diversification
- URL: http://arxiv.org/abs/2505.21165v2
- Date: Wed, 11 Jun 2025 13:27:06 GMT
- Title: Counterfactual Multi-player Bandits for Explainable Recommendation Diversification
- Authors: Yansen Zhang, Bowei He, Xiaokun Zhang, Haolun Wu, Zexu Sun, Chen Ma,
- Abstract summary: We propose a textbfCounterfactual textbfMulti-player textbfBandits (CMB) method to deliver explainable recommendation diversification.
- Score: 7.948416784331374
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Existing recommender systems tend to prioritize items closely aligned with users' historical interactions, inevitably trapping users in the dilemma of ``filter bubble''. Recent efforts are dedicated to improving the diversity of recommendations. However, they mainly suffer from two major issues: 1) a lack of explainability, making it difficult for the system designers to understand how diverse recommendations are generated, and 2) limitations to specific metrics, with difficulty in enhancing non-differentiable diversity metrics. To this end, we propose a \textbf{C}ounterfactual \textbf{M}ulti-player \textbf{B}andits (CMB) method to deliver explainable recommendation diversification across a wide range of diversity metrics. Leveraging a counterfactual framework, our method identifies the factors influencing diversity outcomes. Meanwhile, we adopt the multi-player bandits to optimize the counterfactual optimization objective, making it adaptable to both differentiable and non-differentiable diversity metrics. Extensive experiments conducted on three real-world datasets demonstrate the applicability, effectiveness, and explainability of the proposed CMB.
Related papers
- Dual-disentangle Framework for Diversified Sequential Recommendation [16.688375054719767]
We propose a model-agnostic Dual-disangling framework for Diversified Sequential Recommendation (DDSRec)<n>The framework refines user interest and intention modeling by adopting disentangling perspectives in interaction modeling and representation.
arXiv Detail & Related papers (2025-08-05T07:25:56Z) - Bayesian-Guided Diversity in Sequential Sampling for Recommender Systems [1.675857332621569]
We propose a novel framework that leverages a multi-objective, contextual sequential sampling strategy.<n>Item selection is guided by Bayesian updates that dynamically adjust scores to optimize diversity.<n> Experiments on a real-world dataset show that our approach significantly improves diversity without sacrificing relevance.
arXiv Detail & Related papers (2025-06-22T19:36:02Z) - Combinatorial Optimization Perspective based Framework for Multi-behavior Recommendation [23.26102452699347]
We propose a novel multi-behavior recommendation framework based on the optimization perspective, named COPF.<n>In the prediction step, we improve both forward and backward propagation during the generation and aggregation of multiple experts.<n>Experiments on three real-world datasets indicate the superiority of COPF.
arXiv Detail & Related papers (2025-02-04T11:19:47Z) - Online Clustering of Dueling Bandits [59.09590979404303]
We introduce the first "clustering of dueling bandit algorithms" to enable collaborative decision-making based on preference feedback.<n>We propose two novel algorithms: (1) Clustering of Linear Dueling Bandits (COLDB) which models the user reward functions as linear functions of the context vectors, and (2) Clustering of Neural Dueling Bandits (CONDB) which uses a neural network to model complex, non-linear user reward functions.
arXiv Detail & Related papers (2025-02-04T07:55:41Z) - MDAP: A Multi-view Disentangled and Adaptive Preference Learning Framework for Cross-Domain Recommendation [63.27390451208503]
Cross-domain Recommendation systems leverage multi-domain user interactions to improve performance.
We propose the Multi-view Disentangled and Adaptive Preference Learning framework.
Our framework uses a multiview encoder to capture diverse user preferences.
arXiv Detail & Related papers (2024-10-08T10:06:45Z) - DLCRec: A Novel Approach for Managing Diversity in LLM-Based Recommender Systems [9.433227503973077]
We propose a novel framework designed to enable fine-grained control over diversity in LLM-based recommendations.<n>Unlike traditional methods, DLCRec adopts a fine-grained task decomposition strategy, breaking down the recommendation process into three sub-tasks.<n>We introduce two data augmentation techniques that enhance the model's robustness to noisy and out-of-distribution data.
arXiv Detail & Related papers (2024-08-22T15:10:56Z) - Relevance meets Diversity: A User-Centric Framework for Knowledge Exploration through Recommendations [15.143224593682012]
We propose a novel recommendation strategy that combines relevance and diversity by a copula function.
We use diversity as a surrogate of the amount of knowledge obtained by the user while interacting with the system.
Our strategy outperforms several state-of-the-art competitors.
arXiv Detail & Related papers (2024-08-07T13:48:24Z) - A Preference-oriented Diversity Model Based on Mutual-information in Re-ranking for E-commerce Search [11.49911967350851]
This paper proposes a Preference-oriented Diversity Model Based on Mutual-information (PODM-MI)
PODM-MI consider both accuracy and diversity in the re-ranking process.
We have successfully deployed PODM-MI on an e-commerce search platform.
arXiv Detail & Related papers (2024-05-24T13:03:34Z) - Sequential Recommendation with Controllable Diversification: Representation Degeneration and Diversity [59.24517649169952]
We argue that the representation degeneration issue is the root cause of insufficient recommendation diversity in existing SR methods.
We propose a novel Singular sPectrum sMoothing regularization for Recommendation (SPMRec), which acts as a controllable surrogate to alleviate the degeneration.
arXiv Detail & Related papers (2023-06-21T02:42:37Z) - A Unified Algorithm Framework for Unsupervised Discovery of Skills based
on Determinantal Point Process [53.86223883060367]
We show that diversity and coverage in unsupervised option discovery can indeed be unified under the same mathematical framework.
Our proposed algorithm, ODPP, has undergone extensive evaluation on challenging tasks created with Mujoco and Atari.
arXiv Detail & Related papers (2022-12-01T01:40:03Z) - Choosing the Best of Both Worlds: Diverse and Novel Recommendations
through Multi-Objective Reinforcement Learning [68.45370492516531]
We introduce Scalarized Multi-Objective Reinforcement Learning (SMORL) for the Recommender Systems (RS) setting.
SMORL agent augments standard recommendation models with additional RL layers that enforce it to simultaneously satisfy three principal objectives: accuracy, diversity, and novelty of recommendations.
Our experimental results on two real-world datasets reveal a substantial increase in aggregate diversity, a moderate increase in accuracy, reduced repetitiveness of recommendations, and demonstrate the importance of reinforcing diversity and novelty as complementary objectives.
arXiv Detail & Related papers (2021-10-28T13:22:45Z) - Simultaneous Relevance and Diversity: A New Recommendation Inference
Approach [81.44167398308979]
We propose a new approach, which extends the general collaborative filtering (CF) by introducing a new way of CF inference, negative-to-positive.
Our approach is applicable to a wide range of recommendation scenarios/use-cases at various sophistication levels.
Our analysis and experiments on public datasets and real-world production data show that our approach outperforms existing methods on relevance and diversity simultaneously.
arXiv Detail & Related papers (2020-09-27T22:20:12Z) - Sequential Recommendation with Self-Attentive Multi-Adversarial Network [101.25533520688654]
We present a Multi-Factor Generative Adversarial Network (MFGAN) for explicitly modeling the effect of context information on sequential recommendation.
Our framework is flexible to incorporate multiple kinds of factor information, and is able to trace how each factor contributes to the recommendation decision over time.
arXiv Detail & Related papers (2020-05-21T12:28:59Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.