StableDR: Stabilized Doubly Robust Learning for Recommendation on Data
Missing Not at Random
- URL: http://arxiv.org/abs/2205.04701v3
- Date: Wed, 23 Aug 2023 18:42:36 GMT
- Title: StableDR: Stabilized Doubly Robust Learning for Recommendation on Data
Missing Not at Random
- Authors: Haoxuan Li, Chunyuan Zheng, Peng Wu
- Abstract summary: We show that the doubly robust (DR) methods are unstable and have unbounded bias, variance, and generalization bounds to extremely small propensities.
We propose a doubly robust (StableDR) learning approach with a weaker reliance on extrapolation.
In addition, we propose a novel learning approach for StableDR that updates the imputation, propensity, and prediction models cyclically.
- Score: 16.700598755439685
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: In recommender systems, users always choose the favorite items to rate, which
leads to data missing not at random and poses a great challenge for unbiased
evaluation and learning of prediction models. Currently, the doubly robust (DR)
methods have been widely studied and demonstrate superior performance. However,
in this paper, we show that DR methods are unstable and have unbounded bias,
variance, and generalization bounds to extremely small propensities. Moreover,
the fact that DR relies more on extrapolation will lead to suboptimal
performance. To address the above limitations while retaining double
robustness, we propose a stabilized doubly robust (StableDR) learning approach
with a weaker reliance on extrapolation. Theoretical analysis shows that
StableDR has bounded bias, variance, and generalization error bound
simultaneously under inaccurate imputed errors and arbitrarily small
propensities. In addition, we propose a novel learning approach for StableDR
that updates the imputation, propensity, and prediction models cyclically,
achieving more stable and accurate predictions. Extensive experiments show that
our approaches significantly outperform the existing methods.
Related papers
- CDR: Conservative Doubly Robust Learning for Debiased Recommendation [23.90593406172408]
Doubly Robust Learning (DR) has gained significant attention due to its remarkable performance and robust properties.
To address this issue, this work proposes Conservative Doubly Robust strategy (CDR) which filters imputations by scrutinizing their mean and variance.
arXiv Detail & Related papers (2023-08-13T08:10:56Z) - Conservative Prediction via Data-Driven Confidence Minimization [70.93946578046003]
In safety-critical applications of machine learning, it is often desirable for a model to be conservative.
We propose the Data-Driven Confidence Minimization framework, which minimizes confidence on an uncertainty dataset.
arXiv Detail & Related papers (2023-06-08T07:05:36Z) - Toward Reliable Human Pose Forecasting with Uncertainty [51.628234388046195]
We develop an open-source library for human pose forecasting, including multiple models, supporting several datasets.
We devise two types of uncertainty in the problem to increase performance and convey better trust.
arXiv Detail & Related papers (2023-04-13T17:56:08Z) - Reliability-Aware Prediction via Uncertainty Learning for Person Image
Retrieval [51.83967175585896]
UAL aims at providing reliability-aware predictions by considering data uncertainty and model uncertainty simultaneously.
Data uncertainty captures the noise" inherent in the sample, while model uncertainty depicts the model's confidence in the sample's prediction.
arXiv Detail & Related papers (2022-10-24T17:53:20Z) - Rethinking Missing Data: Aleatoric Uncertainty-Aware Recommendation [59.500347564280204]
We propose a new Aleatoric Uncertainty-aware Recommendation (AUR) framework.
AUR consists of a new uncertainty estimator along with a normal recommender model.
As the chance of mislabeling reflects the potential of a pair, AUR makes recommendations according to the uncertainty.
arXiv Detail & Related papers (2022-09-22T04:32:51Z) - Multiple Robust Learning for Recommendation [13.06593469196849]
In recommender systems, a common problem is the presence of various biases in the collected data.
We propose a multiple robust (MR) estimator that can take the advantage of multiple candidate imputation and propensity models to achieve unbiasedness.
arXiv Detail & Related papers (2022-07-09T13:15:56Z) - Doubly Robust Collaborative Targeted Learning for Recommendation on Data
Missing Not at Random [6.563595953273317]
In recommender systems, the feedback data received is always missing not at random (MNAR)
We propose bf DR-TMLE that effectively captures the merits of both error imputation-based (EIB) and doubly robust (DR) methods.
We also propose a novel RCT-free collaborative targeted learning algorithm for DR-TMLE, called bf DR-TMLE-TL
arXiv Detail & Related papers (2022-03-19T06:48:50Z) - Doubly Robust Distributionally Robust Off-Policy Evaluation and Learning [59.02006924867438]
Off-policy evaluation and learning (OPE/L) use offline observational data to make better decisions.
Recent work proposed distributionally robust OPE/L (DROPE/L) to remedy this, but the proposal relies on inverse-propensity weighting.
We propose the first DR algorithms for DROPE/L with KL-divergence uncertainty sets.
arXiv Detail & Related papers (2022-02-19T20:00:44Z) - Enhanced Doubly Robust Learning for Debiasing Post-click Conversion Rate
Estimation [29.27760413892272]
Post-click conversion, as a strong signal indicating the user preference, is salutary for building recommender systems.
Currently, most existing methods utilize counterfactual learning to debias recommender systems.
We propose a novel double learning approach for the MRDR estimator, which can convert the error imputation into the general CVR estimation.
arXiv Detail & Related papers (2021-05-28T06:59:49Z) - Understanding and Mitigating the Tradeoff Between Robustness and
Accuracy [88.51943635427709]
Adversarial training augments the training set with perturbations to improve the robust error.
We show that the standard error could increase even when the augmented perturbations have noiseless observations from the optimal linear predictor.
arXiv Detail & Related papers (2020-02-25T08:03:01Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.