Entire-Space Variational Information Exploitation for Post-Click Conversion Rate Prediction
- URL: http://arxiv.org/abs/2502.15687v1
- Date: Tue, 17 Dec 2024 09:37:01 GMT
- Title: Entire-Space Variational Information Exploitation for Post-Click Conversion Rate Prediction
- Authors: Ke Fei, Xinyue Zhang, Jingjing Li,
- Abstract summary: We propose an entire-space variational information exploitation framework (EVI) for CVR prediction.<n>First, EVI uses a conditional entire-space CVR teacher to generate unbiased pseudo labels.<n>Then, it applies variational information exploitation and logit distillation to transfer non-click space information to the target CVR estimator.
- Score: 11.675652028256428
- License: http://creativecommons.org/licenses/by-nc-nd/4.0/
- Abstract: In recommender systems, post-click conversion rate (CVR) estimation is an essential task to model user preferences for items and estimate the value of recommendations. Sample selection bias (SSB) and data sparsity (DS) are two persistent challenges for post-click conversion rate (CVR) estimation. Currently, entire-space approaches that exploit unclicked samples through knowledge distillation are promising to mitigate SSB and DS simultaneously. Existing methods use non-conversion, conversion, or adaptive conversion predictors to generate pseudo labels for unclicked samples. However, they fail to consider the unbiasedness and information limitations of these pseudo labels. Motivated by such analysis, we propose an entire-space variational information exploitation framework (EVI) for CVR prediction. First, EVI uses a conditional entire-space CVR teacher to generate unbiased pseudo labels. Then, it applies variational information exploitation and logit distillation to transfer non-click space information to the target CVR estimator. We conduct extensive offline experiments on six large-scale datasets. EVI demonstrated a 2.25\% average improvement compared to the state-of-the-art baselines.
Related papers
- ChorusCVR: Chorus Supervision for Entire Space Post-Click Conversion Rate Modeling [31.709370437858322]
Post-click conversion rate estimation is a vital task in recommender systems of revenue businesses.<n>For lack of post-event labels for un-clicked samples, CVR learning task commonly only utilizes clicked samples.<n>We propose a novel ChorusCVR model to realize debiased CVR learning in entire-space.
arXiv Detail & Related papers (2025-02-12T10:31:45Z) - EGEAN: An Exposure-Guided Embedding Alignment Network for Post-Click Conversion Estimation [6.178133899988549]
Post-click conversion rate (CVR) estimation is crucial for online advertising systems.<n>Despite advances in causal approaches, CVR estimation still faces challenges due to Covariate Shift.<n>This study proposes an Exposure-Guided Embedding Alignment Network (EGEAN) to address this problem.
arXiv Detail & Related papers (2024-12-08T10:17:02Z) - Improved Anomaly Detection through Conditional Latent Space VAE Ensembles [49.1574468325115]
Conditional Latent space Variational Autoencoder (CL-VAE) improved pre-processing for anomaly detection on data with known inlier classes and unknown outlier classes.
Model shows increased accuracy in anomaly detection, achieving an AUC of 97.4% on the MNIST dataset.
In addition, the CL-VAE shows increased benefits from ensembling, a more interpretable latent space, and an increased ability to learn patterns in complex data with limited model sizes.
arXiv Detail & Related papers (2024-10-16T07:48:53Z) - Data-driven Conditional Instrumental Variables for Debiasing Recommender Systems [25.632817469744325]
In recommender systems, latent variables can cause user-item interaction data to deviate from true user preferences.
We propose a novel data-driven conditional IV (CIV) debiasing method for recommender systems, called CIV4Rec.4Rec.
arXiv Detail & Related papers (2024-08-19T02:17:22Z) - Downstream-Pretext Domain Knowledge Traceback for Active Learning [138.02530777915362]
We propose a downstream-pretext domain knowledge traceback (DOKT) method that traces the data interactions of downstream knowledge and pre-training guidance.
DOKT consists of a traceback diversity indicator and a domain-based uncertainty estimator.
Experiments conducted on ten datasets show that our model outperforms other state-of-the-art methods.
arXiv Detail & Related papers (2024-07-20T01:34:13Z) - RAT: Retrieval-Augmented Transformer for Click-Through Rate Prediction [68.34355552090103]
This paper develops a Retrieval-Augmented Transformer (RAT), aiming to acquire fine-grained feature interactions within and across samples.
We then build Transformer layers with cascaded attention to capture both intra- and cross-sample feature interactions.
Experiments on real-world datasets substantiate the effectiveness of RAT and suggest its advantage in long-tail scenarios.
arXiv Detail & Related papers (2024-04-02T19:14:23Z) - Contrastive Learning for Conversion Rate Prediction [6.607531486024888]
We propose Contrastive Learning for CVR prediction (CL4CVR) framework.
It associates the supervised CVR prediction task with a contrastive learning task, which can learn better data representations.
Experimental results on two real-world conversion datasets demonstrate the superior performance of CL4CVR.
arXiv Detail & Related papers (2023-07-12T07:42:52Z) - Entire Space Counterfactual Learning: Tuning, Analytical Properties and
Industrial Applications [5.9460659646670875]
Post-click conversion rate (CVR) estimation has long been plagued by sample selection bias and data sparsity issues.
This paper proposes a principled method named entire space counterfactual multi-task model (ESCM$2$), which employs a counterfactual risk minimizer to handle both IEB and PIP issues at once.
arXiv Detail & Related papers (2022-10-20T06:19:50Z) - Debiasing Learning for Membership Inference Attacks Against Recommender
Systems [79.48353547307887]
Learned recommender systems may inadvertently leak information about their training data, leading to privacy violations.
We investigate privacy threats faced by recommender systems through the lens of membership inference.
We propose a Debiasing Learning for Membership Inference Attacks against recommender systems (DL-MIA) framework that has four main components.
arXiv Detail & Related papers (2022-06-24T17:57:34Z) - ADT-SSL: Adaptive Dual-Threshold for Semi-Supervised Learning [68.53717108812297]
Semi-Supervised Learning (SSL) has advanced classification tasks by inputting both labeled and unlabeled data to train a model jointly.
This paper proposes an Adaptive Dual-Threshold method for Semi-Supervised Learning (ADT-SSL)
Experimental results show that the proposed ADT-SSL achieves state-of-the-art classification accuracy.
arXiv Detail & Related papers (2022-05-21T11:52:08Z) - Evaluating Prediction-Time Batch Normalization for Robustness under
Covariate Shift [81.74795324629712]
We call prediction-time batch normalization, which significantly improves model accuracy and calibration under covariate shift.
We show that prediction-time batch normalization provides complementary benefits to existing state-of-the-art approaches for improving robustness.
The method has mixed results when used alongside pre-training, and does not seem to perform as well under more natural types of dataset shift.
arXiv Detail & Related papers (2020-06-19T05:08:43Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.