On Predicting Post-Click Conversion Rate via Counterfactual Inference
- URL: http://arxiv.org/abs/2510.04816v1
- Date: Mon, 06 Oct 2025 13:57:49 GMT
- Title: On Predicting Post-Click Conversion Rate via Counterfactual Inference
- Authors: Junhyung Ahn, Sanghack Lee,
- Abstract summary: We propose a method to counterfactually generate conversion labels for non-clicked samples.<n>Our approach is named the Entire Space Counterfactual Inference Multi-task Model (ESCIM)
- Score: 10.398909602421018
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Accurately predicting conversion rate (CVR) is essential in various recommendation domains such as online advertising systems and e-commerce. These systems utilize user interaction logs, which consist of exposures, clicks, and conversions. CVR prediction models are typically trained solely based on clicked samples, as conversions can only be determined following clicks. However, the sparsity of clicked instances necessitates the collection of a substantial amount of logs for effective model training. Recent works address this issue by devising frameworks that leverage non-clicked samples. While these frameworks aim to reduce biases caused by the discrepancy between clicked and non-clicked samples, they often rely on heuristics. Against this background, we propose a method to counterfactually generate conversion labels for non-clicked samples by using causality as a guiding principle, attempting to answer the question, "Would the user have converted if he or she had clicked the recommended item?" Our approach is named the Entire Space Counterfactual Inference Multi-task Model (ESCIM). We initially train a structural causal model (SCM) of user sequential behaviors and conduct a hypothetical intervention (i.e., click) on non-clicked items to infer counterfactual CVRs. We then introduce several approaches to transform predicted counterfactual CVRs into binary counterfactual conversion labels for the non-clicked samples. Finally, the generated samples are incorporated into the training process. Extensive experiments on public datasets illustrate the superiority of the proposed algorithm. Online A/B testing further empirically validates the effectiveness of our proposed algorithm in real-world scenarios. In addition, we demonstrate the improved performance of the proposed method on latent conversion data, showcasing its robustness and superior generalization capabilities.
Related papers
- No One Left Behind: How to Exploit the Incomplete and Skewed Multi-Label Data for Conversion Rate Prediction [48.578518946398354]
In most real-world online advertising systems, advertisers typically have diverse customer acquisition goals.<n>A common solution is to use multi-task learning to train a unified model on post-click data to estimate the conversion rate (CVR) for diverse targets.<n>In practice, CVR prediction often encounters missing conversion data as many advertisers submit only a subset of user conversion actions due to privacy or other constraints.
arXiv Detail & Related papers (2025-12-15T13:14:20Z) - ChorusCVR: Chorus Supervision for Entire Space Post-Click Conversion Rate Modeling [31.709370437858322]
Post-click conversion rate estimation is a vital task in recommender systems of revenue businesses.<n>For lack of post-event labels for un-clicked samples, CVR learning task commonly only utilizes clicked samples.<n>We propose a novel ChorusCVR model to realize debiased CVR learning in entire-space.
arXiv Detail & Related papers (2025-02-12T10:31:45Z) - Entire-Space Variational Information Exploitation for Post-Click Conversion Rate Prediction [11.675652028256428]
We propose an entire-space variational information exploitation framework (EVI) for CVR prediction.<n>First, EVI uses a conditional entire-space CVR teacher to generate unbiased pseudo labels.<n>Then, it applies variational information exploitation and logit distillation to transfer non-click space information to the target CVR estimator.
arXiv Detail & Related papers (2024-12-17T09:37:01Z) - EGEAN: An Exposure-Guided Embedding Alignment Network for Post-Click Conversion Estimation [6.178133899988549]
Post-click conversion rate (CVR) estimation is crucial for online advertising systems.<n>Despite advances in causal approaches, CVR estimation still faces challenges due to Covariate Shift.<n>This study proposes an Exposure-Guided Embedding Alignment Network (EGEAN) to address this problem.
arXiv Detail & Related papers (2024-12-08T10:17:02Z) - RAT: Retrieval-Augmented Transformer for Click-Through Rate Prediction [68.34355552090103]
This paper develops a Retrieval-Augmented Transformer (RAT), aiming to acquire fine-grained feature interactions within and across samples.
We then build Transformer layers with cascaded attention to capture both intra- and cross-sample feature interactions.
Experiments on real-world datasets substantiate the effectiveness of RAT and suggest its advantage in long-tail scenarios.
arXiv Detail & Related papers (2024-04-02T19:14:23Z) - DCRMTA: Unbiased Causal Representation for Multi-touch Attribution [0.2417342411475111]
Multi-touch attribution (MTA) currently plays a pivotal role in achieving a fair estimation of the contributions of each advertising to-wards conversion behavior.
Previous works attempted to eliminate the bias caused by user preferences to achieve the unbiased assumption of the conversion model.
This paper re-defines the causal effect of user features on con-versions and proposes a novel end-to-end ap-proach, Deep Causal Representation for MTA.
arXiv Detail & Related papers (2024-01-16T23:16:18Z) - Towards Automatic Sampling of User Behaviors for Sequential Recommender Systems [5.46875780473223]
We propose a novel automatic sampling framework for sequential recommendation, named AutoSAM, to non-uniformly treat historical behaviors.<n>We conduct extensive experiments on benchmark recommendation models and four real-world datasets.
arXiv Detail & Related papers (2023-11-01T09:25:21Z) - An Offline Metric for the Debiasedness of Click Models [52.25681483524383]
Click models are a common method for extracting information from user clicks.<n>Recent work shows that the current evaluation practices in the community fail to guarantee that a well-performing click model generalizes well to downstream tasks.<n>We introduce the concept of debiasedness in click modeling and derive a metric for measuring it.
arXiv Detail & Related papers (2023-04-19T10:59:34Z) - Explaining Cross-Domain Recognition with Interpretable Deep Classifier [100.63114424262234]
Interpretable Deep (IDC) learns the nearest source samples of a target sample as evidence upon which the classifier makes the decision.
Our IDC leads to a more explainable model with almost no accuracy degradation and effectively calibrates classification for optimum reject options.
arXiv Detail & Related papers (2022-11-15T15:58:56Z) - Probabilistic and Variational Recommendation Denoising [56.879165033014026]
Learning from implicit feedback is one of the most common cases in the application of recommender systems.
We propose probabilistic and variational recommendation denoising for implicit feedback.
We employ the proposed DPI and DVAE on four state-of-the-art recommendation models and conduct experiments on three datasets.
arXiv Detail & Related papers (2021-05-20T08:59:44Z) - Learning over no-Preferred and Preferred Sequence of items for Robust
Recommendation [66.8722561224499]
We propose a theoretically founded sequential strategy for training large-scale Recommender Systems (RS) over implicit feedback.
We present two variants of this strategy where model parameters are updated using either the momentum method or a gradient-based approach.
arXiv Detail & Related papers (2020-12-12T22:10:15Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.