Delayed Feedback Modeling for the Entire Space Conversion Rate
Prediction
- URL: http://arxiv.org/abs/2011.11826v1
- Date: Tue, 24 Nov 2020 01:14:03 GMT
- Title: Delayed Feedback Modeling for the Entire Space Conversion Rate
Prediction
- Authors: Yanshi Wang, Jie Zhang, Qing Da, Anxiang Zeng
- Abstract summary: Estimating post-click conversion rate (CVR) accurately is crucial in E-commerce.
We propose a novel neural network framework ESDF to tackle the above three challenges simultaneously.
- Score: 15.579755993971657
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Estimating post-click conversion rate (CVR) accurately is crucial in
E-commerce. However, CVR prediction usually suffers from three major challenges
in practice: i) data sparsity: compared with impressions, conversion samples
are often extremely scarce; ii) sample selection bias: conventional CVR models
are trained with clicked impressions while making inference on the entire space
of all impressions; iii) delayed feedback: many conversions can only be
observed after a relatively long and random delay since clicks happened,
resulting in many false negative labels during training. Previous studies
mainly focus on one or two issues while ignoring the others. In this paper, we
propose a novel neural network framework ESDF to tackle the above three
challenges simultaneously. Unlike existing methods, ESDF models the CVR
prediction from a perspective of entire space, and combines the advantage of
user sequential behavior pattern and the time delay factor. Specifically, ESDF
utilizes sequential behavior of user actions on the entire space with all
impressions to alleviate the sample selection bias problem. By sharing the
embedding parameters between CTR and CVR networks, data sparsity problem is
greatly relieved. Different from conventional delayed feedback methods, ESDF
does not make any special assumption about the delay distribution. We
discretize the delay time by day slot and model the probability based on
survival analysis with deep neural network, which is more practical and
suitable for industrial situations. Extensive experiments are conducted to
evaluate the effectiveness of our method. To the best of our knowledge, ESDF is
the first attempt to unitedly solve the above three challenges in CVR
prediction area.
Related papers
- Deep Neural Networks Tend To Extrapolate Predictably [51.303814412294514]
neural network predictions tend to be unpredictable and overconfident when faced with out-of-distribution (OOD) inputs.
We observe that neural network predictions often tend towards a constant value as input data becomes increasingly OOD.
We show how one can leverage our insights in practice to enable risk-sensitive decision-making in the presence of OOD inputs.
arXiv Detail & Related papers (2023-10-02T03:25:32Z) - The effect of data augmentation and 3D-CNN depth on Alzheimer's Disease
detection [51.697248252191265]
This work summarizes and strictly observes best practices regarding data handling, experimental design, and model evaluation.
We focus on Alzheimer's Disease (AD) detection, which serves as a paradigmatic example of challenging problem in healthcare.
Within this framework, we train predictive 15 models, considering three different data augmentation strategies and five distinct 3D CNN architectures.
arXiv Detail & Related papers (2023-09-13T10:40:41Z) - Contrastive Learning for Conversion Rate Prediction [6.607531486024888]
We propose Contrastive Learning for CVR prediction (CL4CVR) framework.
It associates the supervised CVR prediction task with a contrastive learning task, which can learn better data representations.
Experimental results on two real-world conversion datasets demonstrate the superior performance of CL4CVR.
arXiv Detail & Related papers (2023-07-12T07:42:52Z) - Generative Time Series Forecasting with Diffusion, Denoise, and
Disentanglement [51.55157852647306]
Time series forecasting has been a widely explored task of great importance in many applications.
It is common that real-world time series data are recorded in a short time period, which results in a big gap between the deep model and the limited and noisy time series.
We propose to address the time series forecasting problem with generative modeling and propose a bidirectional variational auto-encoder equipped with diffusion, denoise, and disentanglement.
arXiv Detail & Related papers (2023-01-08T12:20:46Z) - Entire Space Counterfactual Learning: Tuning, Analytical Properties and
Industrial Applications [5.9460659646670875]
Post-click conversion rate (CVR) estimation has long been plagued by sample selection bias and data sparsity issues.
This paper proposes a principled method named entire space counterfactual multi-task model (ESCM$2$), which employs a counterfactual risk minimizer to handle both IEB and PIP issues at once.
arXiv Detail & Related papers (2022-10-20T06:19:50Z) - Conversion Rate Prediction via Meta Learning in Small-Scale
Recommendation Scenarios [17.02759665047561]
We propose a novel CVR method named MetaCVR from a perspective of meta learning to address the Data Distribution Fluctuation (DDF) issue.
To the best of our knowledge, this is the first study of CVR prediction targeting the DDF issue in small-scale recommendation scenarios.
arXiv Detail & Related papers (2021-12-27T16:05:42Z) - Regularizing Variational Autoencoder with Diversity and Uncertainty
Awareness [61.827054365139645]
Variational Autoencoder (VAE) approximates the posterior of latent variables based on amortized variational inference.
We propose an alternative model, DU-VAE, for learning a more Diverse and less Uncertain latent space.
arXiv Detail & Related papers (2021-10-24T07:58:13Z) - When Liebig's Barrel Meets Facial Landmark Detection: A Practical Model [87.25037167380522]
We propose a model that is accurate, robust, efficient, generalizable, and end-to-end trainable.
In order to achieve a better accuracy, we propose two lightweight modules.
DQInit dynamically initializes the queries of decoder from the inputs, enabling the model to achieve as good accuracy as the ones with multiple decoder layers.
QAMem is designed to enhance the discriminative ability of queries on low-resolution feature maps by assigning separate memory values to each query rather than a shared one.
arXiv Detail & Related papers (2021-05-27T13:51:42Z) - Robust Neural Networks Outperform Attitude Estimation Filters [0.3867363075280543]
RIANN is a real-time-capable neural network for robust IMU-based attitude estimation.
We exploit two publicly available datasets for the method development and the training.
We add four completely different datasets for evaluation of the trained neural network in three different test scenarios.
arXiv Detail & Related papers (2021-04-15T11:40:25Z) - Capturing Delayed Feedback in Conversion Rate Prediction via
Elapsed-Time Sampling [29.77426549280091]
Conversion rate (CVR) prediction is one of the most critical tasks for digital display advertising.
We propose Elapsed-Time Sampling Delayed Feedback Model (ES-DFM), which models the relationship between the observed conversion distribution and the true conversion distribution.
arXiv Detail & Related papers (2020-12-06T12:20:50Z) - Consistency Guided Scene Flow Estimation [159.24395181068218]
CGSF is a self-supervised framework for the joint reconstruction of 3D scene structure and motion from stereo video.
We show that the proposed model can reliably predict disparity and scene flow in challenging imagery.
It achieves better generalization than the state-of-the-art, and adapts quickly and robustly to unseen domains.
arXiv Detail & Related papers (2020-06-19T17:28:07Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.