Cross DQN: Cross Deep Q Network for Ads Allocation in Feed
- URL: http://arxiv.org/abs/2109.04353v1
- Date: Thu, 9 Sep 2021 15:44:13 GMT
- Title: Cross DQN: Cross Deep Q Network for Ads Allocation in Feed
- Authors: Guogang Liao, Ze Wang, Xiaoxu Wu, Xiaowen Shi, Chuheng Zhang, Yongkang
Wang, Xingxing Wang, Dong Wang
- Abstract summary: E-commerce platforms usually display a mixed list of ads and organic items in feed.
We propose Cross Deep Q Network to extract the arrangement signal by crossing the embeddings of different items.
Our model results in higher revenue and better user experience than state-of-the-art baselines in offline experiments.
- Score: 11.752703912138944
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: E-commerce platforms usually display a mixed list of ads and organic items in
feed. One key problem is to allocate the limited slots in the feed to maximize
the overall revenue as well as improve user experience, which requires a good
model for user preference. Instead of modeling the influence of individual
items on user behaviors, the arrangement signal models the influence of the
arrangement of items and may lead to a better allocation strategy. However,
most of previous strategies fail to model such a signal and therefore result in
suboptimal performance. To this end, we propose Cross Deep Q Network (Cross
DQN) to extract the arrangement signal by crossing the embeddings of different
items and processing the crossed sequence in the feed. Our model results in
higher revenue and better user experience than state-of-the-art baselines in
offline experiments. Moreover, our model demonstrates a significant improvement
in the online A/B test and has been fully deployed on Meituan feed to serve
more than 300 millions of customers.
Related papers
- LESS: Selecting Influential Data for Targeted Instruction Tuning [64.78894228923619]
We propose LESS, an efficient algorithm to estimate data influences and perform Low-rank gradiEnt Similarity Search for instruction data selection.
We show that training on a LESS-selected 5% of the data can often outperform training on the full dataset across diverse downstream tasks.
Our method goes beyond surface form cues to identify data that the necessary reasoning skills for the intended downstream application.
arXiv Detail & Related papers (2024-02-06T19:18:04Z) - UniMatch: A Unified User-Item Matching Framework for the Multi-purpose
Merchant Marketing [27.459774494479227]
We present a unified user-item matching framework to simultaneously conduct item recommendation and user targeting with just one model.
Our framework results in significant performance gains in comparison with the state-of-the-art methods, with greatly reduced cost on computing resources and daily maintenance.
arXiv Detail & Related papers (2023-07-19T13:49:35Z) - Ordinal Graph Gamma Belief Network for Social Recommender Systems [54.9487910312535]
We develop a hierarchical Bayesian model termed ordinal graph factor analysis (OGFA), which jointly models user-item and user-user interactions.
OGFA not only achieves good recommendation performance, but also extracts interpretable latent factors corresponding to representative user preferences.
We extend OGFA to ordinal graph gamma belief network, which is a multi-stochastic-layer deep probabilistic model.
arXiv Detail & Related papers (2022-09-12T09:19:22Z) - Targeted Advertising on Social Networks Using Online Variational Tensor
Regression [19.586412285513962]
We propose what we believe is the first contextual bandit framework for online targeted advertising.
The proposed framework is designed to accommodate any number of feature vectors in the form of multi-mode tensor.
We empirically confirm that the proposedUCB algorithm achieves a significant improvement in influence tasks over the benchmarks.
arXiv Detail & Related papers (2022-08-22T22:10:45Z) - VFed-SSD: Towards Practical Vertical Federated Advertising [53.08038962443853]
We propose a semi-supervised split distillation framework VFed-SSD to alleviate the two limitations.
Specifically, we develop a self-supervised task MatchedPair Detection (MPD) to exploit the vertically partitioned unlabeled data.
Our framework provides an efficient federation-enhanced solution for real-time display advertising with minimal deploying cost and significant performance lift.
arXiv Detail & Related papers (2022-05-31T17:45:30Z) - Preference Enhanced Social Influence Modeling for Network-Aware Cascade
Prediction [59.221668173521884]
We propose a novel framework to promote cascade size prediction by enhancing the user preference modeling.
Our end-to-end method makes the user activating process of information diffusion more adaptive and accurate.
arXiv Detail & Related papers (2022-04-18T09:25:06Z) - Deep Page-Level Interest Network in Reinforcement Learning for Ads
Allocation [14.9065245548275]
We propose Deep Page-level Interest Network (DPIN) to model the page-level user preference and exploit multiple types of feedback.
Specifically, we introduce four different types of page-level feedback as input, and capture user preference for item arrangement under different receptive fields.
arXiv Detail & Related papers (2022-04-01T11:58:00Z) - DCAP: Deep Cross Attentional Product Network for User Response
Prediction [20.17934000984361]
We propose a novel architecture Deep Cross Attentional Product Network (DCAP)
DCAP keeps cross network's benefits in modeling high-order feature interactions explicitly at the vector-wise level.
Our proposed model can be easily implemented and train in parallel.
arXiv Detail & Related papers (2021-05-18T16:27:20Z) - PreSizE: Predicting Size in E-Commerce using Transformers [76.33790223551074]
PreSizE is a novel deep learning framework which utilizes Transformers for accurate size prediction.
We demonstrate that PreSizE is capable of achieving superior prediction performance compared to previous state-of-the-art baselines.
As a proof of concept, we demonstrate that size predictions made by PreSizE can be effectively integrated into an existing production recommender system.
arXiv Detail & Related papers (2021-05-04T15:23:59Z) - Learning Transferrable Parameters for Long-tailed Sequential User
Behavior Modeling [70.64257515361972]
We argue that focusing on tail users could bring more benefits and address the long tails issue.
Specifically, we propose a gradient alignment and adopt an adversarial training scheme to facilitate knowledge transfer from the head to the tail.
arXiv Detail & Related papers (2020-10-22T03:12:02Z) - TPG-DNN: A Method for User Intent Prediction Based on Total Probability
Formula and GRU Loss with Multi-task Learning [36.38658213969406]
We propose a novel user intent prediction model, TPG-DNN, to complete the challenging task.
The proposed model has been widely used for the coupon allocation, advertisement and recommendation on Taobao platform.
arXiv Detail & Related papers (2020-08-05T13:25:53Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.