Impression Allocation and Policy Search in Display Advertising
- URL: http://arxiv.org/abs/2203.07073v1
- Date: Fri, 11 Mar 2022 08:55:13 GMT
- Title: Impression Allocation and Policy Search in Display Advertising
- Authors: Di Wu and Cheng Chen and Xiujun Chen and Junwei Pan and Xun Yang and
Qing Tan and Jian Xu and Kuang-Chih Lee
- Abstract summary: In online display advertising, guaranteed contracts and real-time bidding (RTB) are two major ways to sell impressions for a publisher.
We formulate impression allocation as an auction problem where each guaranteed contract submits virtual bids for individual impressions.
We propose a multi-agent reinforcement learning method to adjust the bids from each guaranteed contract, which is simple, converging efficiently and scalable.
- Score: 20.665879360586448
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: In online display advertising, guaranteed contracts and real-time bidding
(RTB) are two major ways to sell impressions for a publisher. For large
publishers, simultaneously selling impressions through both guaranteed
contracts and in-house RTB has become a popular choice. Generally speaking, a
publisher needs to derive an impression allocation strategy between guaranteed
contracts and RTB to maximize its overall outcome (e.g., revenue and/or
impression quality). However, deriving the optimal strategy is not a trivial
task, e.g., the strategy should encourage incentive compatibility in RTB and
tackle common challenges in real-world applications such as unstable traffic
patterns (e.g., impression volume and bid landscape changing). In this paper,
we formulate impression allocation as an auction problem where each guaranteed
contract submits virtual bids for individual impressions. With this
formulation, we derive the optimal bidding functions for the guaranteed
contracts, which result in the optimal impression allocation. In order to
address the unstable traffic pattern challenge and achieve the optimal overall
outcome, we propose a multi-agent reinforcement learning method to adjust the
bids from each guaranteed contract, which is simple, converging efficiently and
scalable. The experiments conducted on real-world datasets demonstrate the
effectiveness of our method.
Related papers
- Learning Multi-Agent Intention-Aware Communication for Optimal
Multi-Order Execution in Finance [96.73189436721465]
We first present a multi-agent RL (MARL) method for multi-order execution considering practical constraints.
We propose a learnable multi-round communication protocol, for the agents communicating the intended actions with each other.
Experiments on the data from two real-world markets have illustrated superior performance with significantly better collaboration effectiveness.
arXiv Detail & Related papers (2023-07-06T16:45:40Z) - Bidding Strategies for Proportional Representation in Advertisement
Campaigns [8.269283912626873]
We show that equitable bidding may not result in equitable outcomes due to heterogeneous levels of competition for different types of individuals.
We consider alterations that make no change to the platform mechanism and instead change the bidding strategies used by advertisers.
arXiv Detail & Related papers (2023-05-22T23:29:05Z) - Functional Optimization Reinforcement Learning for Real-Time Bidding [14.5826735379053]
Real-time bidding is the new paradigm of programmatic advertising.
Existing approaches are struggling to provide a satisfactory solution for bidding optimization.
This paper proposes a multi-agent reinforcement learning architecture for RTB with functional optimization.
arXiv Detail & Related papers (2022-06-25T06:12:17Z) - VFed-SSD: Towards Practical Vertical Federated Advertising [53.08038962443853]
We propose a semi-supervised split distillation framework VFed-SSD to alleviate the two limitations.
Specifically, we develop a self-supervised task MatchedPair Detection (MPD) to exploit the vertically partitioned unlabeled data.
Our framework provides an efficient federation-enhanced solution for real-time display advertising with minimal deploying cost and significant performance lift.
arXiv Detail & Related papers (2022-05-31T17:45:30Z) - A Unified Framework for Campaign Performance Forecasting in Online
Display Advertising [9.005665883444902]
Interpretable and accurate results could enable advertisers to manage and optimize their campaign criteria.
New framework reproduces campaign performance on historical logs under various bidding types with a unified replay algorithm.
Method captures mixture calibration patterns among related forecast indicators to map the estimated results to the true ones.
arXiv Detail & Related papers (2022-02-24T03:04:29Z) - Sequential Information Design: Markov Persuasion Process and Its
Efficient Reinforcement Learning [156.5667417159582]
This paper proposes a novel model of sequential information design, namely the Markov persuasion processes (MPPs)
Planning in MPPs faces the unique challenge in finding a signaling policy that is simultaneously persuasive to the myopic receivers and inducing the optimal long-term cumulative utilities of the sender.
We design a provably efficient no-regret learning algorithm, the Optimism-Pessimism Principle for Persuasion Process (OP4), which features a novel combination of both optimism and pessimism principles.
arXiv Detail & Related papers (2022-02-22T05:41:43Z) - Bid Optimization using Maximum Entropy Reinforcement Learning [0.3149883354098941]
This paper focuses on optimizing a single advertiser's bidding strategy using reinforcement learning (RL) in real-time bidding (RTB)
We first utilize a widely accepted linear bidding function to compute every impression's base price and optimize it by a mutable adjustment factor derived from the RTB auction environment.
Finally, the empirical study on a public dataset demonstrates that the proposed bidding strategy has superior performance compared with the baselines.
arXiv Detail & Related papers (2021-10-11T06:53:53Z) - A Cooperative-Competitive Multi-Agent Framework for Auto-bidding in
Online Advertising [53.636153252400945]
We propose a general Multi-Agent reinforcement learning framework for Auto-Bidding, namely MAAB, to learn the auto-bidding strategies.
Our approach outperforms several baseline methods in terms of social welfare and guarantees the ad platform's revenue.
arXiv Detail & Related papers (2021-06-11T08:07:14Z) - Dynamic Knapsack Optimization Towards Efficient Multi-Channel Sequential
Advertising [52.3825928886714]
We formulate the sequential advertising strategy optimization as a dynamic knapsack problem.
We propose a theoretically guaranteed bilevel optimization framework, which significantly reduces the solution space of the original optimization space.
To improve the exploration efficiency of reinforcement learning, we also devise an effective action space reduction approach.
arXiv Detail & Related papers (2020-06-29T18:50:35Z) - Optimal Bidding Strategy without Exploration in Real-time Bidding [14.035270361462576]
maximizing utility with a budget constraint is the primary goal for advertisers in real-time bidding (RTB) systems.
Previous works ignore the losing auctions to alleviate the difficulty with censored states.
We propose a novel practical framework using the maximum entropy principle to imitate the behavior of the true distribution observed in real-time traffic.
arXiv Detail & Related papers (2020-03-31T20:43:28Z) - Online Joint Bid/Daily Budget Optimization of Internet Advertising
Campaigns [115.96295568115251]
We study the problem of automating the online joint bid/daily budget optimization of pay-per-click advertising campaigns over multiple channels.
For every campaign, we capture the dependency of the number of clicks on the bid and daily budget by Gaussian Processes.
We design four algorithms and show that they suffer from a regret that is upper bounded with high probability as O(sqrtT)
We present the results of the adoption of our algorithms in a real-world application with a daily average spent of 1,000 Euros for more than one year.
arXiv Detail & Related papers (2020-03-03T11:07:38Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.