Learning to Bid Optimally and Efficiently in Adversarial First-price
Auctions
- URL: http://arxiv.org/abs/2007.04568v1
- Date: Thu, 9 Jul 2020 05:40:39 GMT
- Title: Learning to Bid Optimally and Efficiently in Adversarial First-price
Auctions
- Authors: Yanjun Han, Zhengyuan Zhou, Aaron Flores, Erik Ordentlich, Tsachy
Weissman
- Abstract summary: We develop the first minimax optimal online bidding algorithm that achieves an $widetildeO(sqrtT)$ regret.
We test our algorithm on three real-world first-price auction datasets obtained from Verizon Media.
- Score: 40.30925727499806
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: First-price auctions have very recently swept the online advertising
industry, replacing second-price auctions as the predominant auction mechanism
on many platforms. This shift has brought forth important challenges for a
bidder: how should one bid in a first-price auction, where unlike in
second-price auctions, it is no longer optimal to bid one's private value
truthfully and hard to know the others' bidding behaviors? In this paper, we
take an online learning angle and address the fundamental problem of learning
to bid in repeated first-price auctions, where both the bidder's private
valuations and other bidders' bids can be arbitrary. We develop the first
minimax optimal online bidding algorithm that achieves an
$\widetilde{O}(\sqrt{T})$ regret when competing with the set of all Lipschitz
bidding policies, a strong oracle that contains a rich set of bidding
strategies. This novel algorithm is built on the insight that the presence of a
good expert can be leveraged to improve performance, as well as an original
hierarchical expert-chaining structure, both of which could be of independent
interest in online learning. Further, by exploiting the product structure that
exists in the problem, we modify this algorithm--in its vanilla form
statistically optimal but computationally infeasible--to a computationally
efficient and space efficient algorithm that also retains the same
$\widetilde{O}(\sqrt{T})$ minimax optimal regret guarantee. Additionally,
through an impossibility result, we highlight that one is unlikely to compete
this favorably with a stronger oracle (than the considered Lipschitz bidding
policies). Finally, we test our algorithm on three real-world first-price
auction datasets obtained from Verizon Media and demonstrate our algorithm's
superior performance compared to several existing bidding algorithms.
Related papers
- Procurement Auctions via Approximately Optimal Submodular Optimization [53.93943270902349]
We study procurement auctions, where an auctioneer seeks to acquire services from strategic sellers with private costs.
Our goal is to design computationally efficient auctions that maximize the difference between the quality of the acquired services and the total cost of the sellers.
arXiv Detail & Related papers (2024-11-20T18:06:55Z) - Strategically-Robust Learning Algorithms for Bidding in First-Price Auctions [11.988955088595858]
Learning to bid in repeated first-price auctions is a fundamental problem at the interface of game theory and machine learning.
We propose a novel concave formulation for pure-strategy bidding in first-price auctions, and use it to analyze natural Gradient-Ascent-based algorithms for this problem.
arXiv Detail & Related papers (2024-02-12T01:33:33Z) - Autobidders with Budget and ROI Constraints: Efficiency, Regret, and Pacing Dynamics [53.62091043347035]
We study a game between autobidding algorithms that compete in an online advertising platform.
We propose a gradient-based learning algorithm that is guaranteed to satisfy all constraints and achieves vanishing individual regret.
arXiv Detail & Related papers (2023-01-30T21:59:30Z) - A Reinforcement Learning Approach in Multi-Phase Second-Price Auction
Design [158.0041488194202]
We study reserve price optimization in multi-phase second price auctions.
From the seller's perspective, we need to efficiently explore the environment in the presence of potentially nontruthful bidders.
Third, the seller's per-step revenue is unknown, nonlinear, and cannot even be directly observed from the environment.
arXiv Detail & Related papers (2022-10-19T03:49:05Z) - No-regret Learning in Repeated First-Price Auctions with Budget
Constraints [5.834615090865286]
We propose an RL-based bidding algorithm against the optimal non-anticipating strategy under stationary competition.
Our algorithm obtains $widetilde O(sqrt T)$-regret if the bids are all revealed at the end of each round.
arXiv Detail & Related papers (2022-05-29T04:32:05Z) - Fast Rate Learning in Stochastic First Price Bidding [0.0]
First-price auctions have largely replaced traditional bidding approaches based on Vickrey auctions in programmatic advertising.
We show how to achieve significantly lower regret when the opponents' maximal bid distribution is known.
Our algorithms converge much faster than alternatives proposed in the literature for various bid distributions.
arXiv Detail & Related papers (2021-07-05T07:48:52Z) - Efficient Algorithms for Stochastic Repeated Second-price Auctions [0.0]
We develop efficient sequential bidding strategies for repeated auctions.
We provide the first parametric lower bound for this problem.
We propose more explainable strategies which are reminiscent of the Explore Then Commit bandit algorithm.
arXiv Detail & Related papers (2020-11-10T12:45:02Z) - A novel auction system for selecting advertisements in Real-Time bidding [68.8204255655161]
Real-Time Bidding is a new Internet advertising system that has become very popular in recent years.
We propose an alternative betting system with a new approach that not only considers the economic aspect but also other relevant factors for the functioning of the advertising system.
arXiv Detail & Related papers (2020-10-22T18:36:41Z) - ProportionNet: Balancing Fairness and Revenue for Auction Design with
Deep Learning [55.76903822619047]
We study the design of revenue-maximizing auctions with strong incentive guarantees.
We extend techniques for approximating auctions using deep learning to address concerns of fairness while maintaining high revenue and strong incentive guarantees.
arXiv Detail & Related papers (2020-10-13T13:54:21Z) - Optimal No-regret Learning in Repeated First-price Auctions [38.908235632001116]
We study online learning in repeated first-price auctions.
We develop the first learning algorithm that achieves a near-optimal $widetildeO(sqrtT)$ regret bound.
arXiv Detail & Related papers (2020-03-22T03:32:09Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.