Pairwise Ranking Losses of Click-Through Rates Prediction for Welfare
Maximization in Ad Auctions
- URL: http://arxiv.org/abs/2306.01799v1
- Date: Thu, 1 Jun 2023 15:42:50 GMT
- Title: Pairwise Ranking Losses of Click-Through Rates Prediction for Welfare
Maximization in Ad Auctions
- Authors: Boxiang Lyu, Zhe Feng, Zachary Robertson, Sanmi Koyejo
- Abstract summary: We study the design of loss functions for click-through rates (CTR) to optimize (social) welfare in advertising auctions.
We propose a novel weighted rankloss to train the CTR model.
We demonstrate the advantages of the proposed loss on synthetic and real-world data.
- Score: 12.67468905841272
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: We study the design of loss functions for click-through rates (CTR) to
optimize (social) welfare in advertising auctions. Existing works either only
focus on CTR predictions without consideration of business objectives (e.g.,
welfare) in auctions or assume that the distribution over the participants'
expected cost-per-impression (eCPM) is known a priori, then use various
additional assumptions on the parametric form of the distribution to derive
loss functions for predicting CTRs. In this work, we bring back the welfare
objectives of ad auctions into CTR predictions and propose a novel weighted
rankloss to train the CTR model. Compared to existing literature, our approach
provides a provable guarantee on welfare but without assumptions on the eCPMs'
distribution while also avoiding the intractability of naively applying
existing learning-to-rank methods. Further, we propose a theoretically
justifiable technique for calibrating the losses using labels generated from a
teacher network, only assuming that the teacher network has bounded $\ell_2$
generalization error. Finally, we demonstrate the advantages of the proposed
loss on synthetic and real-world data.
Related papers
- Rejection via Learning Density Ratios [50.91522897152437]
Classification with rejection emerges as a learning paradigm which allows models to abstain from making predictions.
We propose a different distributional perspective, where we seek to find an idealized data distribution which maximizes a pretrained model's performance.
Our framework is tested empirically over clean and noisy datasets.
arXiv Detail & Related papers (2024-05-29T01:32:17Z) - Conformal Online Auction Design [6.265829744417118]
COAD incorporates both the bidder and item features to provide an incentive-compatible mechanism for online auctions.
It employs a distribution-free, prediction interval-based approach using conformal prediction techniques.
COAD admits the use of a broad array of modern machine-learning methods, including random forests, kernel methods, and deep neural nets.
arXiv Detail & Related papers (2024-05-11T15:28:25Z) - Adversarial Reweighting with $α$-Power Maximization for Domain Adaptation [56.859005008344276]
We propose a novel approach, dubbed Adversarial Reweighting with $alpha$-Power Maximization (ARPM)
In ARPM, we propose a novel adversarial reweighting model that adversarially learns to reweight source domain data to identify source-private class samples.
We show that our method is superior to recent PDA methods.
arXiv Detail & Related papers (2024-04-26T09:29:55Z) - Ranking with Abstention [27.3569897539488]
We introduce a novel framework of ranking with abstention, where the learner can abstain from making prediction at some limited cost $c$.
We present a series of $H$-consistency bounds for both the family of linear functions and that of neural networks with one hidden-layer.
arXiv Detail & Related papers (2023-07-05T05:37:13Z) - Inference-time Stochastic Ranking with Risk Control [19.20938164194589]
Learning to Rank methods are vital in online economies, affecting users and item providers.
We propose a novel method that performs ranking at inference time with guanranteed utility or fairness given pretrained scoring functions.
arXiv Detail & Related papers (2023-06-12T15:44:58Z) - Benefits of Permutation-Equivariance in Auction Mechanisms [90.42990121652956]
An auction mechanism that maximizes the auctioneer's revenue while minimizes bidders' ex-post regret is an important yet intricate problem in economics.
Remarkable progress has been achieved through learning the optimal auction mechanism by neural networks.
arXiv Detail & Related papers (2022-10-11T16:13:25Z) - Rethinking Position Bias Modeling with Knowledge Distillation for CTR
Prediction [8.414183573280779]
This work proposes a knowledge distillation framework to alleviate the impact of position bias and leverage position information to improve CTR prediction.
The proposed method has been deployed in the real world online ads systems, serving main traffic on one of the world's largest e-commercial platforms.
arXiv Detail & Related papers (2022-04-01T07:58:38Z) - Leveraging Unlabeled Data to Predict Out-of-Distribution Performance [63.740181251997306]
Real-world machine learning deployments are characterized by mismatches between the source (training) and target (test) distributions.
In this work, we investigate methods for predicting the target domain accuracy using only labeled source data and unlabeled target data.
We propose Average Thresholded Confidence (ATC), a practical method that learns a threshold on the model's confidence, predicting accuracy as the fraction of unlabeled examples.
arXiv Detail & Related papers (2022-01-11T23:01:12Z) - Arbitrary Distribution Modeling with Censorship in Real-Time Bidding
Advertising [2.562910030418378]
The purpose of Inventory Pricing is to bid the right prices to online ad opportunities, which is crucial for a Demand-Side Platform (DSP) to win auctions in Real-Time Bidding (RTB)
Most of the previous works made strong assumptions on the distribution form of the winning price, which reduced their accuracy and weakened their ability to make generalizations.
We propose a novel loss function, Neighborhood Likelihood Loss (NLL), collaborating with a proposed framework, Arbitrary Distribution Modeling (ADM) to predict the winning price distribution under censorship.
arXiv Detail & Related papers (2021-10-26T11:40:00Z) - Off-policy Reinforcement Learning with Optimistic Exploration and
Distribution Correction [73.77593805292194]
We train a separate exploration policy to maximize an approximate upper confidence bound of the critics in an off-policy actor-critic framework.
To mitigate the off-policy-ness, we adapt the recently introduced DICE framework to learn a distribution correction ratio for off-policy actor-critic training.
arXiv Detail & Related papers (2021-10-22T22:07:51Z) - Online Learning of Competitive Equilibria in Exchange Economies [94.24357018178867]
In economics, the sharing of scarce resources among multiple rational agents is a classical problem.
We propose an online learning mechanism to learn agent preferences.
We demonstrate the effectiveness of this mechanism through numerical simulations.
arXiv Detail & Related papers (2021-06-11T21:32:17Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.