A Regret Analysis of Bilateral Trade
- URL: http://arxiv.org/abs/2102.08754v1
- Date: Tue, 16 Feb 2021 08:53:17 GMT
- Title: A Regret Analysis of Bilateral Trade
- Authors: Nicol\`o Cesa-Bianchi, Tommaso Cesari (TSE), Roberto Colomboni (IIT),
Federico Fusco, Stefano Leonardi
- Abstract summary: We cast for the first time the bilateral trade problem in a regret minimization framework over rounds of seller/buyer interactions.
Our main contribution is a complete characterization of the regret regimes for fixedprice mechanisms with different models of feedback and private valuations.
- Score: 5.031063690574698
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Bilateral trade, a fundamental topic in economics, models the problem of
intermediating between two strategic agents, a seller and a buyer, willing to
trade a good for which they hold private valuations. Despite the simplicity of
this problem, a classical result by Myerson and Satterthwaite (1983) affirms
the impossibility of designing a mechanism which is simultaneously efficient,
incentive compatible, individually rational, and budget balanced. This
impossibility result fostered an intense investigation of meaningful trade-offs
between these desired properties. Much work has focused on approximately
efficient fixed-price mechanisms, i.e., Blumrosen and Dobzinski (2014; 2016),
Colini-Baldeschi et al. (2016), which have been shown to fully characterize
strong budget balanced and ex-post individually rational direct revelation
mechanisms. All these results, however, either assume some knowledge on the
priors of the seller/buyer valuations, or a black box access to some samples of
the distributions, as in D{\"u}tting et al. (2021). In this paper, we cast for
the first time the bilateral trade problem in a regret minimization framework
over rounds of seller/buyer interactions, with no prior knowledge on the
private seller/buyer valuations. Our main contribution is a complete
characterization of the regret regimes for fixed-price mechanisms with
different models of feedback and private valuations, using as benchmark the
best fixed price in hindsight. More precisely, we prove the following bounds on
the regret:
$\bullet$ $\widetilde{\Theta}(\sqrt{T})$ for full-feedback (i.e., direct
revelation mechanisms);
$\bullet$ $\widetilde{\Theta}(T^{2/3})$ for realistic feedback (i.e.,
posted-price mechanisms) and independent seller/buyer valuations with bounded
densities;
$\bullet$ $\Theta(T)$ for realistic feedback and seller/buyer valuations with
bounded densities;
$\bullet$ $\Theta(T)$ for realistic feedback and independent seller/buyer
valuations;
$\bullet$ $\Theta(T)$ for the adversarial setting.
Related papers
- Mind the Gap: A Causal Perspective on Bias Amplification in Prediction & Decision-Making [58.06306331390586]
We introduce the notion of a margin complement, which measures how much a prediction score $S$ changes due to a thresholding operation.
We show that under suitable causal assumptions, the influences of $X$ on the prediction score $S$ are equal to the influences of $X$ on the true outcome $Y$.
arXiv Detail & Related papers (2024-05-24T11:22:19Z) - Fair Online Bilateral Trade [20.243000364933472]
We present a complete characterization of the regret for fair gain from trade when, after each interaction, the platform only learns whether each trader accepted the current price.
We conclude by providing tight regret bounds when, after each interaction, the platform is allowed to observe the true traders' valuations.
arXiv Detail & Related papers (2024-05-22T18:49:11Z) - Nearly Optimal Algorithms for Contextual Dueling Bandits from Adversarial Feedback [58.66941279460248]
Learning from human feedback plays an important role in aligning generative models, such as large language models (LLM)
We study a model within this problem domain--contextual dueling bandits with adversarial feedback, where the true preference label can be flipped by an adversary.
We propose an algorithm namely robust contextual dueling bandit (algo), which is based on uncertainty-weighted maximum likelihood estimation.
arXiv Detail & Related papers (2024-04-16T17:59:55Z) - No-Regret Learning in Bilateral Trade via Global Budget Balance [29.514323697659613]
We provide the first no-regret algorithms for adversarial bilateral trade under various feedback models.
We show that in the full-feedback model, the learner can guarantee $tilde O(sqrtT)$ regret against the best fixed prices in hindsight.
We also provide a learning algorithm guaranteeing a $tilde O(T3/4)$ regret upper bound with one-bit feedback.
arXiv Detail & Related papers (2023-10-18T22:34:32Z) - Repeated Bilateral Trade Against a Smoothed Adversary [5.939280057673226]
We study repeated bilateral trade where an adaptive $sigma$-smooth adversary generates the valuations of sellers and buyers.
We provide a complete characterization of the regret regimes for fixed-price mechanisms under different feedback models.
arXiv Detail & Related papers (2023-02-21T16:30:10Z) - A Reinforcement Learning Approach in Multi-Phase Second-Price Auction
Design [158.0041488194202]
We study reserve price optimization in multi-phase second price auctions.
From the seller's perspective, we need to efficiently explore the environment in the presence of potentially nontruthful bidders.
Third, the seller's per-step revenue is unknown, nonlinear, and cannot even be directly observed from the environment.
arXiv Detail & Related papers (2022-10-19T03:49:05Z) - An $α$-regret analysis of Adversarial Bilateral Trade [10.275531964940425]
We study sequential bilateral trade where sellers and buyers valuations are completely arbitrary.
We consider gain from trade which is harder to approximate than social welfare.
arXiv Detail & Related papers (2022-10-13T08:57:30Z) - Near-Optimal $\Phi$-Regret Learning in Extensive-Form Games [85.78272987312343]
We establish efficient and uncoupled learning dynamics so that the trigger regret of each player grows as $O(log T)$ after $T$ repetitions of play.
This improves exponentially over the prior best known trigger-regret bound of $O(T1/4)$.
arXiv Detail & Related papers (2022-08-20T20:48:58Z) - Bilateral Trade: A Regret Minimization Perspective [5.031063690574698]
We cast the bilateral trade problem in a regret minimization framework over $T$ rounds of seller/buyer interactions.
Our main contribution is a complete characterization of the regret regimes for fixed-price mechanisms with different feedback models and private valuations.
arXiv Detail & Related papers (2021-09-08T22:11:48Z) - Certifying Strategyproof Auction Networks [53.37051312298459]
We focus on the RegretNet architecture, which can represent auctions with arbitrary numbers of items and participants.
We propose ways to explicitly verify strategyproofness under a particular valuation profile using techniques from the neural network verification literature.
arXiv Detail & Related papers (2020-06-15T20:22:48Z) - VCG Mechanism Design with Unknown Agent Values under Stochastic Bandit
Feedback [104.06766271716774]
We study a multi-round welfare-maximising mechanism design problem in instances where agents do not know their values.
We first define three notions of regret for the welfare, the individual utilities of each agent and that of the mechanism.
Our framework also provides flexibility to control the pricing scheme so as to trade-off between the agent and seller regrets.
arXiv Detail & Related papers (2020-04-19T18:00:58Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.