Learning Strategies in Decentralized Matching Markets under Uncertain
Preferences
- URL: http://arxiv.org/abs/2011.00159v2
- Date: Tue, 23 Nov 2021 07:44:14 GMT
- Title: Learning Strategies in Decentralized Matching Markets under Uncertain
Preferences
- Authors: Xiaowu Dai and Michael I. Jordan
- Abstract summary: We study the problem of decision-making in the setting of a scarcity of shared resources when the preferences of agents are unknown a priori.
Our approach is based on the representation of preferences in a reproducing kernel Hilbert space.
We derive optimal strategies that maximize agents' expected payoffs.
- Score: 91.3755431537592
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: We study the problem of decision-making in the setting of a scarcity of
shared resources when the preferences of agents are unknown a priori and must
be learned from data. Taking the two-sided matching market as a running
example, we focus on the decentralized setting, where agents do not share their
learned preferences with a central authority. Our approach is based on the
representation of preferences in a reproducing kernel Hilbert space, and a
learning algorithm for preferences that accounts for uncertainty due to the
competition among the agents in the market. Under regularity conditions, we
show that our estimator of preferences converges at a minimax optimal rate.
Given this result, we derive optimal strategies that maximize agents' expected
payoffs and we calibrate the uncertain state by taking opportunity costs into
account. We also derive an incentive-compatibility property and show that the
outcome from the learned strategies has a stability property. Finally, we prove
a fairness property that asserts that there exists no justified envy according
to the learned strategies.
Related papers
- Putting Gale & Shapley to Work: Guaranteeing Stability Through Learning [14.448192914855674]
Two-sided matching markets describe a class of problems wherein participants from one side of the market must be matched to those from the other side according to their preferences.
We exploit the structure of stable solutions to devise algorithms that improve the likelihood of finding stable solutions.
arXiv Detail & Related papers (2024-10-06T06:47:53Z) - Causal Strategic Learning with Competitive Selection [10.237954203296187]
We study the problem of agent selection in causal strategic learning under multiple decision makers.
We show that the optimal selection rule is a trade-off between selecting the best agents and providing incentives to maximise the agents' improvement.
We provide a cooperative protocol which all decision makers must collectively adopt to recover the true causal parameters.
arXiv Detail & Related papers (2023-08-30T18:43:11Z) - Proportional Fairness in Obnoxious Facility Location [70.64736616610202]
We propose a hierarchy of distance-based proportional fairness concepts for the problem.
We consider deterministic and randomized mechanisms, and compute tight bounds on the price of proportional fairness.
We prove existence results for two extensions to our model.
arXiv Detail & Related papers (2023-01-11T07:30:35Z) - Decentralized, Communication- and Coordination-free Learning in
Structured Matching Markets [2.9833943723592764]
We study the problem of online learning in competitive settings in the context of two-sided matching markets.
We propose a class of decentralized, communication- and coordination-free algorithms that agents can use to reach to their stable match.
arXiv Detail & Related papers (2022-06-06T04:08:04Z) - Monotonic Improvement Guarantees under Non-stationarity for
Decentralized PPO [66.5384483339413]
We present a new monotonic improvement guarantee for optimizing decentralized policies in cooperative Multi-Agent Reinforcement Learning (MARL)
We show that a trust region constraint can be effectively enforced in a principled way by bounding independent ratios based on the number of agents in training.
arXiv Detail & Related papers (2022-01-31T20:39:48Z) - Learning Equilibria in Matching Markets from Bandit Feedback [139.29934476625488]
We develop a framework and algorithms for learning stable market outcomes under uncertainty.
Our work takes a first step toward elucidating when and how stable matchings arise in large, data-driven marketplaces.
arXiv Detail & Related papers (2021-08-19T17:59:28Z) - Multi-Stage Decentralized Matching Markets: Uncertain Preferences and
Strategic Behaviors [91.3755431537592]
This article develops a framework for learning optimal strategies in real-world matching markets.
We show that there exists a welfare-versus-fairness trade-off that is characterized by the uncertainty level of acceptance.
We prove that participants can be better off with multi-stage matching compared to single-stage matching.
arXiv Detail & Related papers (2021-02-13T19:25:52Z) - Decentralized Reinforcement Learning: Global Decision-Making via Local
Economic Transactions [80.49176924360499]
We establish a framework for directing a society of simple, specialized, self-interested agents to solve sequential decision problems.
We derive a class of decentralized reinforcement learning algorithms.
We demonstrate the potential advantages of a society's inherent modular structure for more efficient transfer learning.
arXiv Detail & Related papers (2020-07-05T16:41:09Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.