Regularized Proportional Fairness Mechanism for Resource Allocation Without Money
- URL: http://arxiv.org/abs/2501.01111v1
- Date: Thu, 02 Jan 2025 07:17:23 GMT
- Title: Regularized Proportional Fairness Mechanism for Resource Allocation Without Money
- Authors: Sihan Zeng, Sujay Bhatt, Alec Koppel, Sumitra Ganesh,
- Abstract summary: We design an innovative neural network architecture tailored to the resource allocation problem, which we name Regularized Proportional Fairness Network (RPF-Net)<n>RPF-Net regularizes the output of the proportional fairness (PF) mechanism by a learned function approximator of the most exploitable allocation.<n>We derive generalization bounds that guarantee the mechanism performance when trained under finite and out-of-distribution samples.
- Score: 18.593061465167363
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Mechanism design in resource allocation studies dividing limited resources among self-interested agents whose satisfaction with the allocation depends on privately held utilities. We consider the problem in a payment-free setting, with the aim of maximizing social welfare while enforcing incentive compatibility (IC), i.e., agents cannot inflate allocations by misreporting their utilities. The well-known proportional fairness (PF) mechanism achieves the maximum possible social welfare but incurs an undesirably high exploitability (the maximum unilateral inflation in utility from misreport and a measure of deviation from IC). In fact, it is known that no mechanism can achieve the maximum social welfare and exact incentive compatibility (IC) simultaneously without the use of monetary incentives (Cole et al., 2013). Motivated by this fact, we propose learning an approximate mechanism that desirably trades off the competing objectives. Our main contribution is to design an innovative neural network architecture tailored to the resource allocation problem, which we name Regularized Proportional Fairness Network (RPF-Net). RPF-Net regularizes the output of the PF mechanism by a learned function approximator of the most exploitable allocation, with the aim of reducing the incentive for any agent to misreport. We derive generalization bounds that guarantee the mechanism performance when trained under finite and out-of-distribution samples and experimentally demonstrate the merits of the proposed mechanism compared to the state-of-the-art.
Related papers
- Toward a Sustainable Federated Learning Ecosystem: A Practical Least Core Mechanism for Payoff Allocation [71.86087908416255]
We introduce a payoff allocation framework based on the least core (LC) concept.<n>Unlike traditional methods, the LC prioritizes the cohesion of the federation by minimizing the maximum dissatisfaction.<n>Case studies in federated intrusion detection demonstrate that our mechanism correctly identifies pivotal contributors and strategic alliances.
arXiv Detail & Related papers (2026-02-03T11:10:50Z) - Credit Fairness: Online Fairness In Shared Resource Pools [3.8888484256987685]
We consider a setting in which a group of agents share resources that must be allocated among them in each discrete time period.<n>Recent work has shown that this max-min mechanism can lead to large disparities in the total resources received by agents.<n>We introduce credit fairness, a strengthening of sharing incentives that ensures agents who lend resources in early rounds are able to recoup them in later rounds.
arXiv Detail & Related papers (2026-01-25T18:44:24Z) - Outcome-Grounded Advantage Reshaping for Fine-Grained Credit Assignment in Mathematical Reasoning [60.00161035836637]
Group Relative Policy Optimization has emerged as a promising critic-free reinforcement learning paradigm for reasoning tasks.<n>We introduce Outcome-grounded Advantage Reshaping (OAR), a fine-grained credit assignment mechanism that redistributes advantages based on how much each token influences the model's final answer.<n>OAR-G achieves comparable gains with negligible computational overhead, both significantly outperforming a strong GRPO baseline.
arXiv Detail & Related papers (2026-01-12T10:48:02Z) - Multi-agent Adaptive Mechanism Design [13.027684227860322]
We introduce Distributionally Robust Adaptive Mechanism (DRAM), a general framework combining insights from both mechanism design and online learning.<n>Our mechanism guarantees truthful reporting with high probability while achieving $tildeO(sqrtT)$ optimal cumulative regret.
arXiv Detail & Related papers (2025-12-25T21:59:51Z) - A General Incentives-Based Framework for Fairness in Multi-agent Resource Allocation [4.930376365020355]
We introduce the General Incentives-based Framework for Fairness (GIFF)<n>GIFF is a novel approach for fair multi-agent resource allocation that infers fair decision-making from standard value functions.
arXiv Detail & Related papers (2025-10-30T17:37:51Z) - An Explainable Equity-Aware P2P Energy Trading Framework for Socio-Economically Diverse Microgrid [0.0]
This paper proposes a novel framework that integrates multi-objective mixed-integer linear programming (MILP), cooperative game theory, and a dynamic equity-adjustment mechanism driven by reinforcement learning (RL)<n>The framework demonstrates peak demand reductions of up to 72.6%, and significant cooperative gains.
arXiv Detail & Related papers (2025-07-24T18:38:51Z) - SEEA-R1: Tree-Structured Reinforcement Fine-Tuning for Self-Evolving Embodied Agents [58.174206358223415]
Self-Evolving Embodied Agents-R1, or SEEA-R1, is the first reinforcement fine-tuning framework designed for self-evolving embodied agents.<n>We show that SEEA-R1 can support autonomous adaptation and reward-driven self-evolution.
arXiv Detail & Related papers (2025-06-26T18:00:07Z) - CARE: Compatibility-Aware Incentive Mechanisms for Federated Learning with Budgeted Requesters [25.69656488516605]
Federated learning (FL) allows requesters to obtain local training models from workers (e.g., clients)
In this paper, we investigate the scenario in FL where multiple budgeted requesters seek training services from incompatible workers with private training costs.
We develop novel compatibility-aware incentive mechanisms, CARE-CO and CARE-NO, for both settings to elicit true private costs and determine workers to hire for requesters and their rewards.
arXiv Detail & Related papers (2025-04-22T12:42:45Z) - Likelihood Reward Redistribution [0.0]
We propose a emphLikelihood Reward Redistribution (LRR) framework for reward redistribution.
When integrated with an off-policy algorithm such as Soft Actor-Critic, LRR yields dense and informative reward signals.
arXiv Detail & Related papers (2025-03-20T20:50:49Z) - Learning Fair and Preferable Allocations through Neural Network [24.15688619889342]
We aim to design mechanisms that strictly satisfy good properties with replicating expert knowledge.
Formal algorithms struggle to find preferable outcomes, and directly replicating these implicit rules can result in unfair allocations.
We develop a neural network that parameterizes RR and can be trained to learn the agent ordering used for RR.
arXiv Detail & Related papers (2024-10-23T01:47:55Z) - Deep Learning Meets Mechanism Design: Key Results and Some Novel
Applications [1.2661010067882734]
We present, from relevant literature, technical details of using a deep learning approach for mechanism design.
We demonstrate the power of this approach for three illustrative case studies.
arXiv Detail & Related papers (2024-01-11T06:09:32Z) - Learning Payment-Free Resource Allocation Mechanisms [19.60309632660988]
We consider the design of mechanisms that limited resources among self-interested agents using neural networks.
We contribute a new end-to-end neural network architecture, ExS-Net, that accommodates the idea of "money-burning" for mechanism design without payments.
arXiv Detail & Related papers (2023-11-18T01:21:54Z) - Refined Mechanism Design for Approximately Structured Priors via Active
Regression [50.71772232237571]
We consider the problem of a revenue-maximizing seller with a large number of items for sale to $n$ strategic bidders.
It is well-known that optimal and even approximately-optimal mechanisms for this setting are notoriously difficult to characterize or compute.
arXiv Detail & Related papers (2023-10-11T20:34:17Z) - Benefits of Permutation-Equivariance in Auction Mechanisms [90.42990121652956]
An auction mechanism that maximizes the auctioneer's revenue while minimizes bidders' ex-post regret is an important yet intricate problem in economics.
Remarkable progress has been achieved through learning the optimal auction mechanism by neural networks.
arXiv Detail & Related papers (2022-10-11T16:13:25Z) - Monotonic Improvement Guarantees under Non-stationarity for
Decentralized PPO [66.5384483339413]
We present a new monotonic improvement guarantee for optimizing decentralized policies in cooperative Multi-Agent Reinforcement Learning (MARL)
We show that a trust region constraint can be effectively enforced in a principled way by bounding independent ratios based on the number of agents in training.
arXiv Detail & Related papers (2022-01-31T20:39:48Z) - Strategyproof and Proportionally Fair Facility Location [77.16035689756859]
We focus on a simple, one-dimensional collective decision problem (often referred to as the facility location problem)
We analyze a hierarchy of proportionality-based fairness axioms of varying strength.
For each axiom, we characterize the family of mechanisms that satisfy the axiom and strategyproofness.
arXiv Detail & Related papers (2021-11-02T12:41:32Z) - Online Learning of Competitive Equilibria in Exchange Economies [94.24357018178867]
In economics, the sharing of scarce resources among multiple rational agents is a classical problem.
We propose an online learning mechanism to learn agent preferences.
We demonstrate the effectiveness of this mechanism through numerical simulations.
arXiv Detail & Related papers (2021-06-11T21:32:17Z) - VCG Mechanism Design with Unknown Agent Values under Stochastic Bandit
Feedback [104.06766271716774]
We study a multi-round welfare-maximising mechanism design problem in instances where agents do not know their values.
We first define three notions of regret for the welfare, the individual utilities of each agent and that of the mechanism.
Our framework also provides flexibility to control the pricing scheme so as to trade-off between the agent and seller regrets.
arXiv Detail & Related papers (2020-04-19T18:00:58Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.