Group Meritocratic Fairness in Linear Contextual Bandits
- URL: http://arxiv.org/abs/2206.03150v1
- Date: Tue, 7 Jun 2022 09:54:38 GMT
- Title: Group Meritocratic Fairness in Linear Contextual Bandits
- Authors: Riccardo Grazzi, Arya Akhavan, John Isak Texas Falk, Leonardo Cella,
Massimiliano Pontil
- Abstract summary: We study the linear contextual bandit problem where an agent has to select one candidate from a pool and each candidate belongs to a sensitive group.
We propose a notion of fairness that states that the agent's policy is fair when it selects a candidate with highest relative rank.
- Score: 32.15680917495674
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: We study the linear contextual bandit problem where an agent has to select
one candidate from a pool and each candidate belongs to a sensitive group. In
this setting, candidates' rewards may not be directly comparable between
groups, for example when the agent is an employer hiring candidates from
different ethnic groups and some groups have a lower reward due to
discriminatory bias and/or social injustice. We propose a notion of fairness
that states that the agent's policy is fair when it selects a candidate with
highest relative rank, which measures how good the reward is when compared to
candidates from the same group. This is a very strong notion of fairness, since
the relative rank is not directly observed by the agent and depends on the
underlying reward model and on the distribution of rewards. Thus we study the
problem of learning a policy which approximates a fair policy under the
condition that the contexts are independent between groups and the distribution
of rewards of each group is absolutely continuous. In particular, we design a
greedy policy which at each round constructs a ridge regression estimator from
the observed context-reward pairs, and then computes an estimate of the
relative rank of each candidate using the empirical cumulative distribution
function. We prove that the greedy policy achieves, after $T$ rounds, up to log
factors and with high probability, a fair pseudo-regret of order $\sqrt{dT}$,
where $d$ is the dimension of the context vectors. The policy also satisfies
demographic parity at each round when averaged over all possible information
available before the selection. We finally show with a proof of concept
simulation that our policy achieves sub-linear fair pseudo-regret also in
practice.
Related papers
- On the Hardness of Decentralized Multi-Agent Policy Evaluation under Byzantine Attacks [12.696705862929337]
We study a fully-decentralized multi-agent policy evaluation problem in the presence of up to $f$ faulty agents.
In particular, we focus on the so-called Byzantine faulty model with model poisoning setting.
arXiv Detail & Related papers (2024-09-19T16:27:08Z) - Information Capacity Regret Bounds for Bandits with Mediator Feedback [55.269551124587224]
We introduce the policy set capacity as an information-theoretic measure for the complexity of the policy set.
Adopting the classical EXP4 algorithm, we provide new regret bounds depending on the policy set capacity.
For a selection of policy set families, we prove nearly-matching lower bounds, scaling similarly with the capacity.
arXiv Detail & Related papers (2024-02-15T19:18:47Z) - Theoretical guarantees on the best-of-n alignment policy [110.21094183592358]
We show that the KL divergence between the best-of-$n$ policy and the base policy is equal to $log (n) - (n-1)/n.$
We propose a new estimator for the KL divergence and empirically show that it provides a tight approximation through a few examples.
arXiv Detail & Related papers (2024-01-03T18:39:13Z) - Thompson Exploration with Best Challenger Rule in Best Arm
Identification [66.33448474838342]
We study the fixed-confidence best arm identification problem in the bandit framework.
We propose a novel policy that combines Thompson sampling with a computationally efficient approach known as the best challenger rule.
arXiv Detail & Related papers (2023-10-01T01:37:02Z) - Fair Group-Shared Representations with Normalizing Flows [68.29997072804537]
We develop a fair representation learning algorithm which is able to map individuals belonging to different groups in a single group.
We show experimentally that our methodology is competitive with other fair representation learning algorithms.
arXiv Detail & Related papers (2022-01-17T10:49:49Z) - Linear Contextual Bandits with Adversarial Corruptions [91.38793800392108]
We study the linear contextual bandit problem in the presence of adversarial corruption.
We present a variance-aware algorithm that is adaptive to the level of adversarial contamination $C$.
arXiv Detail & Related papers (2021-10-25T02:53:24Z) - Fairness in Ranking under Uncertainty [42.51950847766776]
Unfairness occurs when an agent with higher merit obtains a worse outcome than an agent with lower merit.
Our central point is that a primary cause of unfairness is uncertainty.
We show how to compute rankings that optimally trade off approximate fairness against utility to the principal.
arXiv Detail & Related papers (2021-07-14T14:10:16Z) - Fairness Preferences, Actual and Hypothetical: A Study of Crowdworker
Incentives [1.854931308524932]
This paper outlines a research program and experimental designs for investigating these questions.
The voting is hypothetical (not tied to an outcome) for half the group and actual (tied to the actual payment outcome) for the other half, so that we can understand the relation between a group's actual preferences and hypothetical (stated) preferences.
arXiv Detail & Related papers (2020-12-08T05:00:57Z) - On Fair Selection in the Presence of Implicit Variance [17.517529275692322]
We argue that even in the absence of implicit bias, the estimates of candidates' quality from different groups may differ in another fundamental way, namely, in their variance.
We propose a simple model in which candidates have a true latent quality that is drawn from a group-independent normal distribution.
We show that the demographic parity mechanism always increases the selection utility, while any $gamma$-rule weakly increases it.
arXiv Detail & Related papers (2020-06-24T13:08:31Z) - Distributional Individual Fairness in Clustering [7.303841123034983]
We introduce a framework for assigning individuals, embedded in a metric space, to probability distributions over a bounded number of cluster centers.
We provide an algorithm for clustering with $p$-norm objective and individual fairness constraints with provable approximation guarantee.
arXiv Detail & Related papers (2020-06-22T20:02:09Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.