Subset Selection Based On Multiple Rankings in the Presence of Bias:
Effectiveness of Fairness Constraints for Multiwinner Voting Score Functions
- URL: http://arxiv.org/abs/2306.09835v1
- Date: Fri, 16 Jun 2023 13:25:27 GMT
- Title: Subset Selection Based On Multiple Rankings in the Presence of Bias:
Effectiveness of Fairness Constraints for Multiwinner Voting Score Functions
- Authors: Niclas Boehmer, L. Elisa Celis, Lingxiao Huang, Anay Mehrotra,
Nisheeth K. Vishnoi
- Abstract summary: We consider the problem of subset selection where one is given multiple rankings of items and the goal is to select the highest quality''
We show that for fairness constraints to be effective, different multiwinner score functions may require a drastically different number of rankings.
- Score: 42.23968578772441
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: We consider the problem of subset selection where one is given multiple
rankings of items and the goal is to select the highest ``quality'' subset.
Score functions from the multiwinner voting literature have been used to
aggregate rankings into quality scores for subsets. We study this setting of
subset selection problems when, in addition, rankings may contain systemic or
unconscious biases toward a group of items. For a general model of input
rankings and biases, we show that requiring the selected subset to satisfy
group fairness constraints can improve the quality of the selection with
respect to unbiased rankings. Importantly, we show that for fairness
constraints to be effective, different multiwinner score functions may require
a drastically different number of rankings: While for some functions, fairness
constraints need an exponential number of rankings to recover a
close-to-optimal solution, for others, this dependency is only polynomial. This
result relies on a novel notion of ``smoothness'' of submodular functions in
this setting that quantifies how well a function can ``correctly'' assess the
quality of items in the presence of bias. The results in this paper can be used
to guide the choice of multiwinner score functions for the subset selection
setting considered here; we additionally provide a tool to empirically enable
this.
Related papers
- Enhancing Neural Subset Selection: Integrating Background Information into Set Representations [53.15923939406772]
We show that when the target value is conditioned on both the input set and subset, it is essential to incorporate an textitinvariant sufficient statistic of the superset into the subset of interest.
This ensures that the output value remains invariant to permutations of the subset and its corresponding superset, enabling identification of the specific superset from which the subset originated.
arXiv Detail & Related papers (2024-02-05T16:09:35Z) - Bipartite Ranking Fairness through a Model Agnostic Ordering Adjustment [54.179859639868646]
We propose a model agnostic post-processing framework xOrder for achieving fairness in bipartite ranking.
xOrder is compatible with various classification models and ranking fairness metrics, including supervised and unsupervised fairness metrics.
We evaluate our proposed algorithm on four benchmark data sets and two real-world patient electronic health record repositories.
arXiv Detail & Related papers (2023-07-27T07:42:44Z) - Explainable Disparity Compensation for Efficient Fair Ranking [0.3759936323189418]
Ranking functions that are used in decision systems often produce disparate results for different populations because of bias in the underlying data.
Recent compensatory measures have mostly focused on opaque transformations of the ranking functions to satisfy fairness guarantees.
In this paper we propose easily explainable data-driven compensatory measures for ranking functions.
arXiv Detail & Related papers (2023-07-25T09:12:50Z) - Stop Overcomplicating Selective Classification: Use Max-Logit [2.3677503557659705]
We tackle the problem of Selective Classification where the goal is to achieve the best performance on the desired coverages of the dataset.
Recent state-of-the-art selective methods come with architectural changes either via introducing a separate selection head or an extra abstention logit.
We present surprising results for Selective Classification by confirming that the superior performance of state-of-the-art methods is owed to training a more generalizable classifier.
arXiv Detail & Related papers (2022-06-17T22:23:11Z) - Ranking with submodular functions on a budget [17.877243904657952]
We propose a novel formulation for ranking items with submodular valuations and budget constraints.
For the MSR problem with cardinality- and knapsack-type budget constraints, we propose practical algorithms with approximation guarantees.
arXiv Detail & Related papers (2022-04-08T16:29:45Z) - Adaptive Cascade Submodular Maximization [19.29174615532181]
We study the cascade submodular problem under the adaptive setting.
Our objective is to identify the best sequence of selecting items so as to maximize the expected utility of the selected items.
arXiv Detail & Related papers (2020-07-07T16:21:56Z) - Towards Model-Agnostic Post-Hoc Adjustment for Balancing Ranking
Fairness and Algorithm Utility [54.179859639868646]
Bipartite ranking aims to learn a scoring function that ranks positive individuals higher than negative ones from labeled data.
There have been rising concerns on whether the learned scoring function can cause systematic disparity across different protected groups.
We propose a model post-processing framework for balancing them in the bipartite ranking scenario.
arXiv Detail & Related papers (2020-06-15T10:08:39Z) - SetRank: A Setwise Bayesian Approach for Collaborative Ranking from
Implicit Feedback [50.13745601531148]
We propose a novel setwise Bayesian approach for collaborative ranking, namely SetRank, to accommodate the characteristics of implicit feedback in recommender system.
Specifically, SetRank aims at maximizing the posterior probability of novel setwise preference comparisons.
We also present the theoretical analysis of SetRank to show that the bound of excess risk can be proportional to $sqrtM/N$.
arXiv Detail & Related papers (2020-02-23T06:40:48Z) - Optimal Clustering from Noisy Binary Feedback [75.17453757892152]
We study the problem of clustering a set of items from binary user feedback.
We devise an algorithm with a minimal cluster recovery error rate.
For adaptive selection, we develop an algorithm inspired by the derivation of the information-theoretical error lower bounds.
arXiv Detail & Related papers (2019-10-14T09:18:26Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.