Eliciting Kemeny Rankings
- URL: http://arxiv.org/abs/2312.11663v1
- Date: Mon, 18 Dec 2023 19:14:42 GMT
- Title: Eliciting Kemeny Rankings
- Authors: Anne-Marie George, Christos Dimitrakakis
- Abstract summary: We find approximation bounds for Kemeny rankings dependant on confidence intervals over estimated winning probabilities of arms.
We formulate several adaptive sampling methods that use look-aheads to estimate how much confidence intervals might be tightened.
- Score: 6.971011179091351
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: We formulate the problem of eliciting agents' preferences with the goal of
finding a Kemeny ranking as a Dueling Bandits problem. Here the bandits' arms
correspond to alternatives that need to be ranked and the feedback corresponds
to a pairwise comparison between alternatives by a randomly sampled agent. We
consider both sampling with and without replacement, i.e., the possibility to
ask the same agent about some comparison multiple times or not.
We find approximation bounds for Kemeny rankings dependant on confidence
intervals over estimated winning probabilities of arms. Based on these we state
algorithms to find Probably Approximately Correct (PAC) solutions and elaborate
on their sample complexity for sampling with or without replacement.
Furthermore, if all agents' preferences are strict rankings over the
alternatives, we provide means to prune confidence intervals and thereby guide
a more efficient elicitation. We formulate several adaptive sampling methods
that use look-aheads to estimate how much confidence intervals (and thus
approximation guarantees) might be tightened. All described methods are
compared on synthetic data.
Related papers
- Best Arm Identification with Fixed Budget: A Large Deviation Perspective [54.305323903582845]
We present sred, a truly adaptive algorithm that can reject arms in it any round based on the observed empirical gaps between the rewards of various arms.
In particular, we present sred, a truly adaptive algorithm that can reject arms in it any round based on the observed empirical gaps between the rewards of various arms.
arXiv Detail & Related papers (2023-12-19T13:17:43Z) - Pure Exploration under Mediators' Feedback [63.56002444692792]
Multi-armed bandits are a sequential-decision-making framework, where, at each interaction step, the learner selects an arm and observes a reward.
We consider the scenario in which the learner has access to a set of mediators, each of which selects the arms on the agent's behalf according to a and possibly unknown policy.
We propose a sequential decision-making strategy for discovering the best arm under the assumption that the mediators' policies are known to the learner.
arXiv Detail & Related papers (2023-08-29T18:18:21Z) - Covariance Adaptive Best Arm Identification [0.0]
The goal is to identify the arm with the highest mean reward with a probability of at least 1 -- $delta$, while minimizing the number of arm pulls.
We propose a more flexible scenario where arms can be dependent and rewards can be sampled simultaneously.
This framework is relevant in various applications, such as clinical trials, where similarities between patients or drugs suggest underlying correlations.
arXiv Detail & Related papers (2023-06-05T06:57:09Z) - Top Two Algorithms Revisited [14.783452541904365]
Top Two algorithms arose as an adaptation of Thompson sampling to best arm identification in multi-armed bandit models.
We provide a general analysis of Top Two methods, which identifies desirable properties of the leader, the challenger, and the (possibly non-parametric) distributions of the arms.
Our proof method demonstrates in particular that the sampling step used to select the leader inherited from Thompson sampling can be replaced by other choices.
arXiv Detail & Related papers (2022-06-13T09:03:24Z) - Algorithms for Adaptive Experiments that Trade-off Statistical Analysis
with Reward: Combining Uniform Random Assignment and Reward Maximization [50.725191156128645]
Multi-armed bandit algorithms like Thompson Sampling can be used to conduct adaptive experiments.
We present simulations for 2-arm experiments that explore two algorithms that combine the benefits of uniform randomization for statistical analysis.
arXiv Detail & Related papers (2021-12-15T22:11:58Z) - Mean-based Best Arm Identification in Stochastic Bandits under Reward
Contamination [80.53485617514707]
This paper proposes two algorithms, a gap-based algorithm and one based on the successive elimination, for best arm identification in sub-Gaussian bandits.
Specifically, for the gap-based algorithm, the sample complexity is optimal up to constant factors, while for the successive elimination, it is optimal up to logarithmic factors.
arXiv Detail & Related papers (2021-11-14T21:49:58Z) - Adaptive Sampling for Heterogeneous Rank Aggregation from Noisy Pairwise
Comparisons [85.5955376526419]
In rank aggregation problems, users exhibit various accuracy levels when comparing pairs of items.
We propose an elimination-based active sampling strategy, which estimates the ranking of items via noisy pairwise comparisons.
We prove that our algorithm can return the true ranking of items with high probability.
arXiv Detail & Related papers (2021-10-08T13:51:55Z) - Peer Selection with Noisy Assessments [43.307040330622186]
We extend PeerNomination, the most accurate peer reviewing algorithm to date, into WeightedPeerNomination.
We show analytically that a weighting scheme can improve the overall accuracy of the selection significantly.
arXiv Detail & Related papers (2021-07-21T14:47:11Z) - Towards Model-Agnostic Post-Hoc Adjustment for Balancing Ranking
Fairness and Algorithm Utility [54.179859639868646]
Bipartite ranking aims to learn a scoring function that ranks positive individuals higher than negative ones from labeled data.
There have been rising concerns on whether the learned scoring function can cause systematic disparity across different protected groups.
We propose a model post-processing framework for balancing them in the bipartite ranking scenario.
arXiv Detail & Related papers (2020-06-15T10:08:39Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.