Necessarily Optimal One-Sided Matchings
- URL: http://arxiv.org/abs/2007.09079v3
- Date: Tue, 13 Apr 2021 21:45:37 GMT
- Title: Necessarily Optimal One-Sided Matchings
- Authors: Hadi Hosseini, Vijay Menon, Nisarg Shah, Sujoy Sikdar
- Abstract summary: We study the classical problem of matching $n$ agents to $n$ objects, where the agents have ranked preferences over the objects.
Instead of asking the agents to report their complete preferences, our goal is to learn a desirable matching from partial preferences.
We design efficient algorithms to check if a given matching is NPO or NRM, and to check whether such a matching exists given top-$k$ partial preferences.
- Score: 49.0517583218216
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: We study the classical problem of matching $n$ agents to $n$ objects, where
the agents have ranked preferences over the objects. We focus on two popular
desiderata from the matching literature: Pareto optimality and rank-maximality.
Instead of asking the agents to report their complete preferences, our goal is
to learn a desirable matching from partial preferences, specifically a matching
that is necessarily Pareto optimal (NPO) or necessarily rank-maximal (NRM)
under any completion of the partial preferences. We focus on the top-$k$ model
in which agents reveal a prefix of their preference rankings. We design
efficient algorithms to check if a given matching is NPO or NRM, and to check
whether such a matching exists given top-$k$ partial preferences. We also study
online algorithms for eliciting partial preferences adaptively, and prove
bounds on their competitive ratio.
Related papers
- Comparing Bad Apples to Good Oranges: Aligning Large Language Models via Joint Preference Optimization [105.3612692153615]
A common technique for aligning large language models (LLMs) relies on acquiring human preferences.
We propose a new axis that is based on eliciting preferences jointly over the instruction-response pairs.
We find that joint preferences over instruction and response pairs can significantly enhance the alignment of LLMs.
arXiv Detail & Related papers (2024-03-31T02:05:40Z) - Human-in-the-loop: Provably Efficient Preference-based Reinforcement
Learning with General Function Approximation [107.54516740713969]
We study human-in-the-loop reinforcement learning (RL) with trajectory preferences.
Instead of receiving a numeric reward at each step, the agent only receives preferences over trajectory pairs from a human overseer.
We propose the first optimistic model-based algorithm for PbRL with general function approximation.
arXiv Detail & Related papers (2022-05-23T09:03:24Z) - Deep Probabilistic Graph Matching [72.6690550634166]
We propose a deep learning-based graph matching framework that works for the original QAP without compromising on the matching constraints.
The proposed method is evaluated on three popularly tested benchmarks (Pascal VOC, Willow Object and SPair-71k) and it outperforms all previous state-of-the-arts on all benchmarks.
arXiv Detail & Related papers (2022-01-05T13:37:27Z) - Trustworthy Preference Completion in Social Choice [36.91054060923998]
It is impractical to ask agents to provide linear orders over all alternatives, for these partial rankings it is necessary to conduct preference completion.
A trust-based anchor-kNN algorithm is proposed to find $k$-nearest trustworthy neighbors of the agent with trust-oriented Kendall-Tau distances.
A certain common voting rule for the first $k$ trustworthy neighboring agents based on certainty and conflict can be taken to conduct the trustworthy preference completion.
arXiv Detail & Related papers (2020-12-14T03:03:13Z) - Improving Welfare in One-sided Matching using Simple Threshold Queries [9.270928705464193]
We study one-sided matching problems where $n$ agents have preferences over $m$ objects.
We focus on learning more about their cardinal preferences using simple threshold queries.
arXiv Detail & Related papers (2020-11-27T20:02:57Z) - Ranking a set of objects: a graph based least-square approach [70.7866286425868]
We consider the problem of ranking $N$ objects starting from a set of noisy pairwise comparisons provided by a crowd of equal workers.
We propose a class of non-adaptive ranking algorithms that rely on a least-squares intrinsic optimization criterion for the estimation of qualities.
arXiv Detail & Related papers (2020-02-26T16:19:09Z) - SetRank: A Setwise Bayesian Approach for Collaborative Ranking from
Implicit Feedback [50.13745601531148]
We propose a novel setwise Bayesian approach for collaborative ranking, namely SetRank, to accommodate the characteristics of implicit feedback in recommender system.
Specifically, SetRank aims at maximizing the posterior probability of novel setwise preference comparisons.
We also present the theoretical analysis of SetRank to show that the bound of excess risk can be proportional to $sqrtM/N$.
arXiv Detail & Related papers (2020-02-23T06:40:48Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.