Sorting Big Data by Revealed Preference with Application to College
Ranking
- URL: http://arxiv.org/abs/2003.12198v1
- Date: Fri, 27 Mar 2020 01:11:47 GMT
- Title: Sorting Big Data by Revealed Preference with Application to College
Ranking
- Authors: Xingwei Hu
- Abstract summary: When ranking big data observations, diverse consumers reveal heterogeneous preferences.
A properly sorted solution could help consumers make the right choices, and governments make wise policy decisions.
The employed approach can be applied in many other areas, such as sports team ranking, academic journal ranking, voting, and real effective exchange rates.
- Score: 0.0
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: When ranking big data observations such as colleges in the United States,
diverse consumers reveal heterogeneous preferences. The objective of this paper
is to sort out a linear ordering for these observations and to recommend
strategies to improve their relative positions in the ranking. A properly
sorted solution could help consumers make the right choices, and governments
make wise policy decisions. Previous researchers have applied exogenous
weighting or multivariate regression approaches to sort big data objects,
ignoring their variety and variability. By recognizing the diversity and
heterogeneity among both the observations and the consumers, we instead apply
endogenous weighting to these contradictory revealed preferences. The outcome
is a consistent steady-state solution to the counterbalance equilibrium within
these contradictions. The solution takes into consideration the spillover
effects of multiple-step interactions among the observations. When information
from data is efficiently revealed in preferences, the revealed preferences
greatly reduce the volume of the required data in the sorting process. The
employed approach can be applied in many other areas, such as sports team
ranking, academic journal ranking, voting, and real effective exchange rates.
Related papers
- Rate-Optimal Rank Aggregation with Private Pairwise Rankings [12.511220449652384]
We address the challenge of preserving privacy while ensuring the utility of rank aggregation based on pairwise rankings.
Motivated by this, we propose adaptively debiasing the rankings from the randomized response mechanism.
arXiv Detail & Related papers (2024-02-26T18:05:55Z) - Tackling Diverse Minorities in Imbalanced Classification [80.78227787608714]
Imbalanced datasets are commonly observed in various real-world applications, presenting significant challenges in training classifiers.
We propose generating synthetic samples iteratively by mixing data samples from both minority and majority classes.
We demonstrate the effectiveness of our proposed framework through extensive experiments conducted on seven publicly available benchmark datasets.
arXiv Detail & Related papers (2023-08-28T18:48:34Z) - Don't Throw it Away! The Utility of Unlabeled Data in Fair Decision
Making [14.905698014932488]
We propose a novel method based on a variational autoencoder for practical fair decision-making.
Our method learns an unbiased data representation leveraging both labeled and unlabeled data.
Our method converges to the optimal (fair) policy according to the ground-truth with low variance.
arXiv Detail & Related papers (2022-05-10T10:33:11Z) - Unbiased Pairwise Learning to Rank in Recommender Systems [4.058828240864671]
Unbiased learning to rank algorithms are appealing candidates and have already been applied in many applications with single categorical labels.
We propose a novel unbiased LTR algorithm to tackle the challenges, which innovatively models position bias in the pairwise fashion.
Experiment results on public benchmark datasets and internal live traffic show the superior results of the proposed method for both categorical and continuous labels.
arXiv Detail & Related papers (2021-11-25T06:04:59Z) - Multi-characteristic Subject Selection from Biased Datasets [79.82881947891589]
We present a constrained optimization-based method that finds the best possible sampling fractions for the different population subgroups.
Our results show that our proposed method outperforms the baselines for all problem variations by up to 90%.
arXiv Detail & Related papers (2020-12-18T15:55:27Z) - Taking the Counterfactual Online: Efficient and Unbiased Online
Evaluation for Ranking [74.46448041224247]
We introduce the novel Logging-Policy Optimization Algorithm (LogOpt), which optimize the policy for logging data.
LogOpt turns the counterfactual approach - which is indifferent to the logging policy - into an online approach, where the algorithm decides what rankings to display.
We prove that, as an online evaluation method, LogOpt is unbiased w.r.t. position and item-selection bias, unlike existing interleaving methods.
arXiv Detail & Related papers (2020-07-24T18:05:58Z) - Towards Model-Agnostic Post-Hoc Adjustment for Balancing Ranking
Fairness and Algorithm Utility [54.179859639868646]
Bipartite ranking aims to learn a scoring function that ranks positive individuals higher than negative ones from labeled data.
There have been rising concerns on whether the learned scoring function can cause systematic disparity across different protected groups.
We propose a model post-processing framework for balancing them in the bipartite ranking scenario.
arXiv Detail & Related papers (2020-06-15T10:08:39Z) - Directional Multivariate Ranking [39.81227580524465]
We propose a directional multi-aspect ranking criterion to enable a holistic ranking of items with respect to multiple aspects.
Our key insight is that the direction of the difference vector between two multi-aspect preference vectors reveals the pairwise order of comparison.
arXiv Detail & Related papers (2020-06-09T22:43:03Z) - Contrastive Examples for Addressing the Tyranny of the Majority [83.93825214500131]
We propose to create a balanced training dataset, consisting of the original dataset plus new data points in which the group memberships are intervened.
We show that current generative adversarial networks are a powerful tool for learning these data points, called contrastive examples.
arXiv Detail & Related papers (2020-04-14T14:06:44Z) - Handling Position Bias for Unbiased Learning to Rank in Hotels Search [0.951828574518325]
We will investigate the importance of properly handling the position bias in an online test environment in Tripadvisor Hotels search.
We propose an empirically effective method of handling the position bias that fully leverages the user action data.
The online A/B test results show that this method leads to an improved search ranking model.
arXiv Detail & Related papers (2020-02-28T03:48:42Z) - SetRank: A Setwise Bayesian Approach for Collaborative Ranking from
Implicit Feedback [50.13745601531148]
We propose a novel setwise Bayesian approach for collaborative ranking, namely SetRank, to accommodate the characteristics of implicit feedback in recommender system.
Specifically, SetRank aims at maximizing the posterior probability of novel setwise preference comparisons.
We also present the theoretical analysis of SetRank to show that the bound of excess risk can be proportional to $sqrtM/N$.
arXiv Detail & Related papers (2020-02-23T06:40:48Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.