Predicting user demographics based on interest analysis
- URL: http://arxiv.org/abs/2108.01014v1
- Date: Mon, 2 Aug 2021 16:25:09 GMT
- Title: Predicting user demographics based on interest analysis
- Authors: Reza Shafiloo, Marjan Kaedi, Ali Pourmiri
- Abstract summary: This paper proposes a framework to predict users' demographic based on ratings registered by users in a system.
Using all ratings registered by users improves the prediction accuracy by at least 16% compared with previously studied models.
- Score: 1.7403133838762448
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: These days, due to the increasing amount of information generated on the web,
most web service providers try to personalize their services. Users also
interact with web-based systems in multiple ways and state their interests and
preferences by rating the provided items. This paper proposes a framework to
predict users' demographic based on ratings registered by users in a system. To
the best of our knowledge, this is the first time that the item ratings are
employed for users' demographic prediction problems, which have extensively
been studied in recommendation systems and service personalization. We apply
the framework to the Movielens dataset's ratings and predict users' age and
gender. The experimental results show that using all ratings registered by
users improves the prediction accuracy by at least 16% compared with previously
studied models. Moreover, by classifying the items as popular and unpopular, we
eliminate ratings that belong to 95% of items and still reach an acceptable
level of accuracy. This significantly reduces update costs in a time-varying
environment. Besides this classification, we propose other methods to reduce
data volume while keeping the predictions accurate.
Related papers
- Analytical and Empirical Study of Herding Effects in Recommendation Systems [72.6693986712978]
We study how to manage product ratings via rating aggregation rules and shortlisted representative reviews.
We show that proper recency aware rating aggregation rules can improve the speed of convergence in Amazon and TripAdvisor.
arXiv Detail & Related papers (2024-08-20T14:29:23Z) - When the Majority is Wrong: Modeling Annotator Disagreement for Subjective Tasks [45.14664901245331]
A crucial problem in hate speech detection is determining whether a statement is offensive to a demographic group.
We construct a model that predicts individual annotator ratings on potentially offensive text.
We find that annotator ratings can be predicted using their demographic information and opinions on online content.
arXiv Detail & Related papers (2023-05-11T07:55:20Z) - ASPEST: Bridging the Gap Between Active Learning and Selective
Prediction [56.001808843574395]
Selective prediction aims to learn a reliable model that abstains from making predictions when uncertain.
Active learning aims to lower the overall labeling effort, and hence human dependence, by querying the most informative examples.
In this work, we introduce a new learning paradigm, active selective prediction, which aims to query more informative samples from the shifted target domain.
arXiv Detail & Related papers (2023-04-07T23:51:07Z) - Application of Machine Learning for Online Reputation Systems [0.4125187280299248]
This paper proposes a new reputation system using machine learning to predict reliability of consumers from consumer profile.
The proposed model has been evaluated over three MovieLens benchmarking datasets, using 10-Folds cross validation.
arXiv Detail & Related papers (2022-09-10T12:31:40Z) - Personalizing Intervened Network for Long-tailed Sequential User
Behavior Modeling [66.02953670238647]
Tail users suffer from significantly lower-quality recommendation than the head users after joint training.
A model trained on tail users separately still achieve inferior results due to limited data.
We propose a novel approach that significantly improves the recommendation performance of the tail users.
arXiv Detail & Related papers (2022-08-19T02:50:19Z) - D-BIAS: A Causality-Based Human-in-the-Loop System for Tackling
Algorithmic Bias [57.87117733071416]
We propose D-BIAS, a visual interactive tool that embodies human-in-the-loop AI approach for auditing and mitigating social biases.
A user can detect the presence of bias against a group by identifying unfair causal relationships in the causal network.
For each interaction, say weakening/deleting a biased causal edge, the system uses a novel method to simulate a new (debiased) dataset.
arXiv Detail & Related papers (2022-08-10T03:41:48Z) - Revisiting Popularity and Demographic Biases in Recommender Evaluation
and Effectiveness [6.210698627561645]
We investigate how recommender performance varies according to popularity and demographics.
We find statistically significant differences in recommender performance by both age and gender.
We observe that recommendation utility steadily degrades for older users, and is lower for women than men.
arXiv Detail & Related papers (2021-10-15T20:30:51Z) - Evaluation of Summarization Systems across Gender, Age, and Race [0.0]
We show that summary evaluation is sensitive to protected attributes.
This can severely bias system development and evaluation, leading us to build models that cater for some groups rather than others.
arXiv Detail & Related papers (2021-10-08T21:30:20Z) - Correcting the User Feedback-Loop Bias for Recommendation Systems [34.44834423714441]
We propose a systematic and dynamic way to correct user feedback-loop bias in recommendation systems.
Our method includes a deep-learning component to learn each user's dynamic rating history embedding.
We empirically validated the existence of such user feedback-loop bias in real world recommendation systems.
arXiv Detail & Related papers (2021-09-13T15:02:55Z) - Balancing Biases and Preserving Privacy on Balanced Faces in the Wild [50.915684171879036]
There are demographic biases present in current facial recognition (FR) models.
We introduce our Balanced Faces in the Wild dataset to measure these biases across different ethnic and gender subgroups.
We find that relying on a single score threshold to differentiate between genuine and imposters sample pairs leads to suboptimal results.
We propose a novel domain adaptation learning scheme that uses facial features extracted from state-of-the-art neural networks.
arXiv Detail & Related papers (2021-03-16T15:05:49Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.