Learning to Rank For Push Notifications Using Pairwise Expected Regret
- URL: http://arxiv.org/abs/2201.07681v1
- Date: Wed, 19 Jan 2022 16:12:25 GMT
- Title: Learning to Rank For Push Notifications Using Pairwise Expected Regret
- Authors: Yuguang Yue, Yuanpu Xie, Huasen Wu, Haofeng Jia, Shaodan Zhai, Wenzhe
Shi, Jonathan J Hunt
- Abstract summary: New paradigms of content consumption present new challenges for ranking methods.
We introduce a novel ranking loss based on weighting the pairwise loss between candidates by the expected regret incurred for misordering the pair.
We demonstrate that the proposed method can outperform prior methods both in a simulated environment and in a production experiment on a major social network.
- Score: 8.990318688557
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Listwise ranking losses have been widely studied in recommender systems.
However, new paradigms of content consumption present new challenges for
ranking methods. In this work we contribute an analysis of learning to rank for
personalized mobile push notifications and discuss the unique challenges this
presents compared to traditional ranking problems. To address these challenges,
we introduce a novel ranking loss based on weighting the pairwise loss between
candidates by the expected regret incurred for misordering the pair. We
demonstrate that the proposed method can outperform prior methods both in a
simulated environment and in a production experiment on a major social network.
Related papers
- Towards Principled Learning for Re-ranking in Recommender Systems [10.881339468480467]
Two principles are proposed, including convergence consistency and adversarial consistency.
These two principles can be applied in the learning of a generic re-ranker and improve its performance.
arXiv Detail & Related papers (2025-04-05T14:14:36Z) - PageRank Bandits for Link Prediction [72.61386754332776]
Link prediction is a critical problem in graph learning with broad applications such as recommender systems and knowledge graph completion.
This paper reformulates link prediction as a sequential decision-making process, where each link prediction interaction occurs sequentially.
We propose a novel fusion algorithm, PRB (PageRank Bandits), which is the first to combine contextual bandits with PageRank for collaborative exploitation and exploration.
arXiv Detail & Related papers (2024-11-03T02:39:28Z) - Correcting for Popularity Bias in Recommender Systems via Item Loss Equalization [1.7771454131646311]
A small set of popular items dominate the recommendation results due to their high interaction rates.
This phenomenon disproportionately benefits users with mainstream tastes while neglecting those with niche interests.
We propose an in-processing approach to address this issue by intervening in the training process of recommendation models.
arXiv Detail & Related papers (2024-10-07T08:34:18Z) - Enhancing Consistency and Mitigating Bias: A Data Replay Approach for
Incremental Learning [100.7407460674153]
Deep learning systems are prone to catastrophic forgetting when learning from a sequence of tasks.
To mitigate the problem, a line of methods propose to replay the data of experienced tasks when learning new tasks.
However, it is not expected in practice considering the memory constraint or data privacy issue.
As a replacement, data-free data replay methods are proposed by inverting samples from the classification model.
arXiv Detail & Related papers (2024-01-12T12:51:12Z) - Prototypical quadruplet for few-shot class incremental learning [24.814045065163135]
We propose a novel method that improves classification robustness by identifying a better embedding space using an improved contrasting loss.
Our approach retains previously acquired knowledge in the embedding space, even when trained with new classes.
We demonstrate the effectiveness of our method by showing that the embedding space remains intact after training the model with new classes and outperforms existing state-of-the-art algorithms in terms of accuracy across different sessions.
arXiv Detail & Related papers (2022-11-05T17:19:14Z) - Large-Scale Sequential Learning for Recommender and Engineering Systems [91.3755431537592]
In this thesis, we focus on the design of an automatic algorithms that provide personalized ranking by adapting to the current conditions.
For the former, we propose novel algorithm called SAROS that take into account both kinds of feedback for learning over the sequence of interactions.
The proposed idea of taking into account the neighbour lines shows statistically significant results in comparison with the initial approach for faults detection in power grid.
arXiv Detail & Related papers (2022-05-13T21:09:41Z) - SURF: Semi-supervised Reward Learning with Data Augmentation for
Feedback-efficient Preference-based Reinforcement Learning [168.89470249446023]
We present SURF, a semi-supervised reward learning framework that utilizes a large amount of unlabeled samples with data augmentation.
In order to leverage unlabeled samples for reward learning, we infer pseudo-labels of the unlabeled samples based on the confidence of the preference predictor.
Our experiments demonstrate that our approach significantly improves the feedback-efficiency of the preference-based method on a variety of locomotion and robotic manipulation tasks.
arXiv Detail & Related papers (2022-03-18T16:50:38Z) - A Supervised Learning Approach to Rankability [0.6015898117103067]
rankability of data is a problem that considers the ability of a dataset, represented as a graph, to produce a meaningful ranking of the items it contains.
We propose new methods to assess rankability, which are amenable to efficient estimation.
arXiv Detail & Related papers (2022-03-14T17:55:43Z) - New Insights on Reducing Abrupt Representation Change in Online
Continual Learning [69.05515249097208]
We focus on the change in representations of observed data that arises when previously unobserved classes appear in the incoming data stream.
We show that applying Experience Replay causes the newly added classes' representations to overlap significantly with the previous classes.
We propose a new method which mitigates this issue by shielding the learned representations from drastic adaptation to accommodate new classes.
arXiv Detail & Related papers (2022-03-08T01:37:00Z) - A Pre-processing Method for Fairness in Ranking [0.0]
We propose a fair ranking framework that evaluates the order of training data in a pairwise manner.
We show that our method outperforms the existing methods in the trade-off between accuracy and fairness over real-world datasets.
arXiv Detail & Related papers (2021-10-29T02:55:32Z) - PiRank: Learning To Rank via Differentiable Sorting [85.28916333414145]
We propose PiRank, a new class of differentiable surrogates for ranking.
We show that PiRank exactly recovers the desired metrics in the limit of zero temperature.
arXiv Detail & Related papers (2020-12-12T05:07:36Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.