Should I send this notification? Optimizing push notifications decision
making by modeling the future
- URL: http://arxiv.org/abs/2202.08812v1
- Date: Thu, 17 Feb 2022 18:27:17 GMT
- Title: Should I send this notification? Optimizing push notifications decision
making by modeling the future
- Authors: Conor O'Brien, Huasen Wu, Shaodan Zhai, Dalin Guo, Wenzhe Shi,
Jonathan J Hunt
- Abstract summary: Most recommender systems are myopic, that is they optimize based on the immediate response of the user.
This may be misaligned with the true objective, such as creating long term user satisfaction.
In this work we focus on mobile push notifications, where the long term effects of recommender system decisions can be particularly strong.
- Score: 4.476351684070796
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Most recommender systems are myopic, that is they optimize based on the
immediate response of the user. This may be misaligned with the true objective,
such as creating long term user satisfaction. In this work we focus on mobile
push notifications, where the long term effects of recommender system decisions
can be particularly strong. For example, sending too many or irrelevant
notifications may annoy a user and cause them to disable notifications.
However, a myopic system will always choose to send a notification since
negative effects occur in the future. This is typically mitigated using
heuristics. However, heuristics can be hard to reason about or improve, require
retuning each time the system is changed, and may be suboptimal. To counter
these drawbacks, there is significant interest in recommender systems that
optimize directly for long-term value (LTV). Here, we describe a method for
maximising LTV by using model-based reinforcement learning (RL) to make
decisions about whether to send push notifications. We model the effects of
sending a notification on the user's future behavior. Much of the prior work
applying RL to maximise LTV in recommender systems has focused on session-based
optimization, while the time horizon for notification decision making in this
work extends over several days. We test this approach in an A/B test on a major
social network. We show that by optimizing decisions about push notifications
we are able to send less notifications and obtain a higher open rate than the
baseline system, while generating the same level of user engagement on the
platform as the existing, heuristic-based, system.
Related papers
- Prompt Tuning as User Inherent Profile Inference Machine [53.78398656789463]
We propose UserIP-Tuning, which uses prompt-tuning to infer user profiles.
A profile quantization codebook bridges the modality gap by profile embeddings into collaborative IDs.
Experiments on four public datasets show that UserIP-Tuning outperforms state-of-the-art recommendation algorithms.
arXiv Detail & Related papers (2024-08-13T02:25:46Z) - TIM: Temporal Interaction Model in Notification System [6.377444652197526]
We propose the Temporal Interaction Model (TIM), which models users' behavior patterns by estimating CTR in every time slot over a day in our short video application Kuaishou.
TIM is a reliable tool for forecasting user behavior, leading to a remarkable enhancement in user engagement without causing undue disturbance.
arXiv Detail & Related papers (2024-06-11T08:53:15Z) - System-2 Recommenders: Disentangling Utility and Engagement in Recommendation Systems via Temporal Point-Processes [80.97898201876592]
We propose a generative model in which past content interactions impact the arrival rates of users based on a self-exciting Hawkes process.
We show analytically that given samples it is possible to disentangle System-1 and System-2 and allow content optimization based on user utility.
arXiv Detail & Related papers (2024-05-29T18:19:37Z) - Prompt Optimization with Human Feedback [69.95991134172282]
We study the problem of prompt optimization with human feedback (POHF)
We introduce our algorithm named automated POHF (APOHF)
The results demonstrate that our APOHF can efficiently find a good prompt using a small number of preference feedback instances.
arXiv Detail & Related papers (2024-05-27T16:49:29Z) - Interest Clock: Time Perception in Real-Time Streaming Recommendation System [14.993810545170343]
Time modeling aims to enable recommendation systems to perceive time changes to capture users' dynamic preferences over time.
There is still a lack of effective time modeling methods for streaming recommendation systems.
In this paper, we propose an effective and universal method Interest Clock to perceive time information in recommendation systems.
arXiv Detail & Related papers (2024-04-30T08:38:09Z) - Latent User Intent Modeling for Sequential Recommenders [92.66888409973495]
Sequential recommender models learn to predict the next items a user is likely to interact with based on his/her interaction history on the platform.
Most sequential recommenders however lack a higher-level understanding of user intents, which often drive user behaviors online.
Intent modeling is thus critical for understanding users and optimizing long-term user experience.
arXiv Detail & Related papers (2022-11-17T19:00:24Z) - FedGRec: Federated Graph Recommender System with Lazy Update of Latent
Embeddings [108.77460689459247]
We propose a Federated Graph Recommender System (FedGRec) to mitigate privacy concerns.
In our system, users and the server explicitly store latent embeddings for users and items, where the latent embeddings summarize different orders of indirect user-item interactions.
We perform extensive empirical evaluations to verify the efficacy of using latent embeddings as a proxy of missing interaction graph.
arXiv Detail & Related papers (2022-10-25T01:08:20Z) - A State Transition Model for Mobile Notifications via Survival Analysis [10.638942431625381]
We propose a state transition framework to quantitatively evaluate the effectiveness of notifications.
We develop a survival model for badging notifications assuming a log-linear structure and a Weibull distribution.
Our results show that this model achieves more flexibility for applications and superior prediction accuracy than a logistic regression model.
arXiv Detail & Related papers (2022-07-07T05:38:39Z) - Offline Reinforcement Learning for Mobile Notifications [1.965345368500676]
Mobile notification systems have taken a major role in driving and maintaining user engagement for online platforms.
Most machine learning applications in notification systems are built around response-prediction models.
We argue that reinforcement learning is a better framework for notification systems in terms of performance and iteration speed.
arXiv Detail & Related papers (2022-02-04T22:22:22Z) - Reward Constrained Interactive Recommendation with Natural Language
Feedback [158.8095688415973]
We propose a novel constraint-augmented reinforcement learning (RL) framework to efficiently incorporate user preferences over time.
Specifically, we leverage a discriminator to detect recommendations violating user historical preference.
Our proposed framework is general and is further extended to the task of constrained text generation.
arXiv Detail & Related papers (2020-05-04T16:23:34Z) - A Snooze-less User-Aware Notification System for Proactive
Conversational Agents [6.4378876455245235]
We propose an alert and notification framework that intelligently issues, suppresses and aggregates notifications.
Our framework can be deployed as a backend service, but is better suited to be integrated into proactive conversational agents.
arXiv Detail & Related papers (2020-03-04T14:31:21Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.