Reinforcement Learning for Strategic Recommendations
- URL: http://arxiv.org/abs/2009.07346v1
- Date: Tue, 15 Sep 2020 20:45:48 GMT
- Title: Reinforcement Learning for Strategic Recommendations
- Authors: Georgios Theocharous, Yash Chandak, Philip S. Thomas, Frits de Nijs
- Abstract summary: Strategic recommendations (SR) refer to the problem where an intelligent agent observes the sequential behaviors and activities of users and decides when and how to interact with them to optimize some long-term objectives, both for the user and the business.
At Adobe research, we have been implementing such systems for various use-cases, including points of interest recommendations, tutorial recommendations, next step guidance in multi-media editing software, and ad recommendation for optimizing lifetime value.
There are many research challenges when building these systems, such as modeling the sequential behavior of users, deciding when to intervene and offer recommendations without annoying the user, evaluating policies offline with
- Score: 32.73903761398027
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Strategic recommendations (SR) refer to the problem where an intelligent
agent observes the sequential behaviors and activities of users and decides
when and how to interact with them to optimize some long-term objectives, both
for the user and the business. These systems are in their infancy in the
industry and in need of practical solutions to some fundamental research
challenges. At Adobe research, we have been implementing such systems for
various use-cases, including points of interest recommendations, tutorial
recommendations, next step guidance in multi-media editing software, and ad
recommendation for optimizing lifetime value. There are many research
challenges when building these systems, such as modeling the sequential
behavior of users, deciding when to intervene and offer recommendations without
annoying the user, evaluating policies offline with high confidence, safe
deployment, non-stationarity, building systems from passive data that do not
contain past recommendations, resource constraint optimization in multi-user
systems, scaling to large and dynamic actions spaces, and handling and
incorporating human cognitive biases. In this paper we cover various use-cases
and research challenges we solved to make these systems practical.
Related papers
- Relevance meets Diversity: A User-Centric Framework for Knowledge Exploration through Recommendations [15.143224593682012]
We propose a novel recommendation strategy that combines relevance and diversity by a copula function.
We use diversity as a surrogate of the amount of knowledge obtained by the user while interacting with the system.
Our strategy outperforms several state-of-the-art competitors.
arXiv Detail & Related papers (2024-08-07T13:48:24Z) - EASRec: Elastic Architecture Search for Efficient Long-term Sequential
Recommender Systems [82.76483989905961]
Current Sequential Recommender Systems (SRSs) suffer from computational and resource inefficiencies.
We develop the Elastic Architecture Search for Efficient Long-term Sequential Recommender Systems (EASRec)
EASRec introduces data-aware gates that leverage historical information from input data batch to improve the performance of the recommendation network.
arXiv Detail & Related papers (2024-02-01T07:22:52Z) - Embedding in Recommender Systems: A Survey [67.67966158305603]
A crucial aspect is embedding techniques that covert the high-dimensional discrete features, such as user and item IDs, into low-dimensional continuous vectors.
Applying embedding techniques captures complex entity relationships and has spurred substantial research.
This survey covers embedding methods like collaborative filtering, self-supervised learning, and graph-based techniques.
arXiv Detail & Related papers (2023-10-28T06:31:06Z) - Modeling Recommender Ecosystems: Research Challenges at the Intersection
of Mechanism Design, Reinforcement Learning and Generative Models [17.546954143602818]
We argue that modeling the incentives and behaviors of all actors in the system is strictly necessary to maximize the value the system brings to these actors and improve overall ecosystem "health"
We propose a conceptual framework that encompasses these elements, and articulate a number of research challenges that emerge at the intersection of these different disciplines.
arXiv Detail & Related papers (2023-09-08T03:20:58Z) - Fisher-Weighted Merge of Contrastive Learning Models in Sequential
Recommendation [0.0]
We are the first to apply the Fisher-Merging method to Sequential Recommendation, addressing and resolving practical challenges associated with it.
We demonstrate the effectiveness of our proposed methods, highlighting their potential to advance the state-of-the-art in sequential learning and recommendation systems.
arXiv Detail & Related papers (2023-07-05T05:58:56Z) - A Review on Pushing the Limits of Baseline Recommendation Systems with
the integration of Opinion Mining & Information Retrieval Techniques [0.0]
Recommendation Systems allow users to identify trending items among a community while being timely and relevant to the user's expectations.
Deep Learning methods have been brought forward to achieve better quality recommendations.
Researchers have tried to expand on the capabilities of standard recommendation systems to provide the most effective recommendations.
arXiv Detail & Related papers (2022-05-03T22:13:33Z) - Knowledge-Enhanced Hierarchical Graph Transformer Network for
Multi-Behavior Recommendation [56.12499090935242]
This work proposes a Knowledge-Enhanced Hierarchical Graph Transformer Network (KHGT) to investigate multi-typed interactive patterns between users and items in recommender systems.
KHGT is built upon a graph-structured neural architecture to capture type-specific behavior characteristics.
We show that KHGT consistently outperforms many state-of-the-art recommendation methods across various evaluation settings.
arXiv Detail & Related papers (2021-10-08T09:44:00Z) - User Tampering in Reinforcement Learning Recommender Systems [2.28438857884398]
We highlight a unique safety concern prevalent in reinforcement learning (RL)-based recommendation algorithms -- 'user tampering'
User tampering is a situation where an RL-based recommender system may manipulate a media user's opinions through its suggestions as part of a policy to maximize long-term user engagement.
arXiv Detail & Related papers (2021-09-09T07:53:23Z) - Generative Inverse Deep Reinforcement Learning for Online Recommendation [62.09946317831129]
We propose a novel inverse reinforcement learning approach, namely InvRec, for online recommendation.
InvRec extracts the reward function from user's behaviors automatically, for online recommendation.
arXiv Detail & Related papers (2020-11-04T12:12:25Z) - Optimizing Interactive Systems via Data-Driven Objectives [70.3578528542663]
We propose an approach that infers the objective directly from observed user interactions.
These inferences can be made regardless of prior knowledge and across different types of user behavior.
We introduce Interactive System (ISO), a novel algorithm that uses these inferred objectives for optimization.
arXiv Detail & Related papers (2020-06-19T20:49:14Z) - A Survey on Knowledge Graph-Based Recommender Systems [65.50486149662564]
We conduct a systematical survey of knowledge graph-based recommender systems.
We focus on how the papers utilize the knowledge graph for accurate and explainable recommendation.
We introduce datasets used in these works.
arXiv Detail & Related papers (2020-02-28T02:26:30Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.