Sequence Adaptation via Reinforcement Learning in Recommender Systems
- URL: http://arxiv.org/abs/2108.01442v1
- Date: Sat, 31 Jul 2021 13:56:46 GMT
- Title: Sequence Adaptation via Reinforcement Learning in Recommender Systems
- Authors: Stefanos Antaris, Dimitrios Rafailidis
- Abstract summary: We propose the SAR model, which learns the sequential patterns and adjusts the sequence length of user-item interactions in a personalized manner.
In addition, we optimize a joint loss function to align the accuracy of the sequential recommendations with the expected cumulative rewards of the critic network.
Our experimental evaluation on four real-world datasets demonstrates the superiority of our proposed model over several baseline approaches.
- Score: 8.909115457491522
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Accounting for the fact that users have different sequential patterns, the
main drawback of state-of-the-art recommendation strategies is that a fixed
sequence length of user-item interactions is required as input to train the
models. This might limit the recommendation accuracy, as in practice users
follow different trends on the sequential recommendations. Hence, baseline
strategies might ignore important sequential interactions or add noise to the
models with redundant interactions, depending on the variety of users'
sequential behaviours. To overcome this problem, in this study we propose the
SAR model, which not only learns the sequential patterns but also adjusts the
sequence length of user-item interactions in a personalized manner. We first
design an actor-critic framework, where the RL agent tries to compute the
optimal sequence length as an action, given the user's state representation at
a certain time step. In addition, we optimize a joint loss function to align
the accuracy of the sequential recommendations with the expected cumulative
rewards of the critic network, while at the same time we adapt the sequence
length with the actor network in a personalized manner. Our experimental
evaluation on four real-world datasets demonstrates the superiority of our
proposed model over several baseline approaches. Finally, we make our
implementation publicly available at https://github.com/stefanosantaris/sar.
Related papers
- Long-Sequence Recommendation Models Need Decoupled Embeddings [49.410906935283585]
We identify and characterize a neglected deficiency in existing long-sequence recommendation models.
A single set of embeddings struggles with learning both attention and representation, leading to interference between these two processes.
We propose the Decoupled Attention and Representation Embeddings (DARE) model, where two distinct embedding tables are learned separately to fully decouple attention and representation.
arXiv Detail & Related papers (2024-10-03T15:45:15Z) - Does It Look Sequential? An Analysis of Datasets for Evaluation of Sequential Recommendations [0.8437187555622164]
Sequential recommender systems aim to use the order of interactions in a user's history to predict future interactions.
It is crucial to use datasets that exhibit a sequential structure to evaluate sequential recommenders properly.
We apply several methods based on the random shuffling of the user's sequence of interactions to assess the strength of sequential structure across 15 datasets.
arXiv Detail & Related papers (2024-08-21T21:40:07Z) - AdaptSSR: Pre-training User Model with Augmentation-Adaptive
Self-Supervised Ranking [19.1857792382924]
We propose Augmentation-Supervised Ranking (AdaptSSR) to replace the contrastive learning task.
We adopt a multiple pairwise ranking loss which trains the user model to capture the similarity orders between the implicitly augmented view, the explicitly augmented view, and views from other users.
Experiments on both public and industrial datasets with six downstream tasks verify the effectiveness of AdaptSSR.
arXiv Detail & Related papers (2023-10-15T02:19:28Z) - Latent User Intent Modeling for Sequential Recommenders [92.66888409973495]
Sequential recommender models learn to predict the next items a user is likely to interact with based on his/her interaction history on the platform.
Most sequential recommenders however lack a higher-level understanding of user intents, which often drive user behaviors online.
Intent modeling is thus critical for understanding users and optimizing long-term user experience.
arXiv Detail & Related papers (2022-11-17T19:00:24Z) - Ordinal Graph Gamma Belief Network for Social Recommender Systems [54.9487910312535]
We develop a hierarchical Bayesian model termed ordinal graph factor analysis (OGFA), which jointly models user-item and user-user interactions.
OGFA not only achieves good recommendation performance, but also extracts interpretable latent factors corresponding to representative user preferences.
We extend OGFA to ordinal graph gamma belief network, which is a multi-stochastic-layer deep probabilistic model.
arXiv Detail & Related papers (2022-09-12T09:19:22Z) - Modeling Dynamic User Preference via Dictionary Learning for Sequential
Recommendation [133.8758914874593]
Capturing the dynamics in user preference is crucial to better predict user future behaviors because user preferences often drift over time.
Many existing recommendation algorithms -- including both shallow and deep ones -- often model such dynamics independently.
This paper considers the problem of embedding a user's sequential behavior into the latent space of user preferences.
arXiv Detail & Related papers (2022-04-02T03:23:46Z) - Sequential Search with Off-Policy Reinforcement Learning [48.88165680363482]
We propose a highly scalable hybrid learning model that consists of an RNN learning framework and an attention model.
As a novel optimization step, we fit multiple short user sequences in a single RNN pass within a training batch, by solving a greedy knapsack problem on the fly.
We also explore the use of off-policy reinforcement learning in multi-session personalized search ranking.
arXiv Detail & Related papers (2022-02-01T06:52:40Z) - Contrastive Self-supervised Sequential Recommendation with Robust
Augmentation [101.25762166231904]
Sequential Recommendationdescribes a set of techniques to model dynamic user behavior in order to predict future interactions in sequential user data.
Old and new issues remain, including data-sparsity and noisy data.
We propose Contrastive Self-Supervised Learning for sequential Recommendation (CoSeRec)
arXiv Detail & Related papers (2021-08-14T07:15:25Z) - Denoising User-aware Memory Network for Recommendation [11.145186013006375]
We propose a novel CTR model named denoising user-aware memory network (DUMN)
DUMN uses the representation of explicit feedback to purify the representation of implicit feedback, and effectively denoise the implicit feedback.
Experiments on two real e-commerce user behavior datasets show that DUMN has a significant improvement over the state-of-the-art baselines.
arXiv Detail & Related papers (2021-07-12T14:39:36Z) - Position-enhanced and Time-aware Graph Convolutional Network for
Sequential Recommendations [3.286961611175469]
We propose a new deep learning-based sequential recommendation approach based on a Position-enhanced and Time-aware Graph Convolutional Network (PTGCN)
PTGCN models the sequential patterns and temporal dynamics between user-item interactions by defining a position-enhanced and time-aware graph convolution operation.
It realizes the high-order connectivity between users and items by stacking multi-layer graph convolutions.
arXiv Detail & Related papers (2021-07-12T07:34:20Z) - Sequential recommendation with metric models based on frequent sequences [0.688204255655161]
We propose to use frequent sequences to identify the most relevant part of the user history for the recommendation.
The most salient items are then used in a unified metric model that embeds items based on user preferences and sequential dynamics.
arXiv Detail & Related papers (2020-08-12T22:08:04Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.