Federated Privacy-preserving Collaborative Filtering for On-Device Next
App Prediction
- URL: http://arxiv.org/abs/2303.04744v1
- Date: Sun, 5 Feb 2023 10:29:57 GMT
- Title: Federated Privacy-preserving Collaborative Filtering for On-Device Next
App Prediction
- Authors: Albert Sayapin, Gleb Balitskiy, Daniel Bershatsky, Aleksandr Katrutsa,
Evgeny Frolov, Alexey Frolov, Ivan Oseledets, Vitaliy Kharin
- Abstract summary: We propose a novel SeqMF model to solve the problem of predicting the next app launch during mobile device usage.
We modify the structure of the classical matrix factorization model and update the training procedure to sequential learning.
One more ingredient of the proposed approach is a new privacy mechanism that guarantees the protection of the sent data from the users to the remote server.
- Score: 52.16923290335873
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: In this study, we propose a novel SeqMF model to solve the problem of
predicting the next app launch during mobile device usage. Although this
problem can be represented as a classical collaborative filtering problem, it
requires proper modification since the data are sequential, the user feedback
is distributed among devices and the transmission of users' data to aggregate
common patterns must be protected against leakage. According to such
requirements, we modify the structure of the classical matrix factorization
model and update the training procedure to sequential learning. Since the data
about user experience are distributed among devices, the federated learning
setup is used to train the proposed sequential matrix factorization model. One
more ingredient of the proposed approach is a new privacy mechanism that
guarantees the protection of the sent data from the users to the remote server.
To demonstrate the efficiency of the proposed model we use publicly available
mobile user behavior data. We compare our model with sequential rules and
models based on the frequency of app launches. The comparison is conducted in
static and dynamic environments. The static environment evaluates how our model
processes sequential data compared to competitors. Therefore, the standard
train-validation-test evaluation procedure is used. The dynamic environment
emulates the real-world scenario, where users generate new data by running apps
on devices, and evaluates our model in this case. Our experiments show that the
proposed model provides comparable quality with other methods in the static
environment. However, more importantly, our method achieves a better
privacy-utility trade-off than competitors in the dynamic environment, which
provides more accurate simulations of real-world usage.
Related papers
- GenRec: A Flexible Data Generator for Recommendations [1.384948712833979]
GenRec is a novel framework for generating synthetic user-item interactions that exhibit realistic and well-known properties.
The framework is based on a generative process based on latent factor modeling.
arXiv Detail & Related papers (2024-07-23T15:53:17Z) - Dynamic Collaborative Filtering for Matrix- and Tensor-based Recommender
Systems [5.1148288291550505]
We introduce a novel collaborative filtering model for sequential problems known as TIRecA.
TIRecA efficiently updates its parameters using only the new data segment, allowing incremental addition of new users and items to the recommender system.
Our comparison with general matrix and tensor-based baselines in terms of prediction quality and computational time reveals that TIRecA achieves comparable quality to the baseline methods, while being 10-20 times faster in training time.
arXiv Detail & Related papers (2023-12-04T20:45:51Z) - Going beyond research datasets: Novel intent discovery in the industry
setting [60.90117614762879]
This paper proposes methods to improve the intent discovery pipeline deployed in a large e-commerce platform.
We show the benefit of pre-training language models on in-domain data: both self-supervised and with weak supervision.
We also devise the best method to utilize the conversational structure (i.e., question and answer) of real-life datasets during fine-tuning for clustering tasks, which we call Conv.
arXiv Detail & Related papers (2023-05-09T14:21:29Z) - Latent User Intent Modeling for Sequential Recommenders [92.66888409973495]
Sequential recommender models learn to predict the next items a user is likely to interact with based on his/her interaction history on the platform.
Most sequential recommenders however lack a higher-level understanding of user intents, which often drive user behaviors online.
Intent modeling is thus critical for understanding users and optimizing long-term user experience.
arXiv Detail & Related papers (2022-11-17T19:00:24Z) - Sequential Ensembling for Semantic Segmentation [4.030520171276982]
We benchmark the popular ensembling approach of combining predictions of multiple, independently-trained, state-of-the-art models.
We propose a novel method inspired by boosting to sequentially ensemble networks that significantly outperforms the naive ensemble baseline.
arXiv Detail & Related papers (2022-10-08T22:13:59Z) - HyperImpute: Generalized Iterative Imputation with Automatic Model
Selection [77.86861638371926]
We propose a generalized iterative imputation framework for adaptively and automatically configuring column-wise models.
We provide a concrete implementation with out-of-the-box learners, simulators, and interfaces.
arXiv Detail & Related papers (2022-06-15T19:10:35Z) - Modeling Dynamic User Preference via Dictionary Learning for Sequential
Recommendation [133.8758914874593]
Capturing the dynamics in user preference is crucial to better predict user future behaviors because user preferences often drift over time.
Many existing recommendation algorithms -- including both shallow and deep ones -- often model such dynamics independently.
This paper considers the problem of embedding a user's sequential behavior into the latent space of user preferences.
arXiv Detail & Related papers (2022-04-02T03:23:46Z) - Top-N Recommendation with Counterfactual User Preference Simulation [26.597102553608348]
Top-N recommendation, which aims to learn user ranking-based preference, has long been a fundamental problem in a wide range of applications.
In this paper, we propose to reformulate the recommendation task within the causal inference framework to handle the data scarce problem.
arXiv Detail & Related papers (2021-09-02T14:28:46Z) - Contrastive Self-supervised Sequential Recommendation with Robust
Augmentation [101.25762166231904]
Sequential Recommendationdescribes a set of techniques to model dynamic user behavior in order to predict future interactions in sequential user data.
Old and new issues remain, including data-sparsity and noisy data.
We propose Contrastive Self-Supervised Learning for sequential Recommendation (CoSeRec)
arXiv Detail & Related papers (2021-08-14T07:15:25Z) - Multitarget Tracking with Transformers [21.81266872964314]
Multitarget Tracking (MTT) is a problem of tracking the states of an unknown number of objects using noisy measurements.
In this paper, we propose a high-performing deep-learning method for MTT based on the Transformer architecture.
arXiv Detail & Related papers (2021-04-01T19:14:55Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.