JDRec: Practical Actor-Critic Framework for Online Combinatorial
Recommender System
- URL: http://arxiv.org/abs/2207.13311v1
- Date: Wed, 27 Jul 2022 05:47:12 GMT
- Title: JDRec: Practical Actor-Critic Framework for Online Combinatorial
Recommender System
- Authors: Xin Zhao (1), Zhiwei Fang (1), Yuchen Guo (2), Jie He (1), Wenlong
Chen (1), Changping Peng (1) ((1) JD.com, (2) Tsinghua University)
- Abstract summary: A combinatorial recommender (CR) system feeds a list of items to a user at a time in the result page, in which the user behavior is affected by both contextual information and items.
Despite its importance, building a practical CR system remains a challenge because of the efficiency, dynamics, and personalization requirements of the online environment.
- Score: 0.0
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: A combinatorial recommender (CR) system feeds a list of items to a user at a
time in the result page, in which the user behavior is affected by both
contextual information and items. The CR is formulated as a combinatorial
optimization problem with the objective of maximizing the recommendation reward
of the whole list. Despite its importance, it is still a challenge to build a
practical CR system, due to the efficiency, dynamics, and personalization
requirements of the online environment. In particular, we split the problem into two
sub-problems, list generation and list evaluation. Novel and practical model
architectures are designed for these sub-problems, aiming to jointly optimize
effectiveness and efficiency. To adapt to the online setting, a bootstrap
algorithm forming an actor-critic reinforcement framework is given to explore
better recommendation modes in long-term user interaction. Offline and online
experiment results demonstrate the efficacy of the proposed JDRec framework. JDRec
has been applied in online JD recommendation, improving click-through rate by
2.6% and synthetical value for the platform by 5.03%. We will publish the
large-scale dataset used in this study to contribute to the research community.
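The split into list generation and list evaluation described in the abstract can be illustrated with a toy sketch. This is not the JDRec architecture, which the abstract does not detail: the random sampler standing in for the generator, the position-discounted score with an adjacent-category penalty standing in for the evaluator, and every name below (`generate_candidate_lists`, `evaluate_list`, `recommend`) are hypothetical. No learning or bootstrap update is shown, only the generate-then-evaluate selection step.

```python
import random
from typing import Dict, List, Sequence


def generate_candidate_lists(items: Sequence[str], list_size: int,
                             num_candidates: int,
                             rng: random.Random) -> List[List[str]]:
    """Hypothetical generator stand-in: sample lists without replacement."""
    return [rng.sample(list(items), list_size) for _ in range(num_candidates)]


def evaluate_list(item_list: Sequence[str], relevance: Dict[str, float],
                  category: Dict[str, str]) -> float:
    """Hypothetical evaluator stand-in: score the list as a whole.

    Each item's gain is discounted by its position and halved when it
    follows an item of the same category, so the score depends on the
    list context and not only on the individual items.
    """
    total = 0.0
    for pos, item in enumerate(item_list):
        gain = relevance[item] / (pos + 1)
        if pos > 0 and category[item] == category[item_list[pos - 1]]:
            gain *= 0.5
        total += gain
    return total


def recommend(items: Sequence[str], relevance: Dict[str, float],
              category: Dict[str, str], list_size: int = 3,
              num_candidates: int = 20, seed: int = 0) -> List[str]:
    """Generate candidate lists, then return the one the evaluator ranks highest."""
    rng = random.Random(seed)
    candidates = generate_candidate_lists(items, list_size, num_candidates, rng)
    return max(candidates, key=lambda c: evaluate_list(c, relevance, category))


if __name__ == "__main__":
    relevance = {"a": 0.9, "b": 0.8, "c": 0.7, "d": 0.2}
    category = {"a": "phone", "b": "phone", "c": "book", "d": "book"}
    print(recommend(list(relevance), relevance, category))
```

Separating the two roles lets the generator stay cheap while the evaluator alone carries the list-level context. In an actor-critic setting the evaluator's scores would also serve as the training signal for the generator, which this sketch leaves out.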
Related papers
- An End-to-End Multi-objective Ensemble Ranking Framework for Video Recommendation [20.59012057446529]
We propose a novel End-to-end Multi-objective Ensemble Ranking framework (EMER) for the multi-objective ensemble ranking module.
EMER replaces manually designed formulas with an end-to-end modeling paradigm.
Our framework has been deployed in the primary scenarios of Kuaishou, a short video recommendation platform with hundreds of millions of daily active users.
arXiv Detail & Related papers (2025-08-07T07:21:46Z) - RADAR: Recall Augmentation through Deferred Asynchronous Retrieval [0.0]
We introduce Recall Augmentation through Deferred Asynchronous Retrieval (RADAR).
RADAR pre-ranks a significantly larger candidate set for users using the full complexity ranking model.
RADAR significantly boosts recall by effectively combining a larger retrieved candidate set with a more powerful ranking model.
arXiv Detail & Related papers (2025-06-08T19:21:46Z) - Search-Based Interaction For Conversation Recommendation via Generative Reward Model Based Simulated User [117.82681846559909]
Conversational recommendation systems (CRSs) use multi-turn interaction to capture user preferences and provide personalized recommendations.
We propose a generative reward model based simulated user, named GRSU, for automatic interaction with CRSs.
arXiv Detail & Related papers (2025-04-29T06:37:30Z) - Review, Refine, Repeat: Understanding Iterative Decoding of AI Agents with Dynamic Evaluation and Selection [71.92083784393418]
Inference-time methods such as Best-of-N (BON) sampling offer a simple yet effective alternative to improve performance.
We propose Iterative Agent Decoding (IAD) which combines iterative refinement with dynamic candidate evaluation and selection guided by a verifier.
arXiv Detail & Related papers (2025-04-02T17:40:47Z) - Efficient and Robust Regularized Federated Recommendation [52.24782464815489]
The recommender system (RSRS) addresses both user preference and privacy concerns.
We propose a novel method that incorporates non-uniform gradient descent to improve communication efficiency.
RFRecF's superior robustness compared to diverse baselines.
arXiv Detail & Related papers (2024-11-03T12:10:20Z) - Online and Offline Evaluations of Collaborative Filtering and Content Based Recommender Systems [0.0]
This study provides a comparative analysis of a large-scale recommender system operating in Iran.
The system employs user-based and item-based recommendations using content-based, collaborative filtering, trend-based methods, and hybrid approaches.
Our methods of evaluation include manual evaluation, offline tests including accuracy and ranking metrics like hit-rate@k and nDCG, and online tests consisting of click-through rate (CTR)
arXiv Detail & Related papers (2024-11-02T20:05:31Z) - CART: A Generative Cross-Modal Retrieval Framework with Coarse-To-Fine Semantic Modeling [53.97609687516371]
Cross-modal retrieval aims to search for instances that are semantically related to the query through the interaction of different modal data.
Traditional solutions utilize a single-tower or dual-tower framework to explicitly compute the score between queries and candidates.
We propose a generative cross-modal retrieval framework (CART) based on coarse-to-fine semantic modeling.
arXiv Detail & Related papers (2024-06-25T12:47:04Z) - A Thorough Performance Benchmarking on Lightweight Embedding-based Recommender Systems [67.52782366565658]
State-of-the-art recommender systems (RSs) depend on categorical features, which are encoded by embedding vectors, resulting in excessively large embedding tables.
Despite the prosperity of lightweight embedding-based RSs, a wide diversity is seen in evaluation protocols.
This study investigates various LERS' performance, efficiency, and cross-task transferability via a thorough benchmarking process.
arXiv Detail & Related papers (2024-06-25T07:45:00Z) - EASRec: Elastic Architecture Search for Efficient Long-term Sequential
Recommender Systems [82.76483989905961]
Current Sequential Recommender Systems (SRSs) suffer from computational and resource inefficiencies.
We develop the Elastic Architecture Search for Efficient Long-term Sequential Recommender Systems (EASRec)
EASRec introduces data-aware gates that leverage historical information from input data batch to improve the performance of the recommendation network.
arXiv Detail & Related papers (2024-02-01T07:22:52Z) - Broad Recommender System: An Efficient Nonlinear Collaborative Filtering
Approach [56.12815715932561]
We propose a new broad recommender system called Broad Collaborative Filtering (BroadCF)
Instead of Deep Neural Networks (DNNs), Broad Learning System (BLS) is used as a mapping function to learn the complex nonlinear relationships between users and items.
Extensive experiments conducted on seven benchmark datasets have confirmed the effectiveness of the proposed BroadCF algorithm.
arXiv Detail & Related papers (2022-04-20T01:25:08Z) - Building an Efficient and Effective Retrieval-based Dialogue System via
Mutual Learning [27.04857039060308]
We propose to combine the best of both worlds to build a retrieval system.
We employ a fast bi-encoder to replace the traditional feature-based pre-retrieval model.
We train the pre-retrieval model and the re-ranking model at the same time via mutual learning.
arXiv Detail & Related papers (2021-10-01T01:32:33Z) - Leveraging Historical Interaction Data for Improving Conversational
Recommender System [105.90963882850265]
We propose a novel pre-training approach to integrate item- and attribute-based preference sequence.
Experiment results on two real-world datasets have demonstrated the effectiveness of our approach.
arXiv Detail & Related papers (2020-08-19T03:43:50Z) - Self-Supervised Reinforcement Learning for Recommender Systems [77.38665506495553]
We propose self-supervised reinforcement learning for sequential recommendation tasks.
Our approach augments standard recommendation models with two output layers: one for self-supervised learning and the other for RL.
Based on such an approach, we propose two frameworks namely Self-Supervised Q-learning(SQN) and Self-Supervised Actor-Critic(SAC)
arXiv Detail & Related papers (2020-06-10T11:18:57Z) - A Text-based Deep Reinforcement Learning Framework for Interactive
Recommendation [15.723042747172688]
We propose a Text-based Deep Deterministic Policy Gradient framework (TDDPG-Rec) for interactive recommender systems (IRSs)
Specifically, we leverage textual information to map items and users into a feature space, which greatly alleviates the sparsity problem.
We show that TDDPG-Rec achieves state-of-the-art performance over several baselines in a time-efficient manner.
arXiv Detail & Related papers (2020-04-14T16:46:01Z) - RNE: A Scalable Network Embedding for Billion-scale Recommendation [21.6366085346674]
We propose RNE, a data-efficient Recommendation-based Network Embedding method, to give personalized and diverse items to users.
On the one hand, the method is able to preserve the local structure between the users and items while modeling the diversity and dynamic property of the user interest to boost the recommendation quality.
We deploy RNE on a recommendation scenario of Taobao, the largest E-commerce platform in China, and train it on a billion-scale user-item graph.
arXiv Detail & Related papers (2020-03-10T07:08:57Z)
This list is automatically generated from the titles and abstracts of the papers in this site.