Goal-Oriented Next Best Activity Recommendation using Reinforcement
Learning
- URL: http://arxiv.org/abs/2205.03219v1
- Date: Fri, 6 May 2022 13:48:14 GMT
- Title: Goal-Oriented Next Best Activity Recommendation using Reinforcement
Learning
- Authors: Prerna Agarwal, Avani Gupta, Renuka Sindhgatta, Sampath Dechu
- Abstract summary: We propose a goal-oriented next best activity recommendation framework.
A deep learning model predicts the next best activity and an estimated value of a goal given the activity.
A reinforcement learning method explores sequences of activities, using these estimates to find those likely to meet one or more goals.
- Score: 4.128679340077271
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Recommending a sequence of activities for an ongoing case requires that the
recommendations conform to the underlying business process and meet the
performance goal of either completion time or process outcome. Existing work on
next activity prediction can predict the future activity but cannot guarantee
that the prediction is conformant or meets the goal. Hence, we propose a
goal-oriented next best activity recommendation. Our proposed framework uses a
deep learning model to predict the next best activity and an estimated value of
a goal given that activity. A reinforcement learning method then explores
sequences of activities, using these estimates to find those likely to meet one
or more goals. We further address the real-world problem of multiple goals by
introducing an additional reward function that balances the outcome of a
recommended activity with goal satisfaction. We demonstrate the effectiveness
of the proposed method on four real-world datasets with different
characteristics. The results show that the recommendations from our proposed
approach outperform existing state-of-the-art next best activity recommendation
techniques in goal satisfaction and conformance.
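The abstract describes the framework only at a high level. As a rough illustration of the recommendation loop it sketches (a predictive model proposes conformant next activities with goal-value estimates, and a combined reward balances multiple goals), here is a minimal, self-contained Python sketch. It is not the authors' implementation: the activity names, the hand-coded transition table standing in for the process model, the stubbed predictor, the reward weights, and the epsilon-greedy roll-out are all assumptions made so the example runs.

```python
# Illustrative sketch only -- not the paper's implementation. A trained deep
# learning model is replaced by a hand-coded stub so the example is runnable.
import random

# Toy "process model": allowed transitions keep recommendations conformant.
ALLOWED_NEXT = {
    "register": ["triage", "reject"],
    "triage": ["treat", "escalate"],
    "treat": ["discharge"],
    "escalate": ["treat"],
    "reject": ["end"],
    "discharge": ["end"],
}

def predict_candidates(trace):
    """Stub for the predictive model: for each conformant next activity, return
    (activity, estimated completion time, estimated outcome). Estimates here
    are random placeholders."""
    last = trace[-1]
    return [(a, random.uniform(1, 10), random.random())
            for a in ALLOWED_NEXT.get(last, [])]

def combined_reward(est_time, est_outcome, w_time=0.5, w_outcome=0.5):
    """Additional reward balancing two goals: shorter completion time and a
    higher chance of a positive outcome. Weights and the time normalisation
    constant are assumptions."""
    return w_outcome * est_outcome - w_time * (est_time / 10.0)

def recommend_sequence(trace, horizon=5, epsilon=0.1):
    """Roll out a sequence of activities, mostly exploiting the estimated
    combined reward and occasionally exploring (epsilon-greedy)."""
    trace = list(trace)
    for _ in range(horizon):
        candidates = predict_candidates(trace)
        if not candidates:
            break
        if random.random() < epsilon:          # explore
            activity, _, _ = random.choice(candidates)
        else:                                   # exploit the reward estimate
            activity, _, _ = max(candidates,
                                 key=lambda c: combined_reward(c[1], c[2]))
        trace.append(activity)
        if activity == "end":
            break
    return trace

if __name__ == "__main__":
    print(recommend_sequence(["register"]))
```

In this toy form the exploration is a simple epsilon-greedy roll-out; the paper's reinforcement learning method and learned value estimates would replace the stubbed predictor and the hand-tuned weights.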
Related papers
- Human Action Anticipation: A Survey [86.415721659234]
The literature on behavior prediction spans various tasks, including action anticipation, activity forecasting, intent prediction, goal prediction, and so on.
Our survey aims to tie together this fragmented literature, covering recent technical innovations as well as the development of new large-scale datasets for model training and evaluation.
arXiv Detail & Related papers (2024-10-17T21:37:40Z) - Propose, Assess, Search: Harnessing LLMs for Goal-Oriented Planning in Instructional Videos [48.15438373870542]
VidAssist is an integrated framework designed for zero/few-shot goal-oriented planning in instructional videos.
It employs a breadth-first search algorithm for optimal plan generation.
Experiments demonstrate that VidAssist offers a unified framework for different goal-oriented planning setups.
arXiv Detail & Related papers (2024-09-30T17:57:28Z) - From Recognition to Prediction: Leveraging Sequence Reasoning for Action Anticipation [30.161471749050833]
We propose a novel end-to-end video modeling architecture that utilizes attention mechanisms, named Anticipation via Recognition and Reasoning (ARR).
ARR decomposes the action anticipation task into action recognition and reasoning tasks, and effectively learns the statistical relationship between actions by next action prediction (NAP).
In addition, to address the challenge of relationship modeling that requires extensive training data, we propose an innovative approach for the unsupervised pre-training of the decoder.
arXiv Detail & Related papers (2024-08-05T18:38:29Z) - Code Models are Zero-shot Precondition Reasoners [83.8561159080672]
We use code representations to reason about action preconditions for sequential decision making tasks.
We propose a precondition-aware action sampling strategy that ensures actions predicted by a policy are consistent with preconditions.
arXiv Detail & Related papers (2023-11-16T06:19:27Z) - Action Anticipation with Goal Consistency [19.170733994203367]
We propose to harness high-level intent information to anticipate actions that will take place in the future.
We show the effectiveness of the proposed approach and demonstrate that our method achieves state-of-the-art results on two large-scale datasets.
arXiv Detail & Related papers (2023-06-26T20:04:23Z) - Towards Out-of-Distribution Sequential Event Prediction: A Causal
Treatment [72.50906475214457]
The goal of sequential event prediction is to estimate the next event based on a sequence of historical events.
In practice, next-event prediction models are trained on sequential data collected at one point in time.
We propose a framework with hierarchical branching structures for learning context-specific representations.
arXiv Detail & Related papers (2022-10-24T07:54:13Z) - Learning to act: a Reinforcement Learning approach to recommend the best
next activities [4.511664266033014]
This paper investigates an approach that learns, by means of Reinforcement Learning, an optimal policy from the observation of past executions.
The potentiality of the approach has been demonstrated on two scenarios taken from real-life data.
arXiv Detail & Related papers (2022-03-29T09:43:39Z) - Learning to Plan Optimistically: Uncertainty-Guided Deep Exploration via
Latent Model Ensembles [73.15950858151594]
This paper presents Latent Optimistic Value Exploration (LOVE), a strategy that enables deep exploration through optimism in the face of uncertain long-term rewards.
We combine latent world models with value function estimation to predict infinite-horizon returns and recover associated uncertainty via ensembling (a minimal sketch of this ensemble-based optimism appears after this list).
We apply LOVE to visual robot control tasks in continuous action spaces and demonstrate on average more than 20% improved sample efficiency in comparison to state-of-the-art and other exploration objectives.
arXiv Detail & Related papers (2020-10-27T22:06:57Z) - Reward Maximisation through Discrete Active Inference [1.2074552857379273]
We show how and when active inference agents perform actions that are optimal for maximising reward.
We show the conditions under which active inference produces the optimal solution to the Bellman equation.
We complement the analysis with a discussion of the broader relationship between active inference and reinforcement learning.
arXiv Detail & Related papers (2020-09-17T07:13:59Z) - Long-Term Anticipation of Activities with Cycle Consistency [90.79357258104417]
We propose a framework for anticipating future activities directly from the features of the observed frames and train it in an end-to-end fashion.
Our framework achieves state-of-the-art results on two datasets: the Breakfast dataset and 50Salads.
arXiv Detail & Related papers (2020-09-02T15:41:32Z)
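For the Latent Optimistic Value Exploration (LOVE) entry above, the core idea of recovering uncertainty from an ensemble of value estimates and acting optimistically can be illustrated in a few lines of Python. This is a hedged sketch, not that paper's method or code: the toy action set, the noisy-table ensemble, and the beta bonus weight are assumptions.

```python
# Illustrative sketch of ensemble-based optimistic value estimation; not LOVE's
# actual implementation, which uses latent world models and learned value nets.
import random
import statistics

def optimistic_value(estimates, beta=1.0):
    """Upper-confidence value from an ensemble of return estimates: mean + beta * std."""
    return statistics.fmean(estimates) + beta * statistics.pstdev(estimates)

def select_action(ensemble, actions, beta=1.0):
    """Pick the action whose optimistic (uncertainty-bonused) value is largest."""
    return max(actions,
               key=lambda a: optimistic_value([member(a) for member in ensemble], beta))

if __name__ == "__main__":
    # Toy ensemble: five value "models", each a noisy copy of one base table.
    base = {"left": 1.0, "right": 1.2, "stay": 0.8}
    rng = random.Random(0)
    tables = [{a: v + rng.gauss(0, 0.3) for a, v in base.items()} for _ in range(5)]
    members = [lambda a, table=table: table[a] for table in tables]
    print(select_action(members, list(base)))
```

Actions on which the ensemble members disagree receive a larger bonus, which is what drives deep exploration under uncertain long-term rewards.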
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of its content (including all information) and is not responsible for any consequences of its use.