Probabilistic Planning with Prioritized Preferences over Temporal Logic
Objectives
- URL: http://arxiv.org/abs/2304.11641v1
- Date: Sun, 23 Apr 2023 13:03:27 GMT
- Title: Probabilistic Planning with Prioritized Preferences over Temporal Logic
Objectives
- Authors: Lening Li, Hazhar Rahmani, Jie Fu
- Abstract summary: We study temporal planning in probabilistic environments, modeled as labeled Markov decision processes (MDPs)
This paper introduces a new specification language, termed prioritized qualitative choice linear temporal logic on finite traces.
We formulate and solve a problem of computing an optimal policy that minimizes the expected score of dissatisfaction given user preferences.
- Score: 26.180359884973566
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: This paper studies temporal planning in probabilistic environments, modeled
as labeled Markov decision processes (MDPs), with user preferences over
multiple temporal goals. Existing works reflect such preferences as a
prioritized list of goals. This paper introduces a new specification language,
termed prioritized qualitative choice linear temporal logic on finite traces,
which augments linear temporal logic on finite traces with prioritized
conjunction and ordered disjunction from prioritized qualitative choice logic.
This language allows for succinctly specifying temporal objectives with
corresponding preferences accomplishing each temporal task. The finite traces
that describe the system's behaviors are ranked based on their dissatisfaction
scores with respect to the formula. We propose a systematic translation from
the new language to a weighted deterministic finite automaton. Utilizing this
computational model, we formulate and solve a problem of computing an optimal
policy that minimizes the expected score of dissatisfaction given user
preferences. We demonstrate the efficacy and applicability of the logic and the
algorithm on several case studies with detailed analyses for each.
Related papers
- Learning Optimal Signal Temporal Logic Decision Trees for Classification: A Max-Flow MILP Formulation [5.924780594614676]
This paper presents a novel framework for inferring timed temporal logic properties from data.
We formulate the inference process as a mixed integer linear programming optimization problem.
Applying a max-flow algorithm on the resultant tree transforms the problem into a global optimization challenge.
We conduct three case studies involving two-class, multi-class, and complex formula classification scenarios.
arXiv Detail & Related papers (2024-07-30T16:56:21Z) - Belief-State Query Policies for Planning With Preferences Under Partial Observability [18.821166966365315]
Planning in real-world settings often entails addressing partial observability while aligning with users' preferences.
We present a novel framework for expressing users' preferences about agent behavior in a partially observable setting using parameterized belief-state query (BSQ) preferences.
We show that BSQ preferences provide a computationally feasible approach for planning with preferences in partially observable settings.
arXiv Detail & Related papers (2024-05-24T20:04:51Z) - Preference-Based Planning in Stochastic Environments: From Partially-Ordered Temporal Goals to Most Preferred Policies [25.731912021122287]
We consider systems modeled as Markov decision processes, given a partially ordered preference over a set of temporally extended goals.
To plan with the partially ordered preference, we introduce order theory to map a preference over temporal goals to a preference over policies for the MDP.
A most preferred policy under a ordering induces a nondominated probability distribution over the finite paths in the MDP.
arXiv Detail & Related papers (2024-03-27T02:46:09Z) - DetermLR: Augmenting LLM-based Logical Reasoning from Indeterminacy to Determinacy [76.58614128865652]
We propose DetermLR, a novel perspective that rethinks the reasoning process as an evolution from indeterminacy to determinacy.
First, we categorize known conditions into two types: determinate and indeterminate premises This provides an oveall direction for the reasoning process and guides LLMs in converting indeterminate data into progressively determinate insights.
We automate the storage and extraction of available premises and reasoning paths with reasoning memory, preserving historical reasoning details for subsequent reasoning steps.
arXiv Detail & Related papers (2023-10-28T10:05:51Z) - Causal Feature Selection via Transfer Entropy [59.999594949050596]
Causal discovery aims to identify causal relationships between features with observational data.
We introduce a new causal feature selection approach that relies on the forward and backward feature selection procedures.
We provide theoretical guarantees on the regression and classification errors for both the exact and the finite-sample cases.
arXiv Detail & Related papers (2023-10-17T08:04:45Z) - Probabilistic Planning with Partially Ordered Preferences over Temporal
Goals [22.77805882908817]
We study planning in Markov decision processes (MDPs) with preferences over temporally extended goals.
We introduce a variant of deterministic finite automaton, referred to as a preference DFA, for specifying the user's preferences over temporally extended goals.
We prove that a weak-stochastic nondominated policy given the preference specification is optimal in the constructed multi-objective MDP.
arXiv Detail & Related papers (2022-09-25T17:13:24Z) - Regret Analysis in Deterministic Reinforcement Learning [78.31410227443102]
We study the problem of regret, which is central to the analysis and design of optimal learning algorithms.
We present logarithmic problem-specific regret lower bounds that explicitly depend on the system parameter.
arXiv Detail & Related papers (2021-06-27T23:41:57Z) - Probabilistic Planning with Preferences over Temporal Goals [21.35365462532568]
We present a formal language for specifying qualitative preferences over temporal goals and a preference-based planning method in systems.
Using automata-theoretic modeling, the proposed specification allows us to express preferences over different sets of outcomes, where each outcome describes a set of temporal sequences of subgoals.
We define the value of preference satisfaction given a process over possible outcomes and develop an algorithm for time-constrained probabilistic planning in labeled Markov decision processes.
arXiv Detail & Related papers (2021-03-26T14:26:40Z) - Multi-Agent Reinforcement Learning with Temporal Logic Specifications [65.79056365594654]
We study the problem of learning to satisfy temporal logic specifications with a group of agents in an unknown environment.
We develop the first multi-agent reinforcement learning technique for temporal logic specifications.
We provide correctness and convergence guarantees for our main algorithm.
arXiv Detail & Related papers (2021-02-01T01:13:03Z) - An AI-Assisted Design Method for Topology Optimization Without
Pre-Optimized Training Data [68.8204255655161]
An AI-assisted design method based on topology optimization is presented, which is able to obtain optimized designs in a direct way.
Designs are provided by an artificial neural network, the predictor, on the basis of boundary conditions and degree of filling as input data.
arXiv Detail & Related papers (2020-12-11T14:33:27Z) - Options of Interest: Temporal Abstraction with Interest Functions [58.30081828754683]
We provide a generalization of initiation sets suitable for general function approximation, by defining an interest function associated with an option.
We derive a gradient-based learning algorithm for interest functions, leading to a new interest-option-critic architecture.
arXiv Detail & Related papers (2020-01-01T21:24:39Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.