Gradient-Based Mixed Planning with Discrete and Continuous Actions
- URL: http://arxiv.org/abs/2110.10007v1
- Date: Tue, 19 Oct 2021 14:21:19 GMT
- Title: Gradient-Based Mixed Planning with Discrete and Continuous Actions
- Authors: Kebing Jin, Hankz Hankui Zhuo, Zhanhao Xiao, Hai Wan, Subbarao
Kambhampati
- Abstract summary: We propose a quadratic-based framework to simultaneously optimize continuous parameters and actions of candidate plans.
The framework is combined with a module to estimate the best plan candidate to transit initial state to the goal based on relaxation.
- Score: 34.885999774739055
- License: http://creativecommons.org/licenses/by-nc-sa/4.0/
- Abstract: Dealing with planning problems with both discrete logical relations and
continuous numeric changes in real-world dynamic environments is challenging.
Existing numeric planning systems for the problem often discretize numeric
variables or impose convex quadratic constraints on numeric variables, which
harms the performance when solving the problem. In this paper, we propose a
novel algorithm framework to solve the numeric planning problems mixed with
discrete and continuous actions based on gradient descent. We cast the numeric
planning with discrete and continuous actions as an optimization problem by
integrating a heuristic function based on discrete effects. Specifically, we
propose a gradient-based framework to simultaneously optimize continuous
parameters and actions of candidate plans. The framework is combined with a
heuristic module to estimate the best plan candidate to transit initial state
to the goal based on relaxation. We repeatedly update numeric parameters and
compute candidate plan until it converges to a valid plan to the planning
problem. In the empirical study, we exhibit that our algorithm framework is
both effective and efficient, especially when solving non-convex planning
problems.
Related papers
- Accelerated zero-order SGD under high-order smoothness and overparameterized regime [79.85163929026146]
We present a novel gradient-free algorithm to solve convex optimization problems.
Such problems are encountered in medicine, physics, and machine learning.
We provide convergence guarantees for the proposed algorithm under both types of noise.
arXiv Detail & Related papers (2024-11-21T10:26:17Z) - Planning as In-Painting: A Diffusion-Based Embodied Task Planning
Framework for Environments under Uncertainty [56.30846158280031]
Task planning for embodied AI has been one of the most challenging problems.
We propose a task-agnostic method named 'planning as in-painting'
The proposed framework achieves promising performances in various embodied AI tasks.
arXiv Detail & Related papers (2023-12-02T10:07:17Z) - Multi-Valued Partial Order Plans in Numeric Planning [14.290119665435121]
We will start by reformulating a numeric planning problem known as restricted tasks as a search problem.
We will then show how an NP-complete fragment of numeric planning can be found by using Booleans.
To achieve this, we will develop the idea of multi-valued partial order plans.
arXiv Detail & Related papers (2023-07-27T07:24:30Z) - Numerical Methods for Convex Multistage Stochastic Optimization [86.45244607927732]
We focus on optimisation programming (SP), Optimal Control (SOC) and Decision Processes (MDP)
Recent progress in solving convex multistage Markov problems is based on cutting planes approximations of the cost-to-go functions of dynamic programming equations.
Cutting plane type methods can handle multistage problems with a large number of stages, but a relatively smaller number of state (decision) variables.
arXiv Detail & Related papers (2023-03-28T01:30:40Z) - Efficient Temporal Piecewise-Linear Numeric Planning with Lazy
Consistency Checking [4.834203844100679]
We propose a set of techniques that allow the planner to compute LP consistency checks lazily where possible.
We also propose an algorithm to perform duration-dependent goal checking more selectively.
The resultant planner is not only more efficient, but outperforms most state-of-the-art temporal-numeric and hybrid planners.
arXiv Detail & Related papers (2021-05-21T07:36:54Z) - Multiple Plans are Better than One: Diverse Stochastic Planning [26.887796946596243]
In planning problems, it is often challenging to fully model the desired specifications.
In particular, in human-robot interaction, such difficulty may arise due to human's preferences that are either private or complex to model.
We formulate a problem, called diverse planning, that aims to generate a set of representative behaviors that are near-optimal.
arXiv Detail & Related papers (2020-12-31T07:29:11Z) - Consistent Second-Order Conic Integer Programming for Learning Bayesian
Networks [2.7473982588529653]
We study the problem of learning the sparse DAG structure of a BN from continuous observational data.
The optimal solution to this mathematical program is known to have desirable statistical properties under certain conditions.
We propose a concrete early stopping criterion to terminate the branch-and-bound process in order to obtain a near-optimal solution.
arXiv Detail & Related papers (2020-05-29T00:13:15Z) - Divide-and-Conquer Monte Carlo Tree Search For Goal-Directed Planning [78.65083326918351]
We consider alternatives to an implicit sequential planning assumption.
We propose Divide-and-Conquer Monte Carlo Tree Search (DC-MCTS) for approximating the optimal plan.
We show that this algorithmic flexibility over planning order leads to improved results in navigation tasks in grid-worlds.
arXiv Detail & Related papers (2020-04-23T18:08:58Z) - STRIPS Action Discovery [67.73368413278631]
Recent approaches have shown the success of classical planning at synthesizing action models even when all intermediate states are missing.
We propose a new algorithm to unsupervisedly synthesize STRIPS action models with a classical planner when action signatures are unknown.
arXiv Detail & Related papers (2020-01-30T17:08:39Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.