Position Paper: Online Modeling for Offline Planning
- URL: http://arxiv.org/abs/2206.03356v1
- Date: Tue, 7 Jun 2022 14:48:08 GMT
- Title: Position Paper: Online Modeling for Offline Planning
- Authors: Eyal Weiss and Gal A. Kaminka
- Abstract summary: A key part of AI planning research is the representation of action models.
Despite the maturity of the field, AI planning technology is still rarely used outside the research community.
We argue that this is because the modeling process is assumed to have taken place and completed prior to the planning process.
- Score: 2.8326418377665346
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: The definition and representation of planning problems is at the heart of AI
planning research. A key part is the representation of action models. Decades
of advances improving declarative action model representations resulted in
numerous theoretical advances, and capable, working, domain-independent
planners. However, despite the maturity of the field, AI planning technology is
still rarely used outside the research community, suggesting that current
representations fail to capture real-world requirements, such as utilizing
complex mathematical functions and models learned from data. We argue that this
is because the modeling process is assumed to have taken place and completed
prior to the planning process, i.e., offline modeling for offline planning.
There are several challenges inherent to this approach, including: limited
expressiveness of declarative modeling languages; early commitment to modeling
choices and computation, that preclude using the most appropriate resolution
for each action model -- which can only be known during planning; and
difficulty in reliably using non-declarative, learned, models.
We therefore suggest to change the AI planning process, such that is carries
out online modeling in offline planning, i.e., the use of action models that
are computed or even generated as part of the planning process, as they are
accessed. This generalizes the existing approach (offline modeling). The
proposed definition admits novel planning processes, and we suggest one
concrete implementation, demonstrating the approach. We sketch initial results
that were obtained as part of a first attempt to follow this approach by
planning with action cost estimators. We conclude by discussing open
challenges.
Related papers
- Closed-Loop Long-Horizon Robotic Planning via Equilibrium Sequence Modeling [23.62433580021779]
We advocate a self-refining scheme that iteratively refines a draft plan until an equilibrium is reached.
A nested equilibrium sequence modeling procedure is devised for efficient closed-loop planning.
Our method is evaluated on the VirtualHome-Env benchmark, showing advanced performance with better scaling for inference.
arXiv Detail & Related papers (2024-10-02T11:42:49Z) - Adaptive Planning with Generative Models under Uncertainty [20.922248169620783]
Planning with generative models has emerged as an effective decision-making paradigm across a wide range of domains.
While continuous replanning at each timestep might seem intuitive because it allows decisions to be made based on the most recent environmental observations, it results in substantial computational challenges.
Our work addresses this challenge by introducing a simple adaptive planning policy that leverages the generative model's ability to predict long-horizon state trajectories.
arXiv Detail & Related papers (2024-08-02T18:07:53Z) - Automated Process Planning Based on a Semantic Capability Model and SMT [50.76251195257306]
In research of manufacturing systems and autonomous robots, the term capability is used for a machine-interpretable specification of a system function.
We present an approach that combines these two topics: starting from a semantic capability model, an AI planning problem is automatically generated.
arXiv Detail & Related papers (2023-12-14T10:37:34Z) - Planning as In-Painting: A Diffusion-Based Embodied Task Planning
Framework for Environments under Uncertainty [56.30846158280031]
Task planning for embodied AI has been one of the most challenging problems.
We propose a task-agnostic method named 'planning as in-painting'
The proposed framework achieves promising performances in various embodied AI tasks.
arXiv Detail & Related papers (2023-12-02T10:07:17Z) - Dual policy as self-model for planning [71.73710074424511]
We refer to the model used to simulate one's decisions as the agent's self-model.
Inspired by current reinforcement learning approaches and neuroscience, we explore the benefits and limitations of using a distilled policy network as the self-model.
arXiv Detail & Related papers (2023-06-07T13:58:45Z) - Planning with Sequence Models through Iterative Energy Minimization [22.594413287842574]
We suggest an approach towards integrating planning with sequence models based on the idea of iterative energy minimization.
We train a masked language model to capture an implicit energy function over trajectories of actions, and formulate planning as finding a trajectory of actions with minimum energy.
We illustrate how this procedure enables improved performance over recent approaches across BabyAI and Atari environments.
arXiv Detail & Related papers (2023-03-28T17:53:22Z) - Visual Learning-based Planning for Continuous High-Dimensional POMDPs [81.16442127503517]
Visual Tree Search (VTS) is a learning and planning procedure that combines generative models learned offline with online model-based POMDP planning.
VTS bridges offline model training and online planning by utilizing a set of deep generative observation models to predict and evaluate the likelihood of image observations in a Monte Carlo tree search planner.
We show that VTS is robust to different observation noises and, since it utilizes online, model-based planning, can adapt to different reward structures without the need to re-train.
arXiv Detail & Related papers (2021-12-17T11:53:31Z) - Safe Learning of Lifted Action Models [46.65973550325976]
We propose an algorithm for solving the model-free planning problem in classical planning.
The number of trajectories needed to solve future problems with high probability is linear in the potential size of the domain model.
arXiv Detail & Related papers (2021-07-09T01:24:01Z) - Forethought and Hindsight in Credit Assignment [62.05690959741223]
We work to understand the gains and peculiarities of planning employed as forethought via forward models or as hindsight operating with backward models.
We investigate the best use of models in planning, primarily focusing on the selection of states in which predictions should be (re)-evaluated.
arXiv Detail & Related papers (2020-10-26T16:00:47Z) - STRIPS Action Discovery [67.73368413278631]
Recent approaches have shown the success of classical planning at synthesizing action models even when all intermediate states are missing.
We propose a new algorithm to unsupervisedly synthesize STRIPS action models with a classical planner when action signatures are unknown.
arXiv Detail & Related papers (2020-01-30T17:08:39Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.