Humans decompose tasks by trading off utility and computational cost
- URL: http://arxiv.org/abs/2211.03890v1
- Date: Mon, 7 Nov 2022 22:45:46 GMT
- Title: Humans decompose tasks by trading off utility and computational cost
- Authors: Carlos G. Correa, Mark K. Ho, Frederick Callaway, Nathaniel D. Daw,
Thomas L. Griffiths
- Abstract summary: Human behavior emerges from planning over elaborate decompositions of tasks into goals, subgoals, and low-level actions.
We propose and evaluate a normative framework for task decomposition based on the simple idea that people decompose tasks to reduce the overall cost of planning.
- Score: 10.366866096121347
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Human behavior emerges from planning over elaborate decompositions of tasks
into goals, subgoals, and low-level actions. How are these decompositions
created and used? Here, we propose and evaluate a normative framework for task
decomposition based on the simple idea that people decompose tasks to reduce
the overall cost of planning while maintaining task performance. Analyzing
11,117 distinct graph-structured planning tasks, we find that our framework
justifies several existing heuristics for task decomposition and makes
predictions that can be distinguished from two alternative normative accounts.
We report a behavioral study of task decomposition ($N=806$) that uses 30
randomly sampled graphs, a larger and more diverse set than that of any
previous behavioral study on this topic. We find that human responses are more
consistent with our framework for task decomposition than alternative normative
accounts and are most consistent with a heuristic -- betweenness centrality --
that is justified by our approach. Taken together, our results provide new
theoretical insight into the computational principles underlying the
intelligent structuring of goal-directed behavior.
Related papers
- Learning Top-k Subtask Planning Tree based on Discriminative Representation Pre-training for Decision Making [9.302910360945042]
Planning with prior knowledge extracted from complicated real-world tasks is crucial for humans to make accurate decisions.
We introduce a multiple-encoder and individual-predictor regime to learn task-essential representations from sufficient data for simple subtasks.
We also use the attention mechanism to generate a top-k subtask planning tree, which customizes subtask execution plans in guiding complex decisions on unseen tasks.
arXiv Detail & Related papers (2023-12-18T09:00:31Z) - Consciousness-Inspired Spatio-Temporal Abstractions for Better Generalization in Reinforcement Learning [83.41487567765871]
Skipper is a model-based reinforcement learning framework.
It automatically generalizes the task given into smaller, more manageable subtasks.
It enables sparse decision-making and focused abstractions on the relevant parts of the environment.
arXiv Detail & Related papers (2023-09-30T02:25:18Z) - POMRL: No-Regret Learning-to-Plan with Increasing Horizons [43.693739167594295]
We study the problem of planning under model uncertainty in an online meta-reinforcement learning setting.
We propose an algorithm to meta-learn the underlying structure across tasks, utilize it to plan in each task, and upper-bound the regret of the planning loss.
arXiv Detail & Related papers (2022-12-30T03:09:45Z) - Inferring Versatile Behavior from Demonstrations by Matching Geometric
Descriptors [72.62423312645953]
Humans intuitively solve tasks in versatile ways, varying their behavior in terms of trajectory-based planning and for individual steps.
Current Imitation Learning algorithms often only consider unimodal expert demonstrations and act in a state-action-based setting.
Instead, we combine a mixture of movement primitives with a distribution matching objective to learn versatile behaviors that match the expert's behavior and versatility.
arXiv Detail & Related papers (2022-10-17T16:42:59Z) - Fast Inference and Transfer of Compositional Task Structures for
Few-shot Task Generalization [101.72755769194677]
We formulate it as a few-shot reinforcement learning problem where a task is characterized by a subtask graph.
Our multi-task subtask graph inferencer (MTSGI) first infers the common high-level task structure in terms of the subtask graph from the training tasks.
Our experiment results on 2D grid-world and complex web navigation domains show that the proposed method can learn and leverage the common underlying structure of the tasks for faster adaptation to the unseen tasks.
arXiv Detail & Related papers (2022-05-25T10:44:25Z) - Multi-task Learning of Order-Consistent Causal Graphs [59.9575145128345]
We consider the problem of discovering $K related Gaussian acyclic graphs (DAGs)
Under multi-task learning setting, we propose a $l_1/l$-regularized maximum likelihood estimator (MLE) for learning $K$ linear structural equation models.
We theoretically show that the joint estimator, by leveraging data across related tasks, can achieve a better sample complexity for recovering the causal order.
arXiv Detail & Related papers (2021-11-03T22:10:18Z) - An Integrated Dynamic Method for Allocating Roles and Planning Tasks for
Mixed Human-Robot Teams [0.0]
This paper proposes a novel integrated dynamic method for planning and allocating tasks in mixed human robot teams.
The Behavior Tree formulation allows encoding a single job as a compound of different tasks with temporal and logic constraints.
arXiv Detail & Related papers (2021-05-25T16:10:30Z) - Resource-rational Task Decomposition to Minimize Planning Costs [10.178049366671505]
People often plan hierarchically. That is, rather than planning over a monolithic representation of a task, they decompose the task into simpler subtasks and then plan to accomplish those.
Here, we propose that people decompose tasks in a manner that facilitates efficient use of limited cognitive resources given the structure of the environment and their own planning algorithms.
Our account provides a normative explanation for how people identify subtasks as well as a framework for studying how people reason, plan, and act using resource-rational representations.
arXiv Detail & Related papers (2020-07-27T20:59:26Z) - Dynamic Multi-Robot Task Allocation under Uncertainty and Temporal
Constraints [52.58352707495122]
We present a multi-robot allocation algorithm that decouples the key computational challenges of sequential decision-making under uncertainty and multi-agent coordination.
We validate our results over a wide range of simulations on two distinct domains: multi-arm conveyor belt pick-and-place and multi-drone delivery dispatch in a city.
arXiv Detail & Related papers (2020-05-27T01:10:41Z) - Hierarchical Reinforcement Learning as a Model of Human Task
Interleaving [60.95424607008241]
We develop a hierarchical model of supervisory control driven by reinforcement learning.
The model reproduces known empirical effects of task interleaving.
The results support hierarchical RL as a plausible model of task interleaving.
arXiv Detail & Related papers (2020-01-04T17:53:28Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.