Automatic discovery and description of human planning strategies
- URL: http://arxiv.org/abs/2109.14493v1
- Date: Wed, 29 Sep 2021 15:20:16 GMT
- Title: Automatic discovery and description of human planning strategies
- Authors: Julian Skirzynski, Yash Raj Jain, Falk Lieder
- Abstract summary: We leverage AI for strategy discovery for understanding human planning.
Our algorithm, called Human-Interpret, uses imitation learning to describe process-tracing data.
We find that the descriptions of human planning strategies obtained automatically are about as understandable as human-generated descriptions.
- Score: 0.7734726150561086
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Scientific discovery concerns finding patterns in data and creating
insightful hypotheses that explain these patterns. Traditionally, this process
required human ingenuity, but with the galloping advances in artificial
intelligence (AI) it becomes feasible to automate some parts of scientific
discovery. In this work we leverage AI for strategy discovery for understanding
human planning. In the state-of-the-art methods data about the process of human
planning is often used to group similar behaviors together and formulate verbal
descriptions of the strategies which might underlie those groups. Here, we
automate these two steps. Our algorithm, called Human-Interpret, uses imitation
learning to describe process-tracing data collected in psychological
experiments with the Mouselab-MDP paradigm in terms of a procedural formula.
Then, it translates that formula to natural language using a pre-defined
predicate dictionary. We test our method on a benchmark data set that
researchers have previously scrutinized manually. We find that the descriptions
of human planning strategies obtained automatically are about as understandable
as human-generated descriptions. They also cover a substantial proportion of
all types of human planning strategies that had been discovered manually. Our
method saves scientists' time and effort as all the reasoning about human
planning is done automatically. This might make it feasible to more rapidly
scale up the search for yet undiscovered cognitive strategies to many new
decision environments, populations, tasks, and domains. Given these results, we
believe that the presented work may accelerate scientific discovery in
psychology, and due to its generality, extend to problems from other fields.
Related papers
- Human-Modeling in Sequential Decision-Making: An Analysis through the Lens of Human-Aware AI [20.21053807133341]
We try to provide an account of what constitutes a human-aware AI system.
We see that human-aware AI is a design oriented paradigm, one that focuses on the need for modeling the humans it may interact with.
arXiv Detail & Related papers (2024-05-13T14:17:52Z) - Artificial intelligence to automate the systematic review of scientific
literature [0.0]
We present a survey of AI techniques proposed in the last 15 years to help researchers conduct systematic analyses of scientific literature.
We describe the tasks currently supported, the types of algorithms applied, and available tools proposed in 34 primary studies.
arXiv Detail & Related papers (2024-01-13T19:12:49Z) - Automated Process Planning Based on a Semantic Capability Model and SMT [50.76251195257306]
In research of manufacturing systems and autonomous robots, the term capability is used for a machine-interpretable specification of a system function.
We present an approach that combines these two topics: starting from a semantic capability model, an AI planning problem is automatically generated.
arXiv Detail & Related papers (2023-12-14T10:37:34Z) - Towards Generalizable Zero-Shot Manipulation via Translating Human
Interaction Plans [58.27029676638521]
We show how passive human videos can serve as a rich source of data for learning such generalist robots.
We learn a human plan predictor that, given a current image of a scene and a goal image, predicts the future hand and object configurations.
We show that our learned system can perform over 16 manipulation skills that generalize to 40 objects.
arXiv Detail & Related papers (2023-12-01T18:54:12Z) - AI planning in the imagination: High-level planning on learned abstract
search spaces [68.75684174531962]
We propose a new method, called PiZero, that gives an agent the ability to plan in an abstract search space that the agent learns during training.
We evaluate our method on multiple domains, including the traveling salesman problem, Sokoban, 2048, the facility location problem, and Pacman.
arXiv Detail & Related papers (2023-08-16T22:47:16Z) - IndigoVX: Where Human Intelligence Meets AI for Optimal Decision Making [0.0]
This paper defines a new approach for augmenting human intelligence with AI for optimal goal solving.
Our proposed AI, Indigo, is an acronym for Informed Numerical Decision-making through Iterative Goal-Oriented optimization.
We envisage this method being applied to games or business strategies, with the human providing strategic context and the AI offering optimal, data-driven moves.
arXiv Detail & Related papers (2023-07-21T11:54:53Z) - Machine Psychology [54.287802134327485]
We argue that a fruitful direction for research is engaging large language models in behavioral experiments inspired by psychology.
We highlight theoretical perspectives, experimental paradigms, and computational analysis techniques that this approach brings to the table.
It paves the way for a "machine psychology" for generative artificial intelligence (AI) that goes beyond performance benchmarks.
arXiv Detail & Related papers (2023-03-24T13:24:41Z) - Achieving mouse-level strategic evasion performance using real-time
computational planning [59.60094442546867]
Planning is an extraordinary ability in which the brain imagines and then enacts evaluated possible futures.
We develop a more efficient biologically-inspired planning algorithm, TLPPO, based on work on how the ecology of an animal governs the value of spatial planning.
We compare the performance of a real-time agent using TLPPO against the performance of live mice, all tasked with evading a robot predator.
arXiv Detail & Related papers (2022-11-04T18:34:36Z) - Have I done enough planning or should I plan more? [0.7734726150561086]
We show that people acquire this ability through learning and reverse-engineer the underlying learning mechanisms.
We find that people quickly adapt how much planning they perform to the cost and benefit of planning.
Our results suggest that the metacognitive ability to adjust the amount of planning might be learned through a policy-gradient mechanism.
arXiv Detail & Related papers (2022-01-03T17:11:07Z) - Human-in-the-Loop Disinformation Detection: Stance, Sentiment, or
Something Else? [93.91375268580806]
Both politics and pandemics have recently provided ample motivation for the development of machine learning-enabled disinformation (a.k.a. fake news) detection algorithms.
Existing literature has focused primarily on the fully-automated case, but the resulting techniques cannot reliably detect disinformation on the varied topics, sources, and time scales required for military applications.
By leveraging an already-available analyst as a human-in-the-loop, canonical machine learning techniques of sentiment analysis, aspect-based sentiment analysis, and stance detection become plausible methods to use for a partially-automated disinformation detection system.
arXiv Detail & Related papers (2021-11-09T13:30:34Z) - Interactive Natural Language-based Person Search [15.473033192858543]
We study how to design an algorithm to effectively acquire descriptions from humans.
An algorithm is proposed by adapting models, used for visual and language understanding, to search a person of interest (POI) in a principled way.
We then investigate an iterative question-answering (QA) strategy that enable robots to request additional information about the POI's appearance.
arXiv Detail & Related papers (2020-02-19T20:42:19Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.