Analyzing Human Models that Adapt Online
- URL: http://arxiv.org/abs/2103.05746v1
- Date: Tue, 9 Mar 2021 22:38:46 GMT
- Title: Analyzing Human Models that Adapt Online
- Authors: Andrea Bajcsy, Anand Siththaranjan, Claire J. Tomlin, Anca D. Dragan
- Abstract summary: Predictive human models often need to adapt their parameters online from human data.
This raises previously ignored safety-related questions for robots relying on these models.
We model the robot's learning algorithm as a dynamical system where the state is the current model parameter estimate and the control is the human data the robot observes.
- Score: 42.90591111619058
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Predictive human models often need to adapt their parameters online from
human data. This raises previously ignored safety-related questions for robots
relying on these models such as what the model could learn online and how
quickly could it learn it. For instance, when will the robot have a confident
estimate in a nearby human's goal? Or, what parameter initializations guarantee
that the robot can learn the human's preferences in a finite number of
observations? To answer such analysis questions, our key idea is to model the
robot's learning algorithm as a dynamical system where the state is the current
model parameter estimate and the control is the human data the robot observes.
This enables us to leverage tools from reachability analysis and optimal
control to compute the set of hypotheses the robot could learn in finite time,
as well as the worst and best-case time it takes to learn them. We demonstrate
the utility of our analysis tool in four human-robot domains, including
autonomous driving and indoor navigation.
Related papers
- InteRACT: Transformer Models for Human Intent Prediction Conditioned on Robot Actions [7.574421886354134]
InteRACT architecture pre-trains a conditional intent prediction model on large human-human datasets and fine-tunes on a small human-robot dataset.
We evaluate on a set of real-world collaborative human-robot manipulation tasks and show that our conditional model improves over various marginal baselines.
arXiv Detail & Related papers (2023-11-21T19:15:17Z) - Real-time Addressee Estimation: Deployment of a Deep-Learning Model on
the iCub Robot [52.277579221741746]
Addressee Estimation is a skill essential for social robots to interact smoothly with humans.
Inspired by human perceptual skills, a deep-learning model for Addressee Estimation is designed, trained, and deployed on an iCub robot.
The study presents the procedure of such implementation and the performance of the model deployed in real-time human-robot interaction.
arXiv Detail & Related papers (2023-11-09T13:01:21Z) - SACSoN: Scalable Autonomous Control for Social Navigation [62.59274275261392]
We develop methods for training policies for socially unobtrusive navigation.
By minimizing this counterfactual perturbation, we can induce robots to behave in ways that do not alter the natural behavior of humans in the shared space.
We collect a large dataset where an indoor mobile robot interacts with human bystanders.
arXiv Detail & Related papers (2023-06-02T19:07:52Z) - Towards Modeling and Influencing the Dynamics of Human Learning [26.961274302321343]
We take a step towards enabling robots to understand the influence they have, leverage it to better assist people, and help human models more quickly align with reality.
Our key idea is to model the human's learning as a nonlinear dynamical system which evolves the human's internal model given new observations.
We then formalize how robots can influence human learning by embedding the human's learning dynamics model into the robot planning problem.
arXiv Detail & Related papers (2023-01-02T23:59:45Z) - HERD: Continuous Human-to-Robot Evolution for Learning from Human
Demonstration [57.045140028275036]
We show that manipulation skills can be transferred from a human to a robot through the use of micro-evolutionary reinforcement learning.
We propose an algorithm for multi-dimensional evolution path searching that allows joint optimization of both the robot evolution path and the policy.
arXiv Detail & Related papers (2022-12-08T15:56:13Z) - Reasoning about Counterfactuals to Improve Human Inverse Reinforcement
Learning [5.072077366588174]
Humans naturally infer other agents' beliefs and desires by reasoning about their observable behavior.
We propose to incorporate the learner's current understanding of the robot's decision making into our model of human IRL.
We also propose a novel measure for estimating the difficulty for a human to predict instances of a robot's behavior in unseen environments.
arXiv Detail & Related papers (2022-03-03T17:06:37Z) - Dynamically Switching Human Prediction Models for Efficient Planning [32.180808286226075]
We give the robot access to a suite of human models and enable it to assess the performance-computation trade-off online.
Our experiments in a driving simulator showcase how the robot can achieve performance comparable to always using the best human model.
arXiv Detail & Related papers (2021-03-13T23:48:09Z) - Model Elicitation through Direct Questioning [22.907680615911755]
We show how a robot can interact to localize the human model from a set of models.
We show how to generate questions to refine the robot's understanding of the teammate's model.
arXiv Detail & Related papers (2020-11-24T18:17:16Z) - Human Grasp Classification for Reactive Human-to-Robot Handovers [50.91803283297065]
We propose an approach for human-to-robot handovers in which the robot meets the human halfway.
We collect a human grasp dataset which covers typical ways of holding objects with various hand shapes and poses.
We present a planning and execution approach that takes the object from the human hand according to the detected grasp and hand position.
arXiv Detail & Related papers (2020-03-12T19:58:03Z) - Learning Predictive Models From Observation and Interaction [137.77887825854768]
Learning predictive models from interaction with the world allows an agent, such as a robot, to learn about how the world works.
However, learning a model that captures the dynamics of complex skills represents a major challenge.
We propose a method to augment the training set with observational data of other agents, such as humans.
arXiv Detail & Related papers (2019-12-30T01:10:41Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.