Continually Learned Pavlovian Signalling Without Forgetting for
Human-in-the-Loop Robotic Control
- URL: http://arxiv.org/abs/2305.14365v1
- Date: Tue, 16 May 2023 15:37:16 GMT
- Title: Continually Learned Pavlovian Signalling Without Forgetting for
Human-in-the-Loop Robotic Control
- Authors: Adam S. R. Parker, Michael R. Dawson, and Patrick M. Pilarski
- Abstract summary: Pavlovian signalling is an approach for better modulating feedback in prostheses.
One challenge is that they can forget previously learned predictions when a user begins to successfully act upon delivered feedback.
This work contributes new insight into the challenges of providing learned predictive feedback from a prosthetic device.
- Score: 0.8258451067861933
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Artificial limbs are sophisticated devices to assist people with tasks of
daily living. Despite advanced robotic prostheses demonstrating similar motion
capabilities to biological limbs, users report them difficult and non-intuitive
to use. Providing more effective feedback from the device to the user has
therefore become a topic of increased interest. In particular, prediction
learning methods from the field of reinforcement learning -- specifically, an
approach termed Pavlovian signalling -- have been proposed as one approach for
better modulating feedback in prostheses since they can adapt during continuous
use. One challenge identified in these learning methods is that they can forget
previously learned predictions when a user begins to successfully act upon
delivered feedback. The present work directly addresses this challenge,
contributing new evidence on the impact of algorithmic choices, such as on- or
off-policy methods and representation choices, on the Pavlovian signalling from
a machine to a user during their control of a robotic arm. Two conditions of
algorithmic differences were studied using different scenarios of controlling a
robotic arm: an automated motion system and human participant piloting.
Contrary to expectations, off-policy learning did not provide the expected
solution to the forgetting problem. We instead identified beneficial properties
of a look-ahead state representation that made existing approaches able to
learn (and not forget) predictions in support of Pavlovian signalling. This
work therefore contributes new insight into the challenges of providing learned
predictive feedback from a prosthetic device, and demonstrates avenues for more
dynamic signalling in future human-machine interactions.
Related papers
- Adaptive Language-Guided Abstraction from Contrastive Explanations [53.48583372522492]
It is necessary to determine which features of the environment are relevant before determining how these features should be used to compute reward.
End-to-end methods for joint feature and reward learning often yield brittle reward functions that are sensitive to spurious state features.
This paper describes a method named ALGAE which alternates between using language models to iteratively identify human-meaningful features.
arXiv Detail & Related papers (2024-09-12T16:51:58Z) - DiAReL: Reinforcement Learning with Disturbance Awareness for Robust
Sim2Real Policy Transfer in Robot Control [0.0]
Delayed Markov decision processes fulfill the Markov property by augmenting the state space of agents with a finite time window of recently committed actions.
We introduce a disturbance-augmented Markov decision process in delayed settings as a novel representation to incorporate disturbance estimation in training on-policy reinforcement learning algorithms.
arXiv Detail & Related papers (2023-06-15T10:11:38Z) - Incremental procedural and sensorimotor learning in cognitive humanoid
robots [52.77024349608834]
This work presents a cognitive agent that can learn procedures incrementally.
We show the cognitive functions required in each substage and how adding new functions helps address tasks previously unsolved by the agent.
Results show that this approach is capable of solving complex tasks incrementally.
arXiv Detail & Related papers (2023-04-30T22:51:31Z) - A-ACT: Action Anticipation through Cycle Transformations [89.83027919085289]
We take a step back to analyze how the human capability to anticipate the future can be transferred to machine learning algorithms.
A recent study on human psychology explains that, in anticipating an occurrence, the human brain counts on both systems.
In this work, we study the impact of each system for the task of action anticipation and introduce a paradigm to integrate them in a learning framework.
arXiv Detail & Related papers (2022-04-02T21:50:45Z) - Learning to Control Complex Robots Using High-Dimensional Interfaces:
Preliminary Insights [22.719193009150867]
We explore the use of limited upper-body motions, captured via motion sensors, as inputs to control a 7 degree-of-freedom robotic arm.
It is possible that even dense sensor signals lack the salient information and independence necessary for reliable high-dimensional robot control.
arXiv Detail & Related papers (2021-10-09T23:38:22Z) - Improving Human Motion Prediction Through Continual Learning [2.720960618356385]
Human motion prediction is an essential component for enabling closer human-robot collaboration.
It is compounded by the variability of human motion, both at a skeletal level due to the varying size of humans and at a motion level due to individual movement idiosyncrasies.
We propose a modular sequence learning approach that allows end-to-end training while also having the flexibility of being fine-tuned.
arXiv Detail & Related papers (2021-07-01T15:34:41Z) - Social NCE: Contrastive Learning of Socially-aware Motion
Representations [87.82126838588279]
Experimental results show that the proposed method dramatically reduces the collision rates of recent trajectory forecasting, behavioral cloning and reinforcement learning algorithms.
Our method makes few assumptions about neural architecture designs, and hence can be used as a generic way to promote the robustness of neural motion models.
arXiv Detail & Related papers (2020-12-21T22:25:06Z) - Human-in-the-Loop Imitation Learning using Remote Teleoperation [72.2847988686463]
We build a data collection system tailored to 6-DoF manipulation settings.
We develop an algorithm to train the policy iteratively on new data collected by the system.
We demonstrate that agents trained on data collected by our intervention-based system and algorithm outperform agents trained on an equivalent number of samples collected by non-interventional demonstrators.
arXiv Detail & Related papers (2020-12-12T05:30:35Z) - Robotic self-representation improves manipulation skills and transfer
learning [14.863872352905629]
We develop a model that learns bidirectional action-effect associations to encode the representations of body schema and the peripersonal space from multisensory information.
We demonstrate that this approach significantly stabilizes the learning-based problem-solving under noisy conditions and that it improves transfer learning of robotic manipulation skills.
arXiv Detail & Related papers (2020-11-13T16:04:58Z) - Semi-supervised Learning From Demonstration Through Program Synthesis:
An Inspection Robot Case Study [0.0]
We present a hybrid semi-supervised system capable of learning interpretable and verifiable models from demonstrations.
The system induces a controller program by learning from immersive demonstrations using sequential importance sampling.
We successfully learn the hybrid system from an inspection scenario where an unmanned ground vehicle has to inspect, in a specific order, different areas of the environment.
arXiv Detail & Related papers (2020-07-23T01:32:21Z) - Learning Predictive Models From Observation and Interaction [137.77887825854768]
Learning predictive models from interaction with the world allows an agent, such as a robot, to learn about how the world works.
However, learning a model that captures the dynamics of complex skills represents a major challenge.
We propose a method to augment the training set with observational data of other agents, such as humans.
arXiv Detail & Related papers (2019-12-30T01:10:41Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.