Semi-supervised Learning From Demonstration Through Program Synthesis:
An Inspection Robot Case Study
- URL: http://arxiv.org/abs/2007.12500v1
- Date: Thu, 23 Jul 2020 01:32:21 GMT
- Title: Semi-supervised Learning From Demonstration Through Program Synthesis:
An Inspection Robot Case Study
- Authors: Sim\'on C. Smith (The University of Edinburgh), Subramanian
Ramamoorthy (The University of Edinburgh)
- Abstract summary: We present a hybrid semi-supervised system capable of learning interpretable and verifiable models from demonstrations.
The system induces a controller program by learning from immersive demonstrations using sequential importance sampling.
We successfully learn the hybrid system from an inspection scenario where an unmanned ground vehicle has to inspect, in a specific order, different areas of the environment.
- Score: 0.0
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Semi-supervised learning improves the performance of supervised machine
learning by leveraging methods from unsupervised learning to extract
information not explicitly available in the labels. Through the design of a
system that enables a robot to learn inspection strategies from a human
operator, we present a hybrid semi-supervised system capable of learning
interpretable and verifiable models from demonstrations. The system induces a
controller program by learning from immersive demonstrations using sequential
importance sampling. These visual servo controllers are parametrised by
proportional gains and are visually verifiable through observation of the
position of the robot in the environment. Clustering and effective particle
size filtering allows the system to discover goals in the state space. These
goals are used to label the original demonstration for end-to-end learning of
behavioural models. The behavioural models are used for autonomous model
predictive control and scrutinised for explanations. We implement causal
sensitivity analysis to identify salient objects and generate counterfactual
conditional explanations. These features enable decision making interpretation
and post hoc discovery of the causes of a failure. The proposed system expands
on previous approaches to program synthesis by incorporating repellers in the
attribution prior of the sampling process. We successfully learn the hybrid
system from an inspection scenario where an unmanned ground vehicle has to
inspect, in a specific order, different areas of the environment. The system
induces an interpretable computer program of the demonstration that can be
synthesised to produce novel inspection behaviours. Importantly, the robot
successfully runs the synthesised program on an unseen configuration of the
environment while presenting explanations of its autonomous behaviour.
Related papers
- Learning Manipulation by Predicting Interaction [85.57297574510507]
We propose a general pre-training pipeline that learns Manipulation by Predicting the Interaction.
The experimental results demonstrate that MPI exhibits remarkable improvement by 10% to 64% compared with previous state-of-the-art in real-world robot platforms.
arXiv Detail & Related papers (2024-06-01T13:28:31Z) - DiffGen: Robot Demonstration Generation via Differentiable Physics Simulation, Differentiable Rendering, and Vision-Language Model [72.66465487508556]
DiffGen is a novel framework that integrates differentiable physics simulation, differentiable rendering, and a vision-language model.
It can generate realistic robot demonstrations by minimizing the distance between the embedding of the language instruction and the embedding of the simulated observation.
Experiments demonstrate that with DiffGen, we could efficiently and effectively generate robot data with minimal human effort or training time.
arXiv Detail & Related papers (2024-05-12T15:38:17Z) - A Mathematical Model of the Hidden Feedback Loop Effect in Machine Learning Systems [44.99833362998488]
We introduce a repeated learning process to jointly describe several phenomena attributed to unintended hidden feedback loops.
A distinctive feature of such repeated learning setting is that the state of the environment becomes causally dependent on the learner itself over time.
We present a novel dynamical systems model of the repeated learning process and prove the limiting set of probability distributions for positive and negative feedback loop modes.
arXiv Detail & Related papers (2024-05-04T17:57:24Z) - Model-free tracking control of complex dynamical trajectories with
machine learning [0.2356141385409842]
We develop a model-free, machine-learning framework to control a two-arm robotic manipulator.
We demonstrate the effectiveness of the control framework using a variety of periodic and chaotic signals.
arXiv Detail & Related papers (2023-09-20T17:10:10Z) - Visual Affordance Prediction for Guiding Robot Exploration [56.17795036091848]
We develop an approach for learning visual affordances for guiding robot exploration.
We use a Transformer-based model to learn a conditional distribution in the latent embedding space of a VQ-VAE.
We show how the trained affordance model can be used for guiding exploration by acting as a goal-sampling distribution, during visual goal-conditioned policy learning in robotic manipulation.
arXiv Detail & Related papers (2023-05-28T17:53:09Z) - Continually Learned Pavlovian Signalling Without Forgetting for
Human-in-the-Loop Robotic Control [0.8258451067861933]
Pavlovian signalling is an approach for better modulating feedback in prostheses.
One challenge is that they can forget previously learned predictions when a user begins to successfully act upon delivered feedback.
This work contributes new insight into the challenges of providing learned predictive feedback from a prosthetic device.
arXiv Detail & Related papers (2023-05-16T15:37:16Z) - Interactive System-wise Anomaly Detection [66.3766756452743]
Anomaly detection plays a fundamental role in various applications.
It is challenging for existing methods to handle the scenarios where the instances are systems whose characteristics are not readily observed as data.
We develop an end-to-end approach which includes an encoder-decoder module that learns system embeddings.
arXiv Detail & Related papers (2023-04-21T02:20:24Z) - A Novel Anomaly Detection Algorithm for Hybrid Production Systems based
on Deep Learning and Timed Automata [73.38551379469533]
DAD:DeepAnomalyDetection is a new approach for automatic model learning and anomaly detection in hybrid production systems.
It combines deep learning and timed automata for creating behavioral model from observations.
The algorithm has been applied to few data sets including two from real systems and has shown promising results.
arXiv Detail & Related papers (2020-10-29T08:27:43Z) - Learning a generative model for robot control using visual feedback [7.171234436165255]
We introduce a novel formulation for incorporating visual feedback in controlling robots.
Inference in the model allows us to infer the robot state corresponding to target locations of the features.
We demonstrate the effectiveness of our method by executing grasping and tight-fit insertions on robots with inaccurate controllers.
arXiv Detail & Related papers (2020-03-10T00:34:01Z) - Counter-example Guided Learning of Bounds on Environment Behavior [11.357397596759172]
We present a data-driven solution that allows for a system to be evaluated for specification conformance without an accurate model of the environment.
Our approach involves learning a conservative reactive bound of the environment's behavior using data and specification of the system's desired behavior.
arXiv Detail & Related papers (2020-01-20T19:58:24Z) - Learning Predictive Models From Observation and Interaction [137.77887825854768]
Learning predictive models from interaction with the world allows an agent, such as a robot, to learn about how the world works.
However, learning a model that captures the dynamics of complex skills represents a major challenge.
We propose a method to augment the training set with observational data of other agents, such as humans.
arXiv Detail & Related papers (2019-12-30T01:10:41Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.