Counter-example Guided Learning of Bounds on Environment Behavior
- URL: http://arxiv.org/abs/2001.07233v3
- Date: Thu, 6 Feb 2020 06:19:27 GMT
- Title: Counter-example Guided Learning of Bounds on Environment Behavior
- Authors: Yuxiao Chen, Sumanth Dathathri, Tung Phan-Minh, and Richard M. Murray
- Abstract summary: We present a data-driven solution that allows for a system to be evaluated for specification conformance without an accurate model of the environment.
Our approach involves learning a conservative reactive bound of the environment's behavior using data and specification of the system's desired behavior.
- Score: 11.357397596759172
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: There is a growing interest in building autonomous systems that interact with
complex environments. The difficulty associated with obtaining an accurate
model for such environments poses a challenge to the task of assessing and
guaranteeing the system's performance. We present a data-driven solution that
allows for a system to be evaluated for specification conformance without an
accurate model of the environment. Our approach involves learning a
conservative reactive bound of the environment's behavior using data and
specification of the system's desired behavior. First, the approach begins by
learning a conservative reactive bound on the environment's actions that
captures its possible behaviors with high probability. This bound is then used
to assist verification, and if the verification fails under this bound, the
algorithm returns counter-examples to show how failure occurs and then uses
these to refine the bound. We demonstrate the applicability of the approach
through two case-studies: i) verifying controllers for a toy multi-robot
system, and ii) verifying an instance of human-robot interaction during a
lane-change maneuver given real-world human driving data.
Related papers
- Data-driven Force Observer for Human-Robot Interaction with Series Elastic Actuators using Gaussian Processes [4.229902091180109]
In this work, we learn the unknown dynamics components using Gaussian process (GP) regression.
We derive guaranteed estimation error bounds, thus, facilitating the use in safety-critical applications.
We demonstrate the effectiveness of the proposed approach experimentally in a human-exoskeleton interaction scenario.
arXiv Detail & Related papers (2024-05-14T15:51:52Z) - Evaluating Real-World Robot Manipulation Policies in Simulation [91.55267186958892]
Control and visual disparities between real and simulated environments are key challenges for reliable simulated evaluation.
We propose approaches for mitigating these gaps without needing to craft full-fidelity digital twins of real-world environments.
We create SIMPLER, a collection of simulated environments for manipulation policy evaluation on common real robot setups.
arXiv Detail & Related papers (2024-05-09T17:30:16Z) - Discovering Decision Manifolds to Assure Trusted Autonomous Systems [0.0]
We propose an optimization-based search technique for capturing the range of correct and incorrect responses a system could exhibit.
This manifold provides a more detailed understanding of system reliability than traditional testing or Monte Carlo simulations.
In this proof-of-concept, we apply our method to a software-in-the-loop evaluation of an autonomous vehicle.
arXiv Detail & Related papers (2024-02-12T16:55:58Z) - DARTH: Holistic Test-time Adaptation for Multiple Object Tracking [87.72019733473562]
Multiple object tracking (MOT) is a fundamental component of perception systems for autonomous driving.
Despite the urge of safety in driving systems, no solution to the MOT adaptation problem to domain shift in test-time conditions has ever been proposed.
We introduce DARTH, a holistic test-time adaptation framework for MOT.
arXiv Detail & Related papers (2023-10-03T10:10:42Z) - Interactive System-wise Anomaly Detection [66.3766756452743]
Anomaly detection plays a fundamental role in various applications.
It is challenging for existing methods to handle the scenarios where the instances are systems whose characteristics are not readily observed as data.
We develop an end-to-end approach which includes an encoder-decoder module that learns system embeddings.
arXiv Detail & Related papers (2023-04-21T02:20:24Z) - Error-Aware Policy Learning: Zero-Shot Generalization in Partially
Observable Dynamic Environments [18.8481771211768]
We introduce a novel approach to tackle such a sim-to-real problem by developing policies capable of adapting to new environments.
Key to our approach is an error-aware policy (EAP) that is explicitly made aware of the effect of unobservable factors during training.
We show that a trained EAP for a hip-torque assistive device can be transferred to different human agents with unseen biomechanical characteristics.
arXiv Detail & Related papers (2021-03-13T15:36:44Z) - Domain Curiosity: Learning Efficient Data Collection Strategies for
Domain Adaptation [16.539422751949797]
We present domain curiosity -- a method of training exploratory policies that are explicitly optimized to provide data.
In contrast to most curiosity methods, our approach explicitly rewards learning, which makes it robust to environment noise.
We evaluate the proposed method by comparing how much a model can learn about environment dynamics given data collected by the proposed approach.
arXiv Detail & Related papers (2021-03-12T12:02:11Z) - A Novel Anomaly Detection Algorithm for Hybrid Production Systems based
on Deep Learning and Timed Automata [73.38551379469533]
DAD:DeepAnomalyDetection is a new approach for automatic model learning and anomaly detection in hybrid production systems.
It combines deep learning and timed automata for creating behavioral model from observations.
The algorithm has been applied to few data sets including two from real systems and has shown promising results.
arXiv Detail & Related papers (2020-10-29T08:27:43Z) - Guided Uncertainty-Aware Policy Optimization: Combining Learning and
Model-Based Strategies for Sample-Efficient Policy Learning [75.56839075060819]
Traditional robotic approaches rely on an accurate model of the environment, a detailed description of how to perform the task, and a robust perception system to keep track of the current state.
reinforcement learning approaches can operate directly from raw sensory inputs with only a reward signal to describe the task, but are extremely sample-inefficient and brittle.
In this work, we combine the strengths of model-based methods with the flexibility of learning-based methods to obtain a general method that is able to overcome inaccuracies in the robotics perception/actuation pipeline.
arXiv Detail & Related papers (2020-05-21T19:47:05Z) - Unsupervised Domain Adaptation in Person re-ID via k-Reciprocal
Clustering and Large-Scale Heterogeneous Environment Synthesis [76.46004354572956]
We introduce an unsupervised domain adaptation approach for person re-identification.
Experimental results show that the proposed ktCUDA and SHRED approach achieves an average improvement of +5.7 mAP in re-identification performance.
arXiv Detail & Related papers (2020-01-14T17:43:52Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.