Deep Learning Tubes for Tube MPC
- URL: http://arxiv.org/abs/2002.01587v2
- Date: Thu, 4 Jun 2020 20:12:35 GMT
- Title: Deep Learning Tubes for Tube MPC
- Authors: David D. Fan, Ali-akbar Agha-mohammadi and Evangelos A. Theodorou
- Abstract summary: We use deep learning to obtain expressive and flexible models of how trajectories behave.
We then use for nonlinear Model Predictive Control (MPC)
We present experiments in simulation on a nonlinear quadrotor system.
- Score: 21.84264471259777
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Learning-based control aims to construct models of a system to use for
planning or trajectory optimization, e.g. in model-based reinforcement
learning. In order to obtain guarantees of safety in this context, uncertainty
must be accurately quantified. This uncertainty may come from errors in
learning (due to a lack of data, for example), or may be inherent to the
system. Propagating uncertainty forward in learned dynamics models is a
difficult problem. In this work we use deep learning to obtain expressive and
flexible models of how distributions of trajectories behave, which we then use
for nonlinear Model Predictive Control (MPC). We introduce a deep quantile
regression framework for control that enforces probabilistic quantile bounds
and quantifies epistemic uncertainty. Using our method we explore three
different approaches for learning tubes that contain the possible trajectories
of the system, and demonstrate how to use each of them in a Tube MPC scheme. We
prove these schemes are recursively feasible and satisfy constraints with a
desired margin of probability. We present experiments in simulation on a
nonlinear quadrotor system, demonstrating the practical efficacy of these
ideas.
Related papers
- Learning Controlled Stochastic Differential Equations [61.82896036131116]
This work proposes a novel method for estimating both drift and diffusion coefficients of continuous, multidimensional, nonlinear controlled differential equations with non-uniform diffusion.
We provide strong theoretical guarantees, including finite-sample bounds for (L2), (Linfty), and risk metrics, with learning rates adaptive to coefficients' regularity.
Our method is available as an open-source Python library.
arXiv Detail & Related papers (2024-11-04T11:09:58Z) - Non-Markovian Quantum Control via Model Maximum Likelihood Estimation
and Reinforcement Learning [0.0]
We propose a novel approach that incorporates the non-Markovian nature of the environment into a low-dimensional effective reservoir.
We utilize machine learning techniques to learn the effective quantum dynamics more efficiently than traditional tomographic methods.
This approach may not only mitigates the issues of model bias but also provides a more accurate representation of quantum dynamics.
arXiv Detail & Related papers (2024-02-07T18:37:17Z) - Statistical Safety and Robustness Guarantees for Feedback Motion
Planning of Unknown Underactuated Stochastic Systems [1.0323063834827415]
We propose a sampling-based planner that uses the mean dynamics model and simultaneously bounds the closed-loop tracking error via a learned disturbance bound.
We validate that our guarantees translate to empirical safety in simulation on a 10D quadrotor, and in the real world on a physical CrazyFlie quadrotor and Clearpath Jackal robot.
arXiv Detail & Related papers (2022-12-13T19:38:39Z) - Learning-enhanced Nonlinear Model Predictive Control using
Knowledge-based Neural Ordinary Differential Equations and Deep Ensembles [5.650647159993238]
In this work, we leverage deep learning tools, namely knowledge-based neural ordinary differential equations (KNODE) and deep ensembles, to improve the prediction accuracy of a model predictive control (MPC)
In particular, we learn an ensemble of KNODE models, which we refer to as the KNODE ensemble, to obtain an accurate prediction of the true system dynamics.
We show that the KNODE ensemble provides more accurate predictions and illustrate the efficacy and closed-loop performance of the proposed nonlinear MPC framework.
arXiv Detail & Related papers (2022-11-24T23:51:18Z) - Probabilistic robust linear quadratic regulators with Gaussian processes [73.0364959221845]
Probabilistic models such as Gaussian processes (GPs) are powerful tools to learn unknown dynamical systems from data for subsequent use in control design.
We present a novel controller synthesis for linearized GP dynamics that yields robust controllers with respect to a probabilistic stability margin.
arXiv Detail & Related papers (2021-05-17T08:36:18Z) - Controlling nonlinear dynamical systems into arbitrary states using
machine learning [77.34726150561087]
We propose a novel and fully data driven control scheme which relies on machine learning (ML)
Exploiting recently developed ML-based prediction capabilities of complex systems, we demonstrate that nonlinear systems can be forced to stay in arbitrary dynamical target states coming from any initial state.
Having this highly flexible control scheme with little demands on the amount of required data on hand, we briefly discuss possible applications that range from engineering to medicine.
arXiv Detail & Related papers (2021-02-23T16:58:26Z) - Using Data Assimilation to Train a Hybrid Forecast System that Combines
Machine-Learning and Knowledge-Based Components [52.77024349608834]
We consider the problem of data-assisted forecasting of chaotic dynamical systems when the available data is noisy partial measurements.
We show that by using partial measurements of the state of the dynamical system, we can train a machine learning model to improve predictions made by an imperfect knowledge-based model.
arXiv Detail & Related papers (2021-02-15T19:56:48Z) - Anticipating the Long-Term Effect of Online Learning in Control [75.6527644813815]
AntLer is a design algorithm for learning-based control laws that anticipates learning.
We show that AntLer approximates an optimal solution arbitrarily accurately with probability one.
arXiv Detail & Related papers (2020-07-24T07:00:14Z) - How Training Data Impacts Performance in Learning-based Control [67.7875109298865]
This paper derives an analytical relationship between the density of the training data and the control performance.
We formulate a quality measure for the data set, which we refer to as $rho$-gap.
We show how the $rho$-gap can be applied to a feedback linearizing control law.
arXiv Detail & Related papers (2020-05-25T12:13:49Z) - DISCO: Double Likelihood-free Inference Stochastic Control [29.84276469617019]
We propose to leverage the power of modern simulators and recent techniques in Bayesian statistics for likelihood-free inference.
The posterior distribution over simulation parameters is propagated through a potentially non-analytical model of the system.
Experiments show that the controller proposed attained superior performance and robustness on classical control and robotics tasks.
arXiv Detail & Related papers (2020-02-18T05:29:40Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.