Deep Koopman Operator with Control for Nonlinear Systems
- URL: http://arxiv.org/abs/2202.08004v1
- Date: Wed, 16 Feb 2022 11:40:36 GMT
- Title: Deep Koopman Operator with Control for Nonlinear Systems
- Authors: Haojie Shi, Max Q.H. Meng
- Abstract summary: We propose an end-to-end deep learning framework to learn the Koopman embedding function and Koopman Operator.
We first parameterize the embedding function and Koopman Operator with the neural network and train them end-to-end with the K-steps loss function.
We then design an auxiliary control network to encode the nonlinear state-dependent control term to model the nonlinearity in control input.
- Score: 44.472875714432504
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Recently Koopman operator has become a promising data-driven tool to
facilitate real-time control for unknown nonlinear systems. It maps nonlinear
systems into equivalent linear systems in embedding space, ready for real-time
linear control methods. However, designing an appropriate Koopman embedding
function remains a challenging task. Furthermore, most Koopman-based algorithms
only consider nonlinear systems with linear control input, resulting in lousy
prediction and control performance when the system is fully nonlinear with the
control input. In this work, we propose an end-to-end deep learning framework
to learn the Koopman embedding function and Koopman Operator together to
alleviate such difficulties. We first parameterize the embedding function and
Koopman Operator with the neural network and train them end-to-end with the
K-steps loss function. We then design an auxiliary control network to encode
the nonlinear state-dependent control term to model the nonlinearity in control
input. For linear control, this encoded term is considered the new control
variable instead, ensuring the linearity of the embedding space. Then we deploy
Linear Quadratic Regulator (LQR) on the linear embedding space to derive the
optimal control policy and decode the actual control input from the control
net. Experimental results demonstrate that our approach outperforms other
existing methods, reducing the prediction error by order-of-magnitude and
achieving superior control performance in several nonlinear dynamic systems
like damping pendulum, CartPole, and 7 Dof robotic manipulator.
Related papers
- Nonparametric Control-Koopman Operator Learning: Flexible and Scalable Models for Prediction and Control [2.7784144651669704]
We present a nonparametric framework for learning Koopman operator representations of nonlinear control-affine systems.
We also enhance the scalability of control-Koopman operator estimators by leveraging random projections.
The efficacy of our novel cKOR approach is demonstrated on both forecasting and control tasks.
arXiv Detail & Related papers (2024-05-12T15:46:52Z) - Computationally Efficient Data-Driven Discovery and Linear
Representation of Nonlinear Systems For Control [0.0]
This work focuses on developing a data-driven framework using Koopman operator theory for system identification and linearization of nonlinear systems for control.
We show that our proposed method is trained more efficiently and is more accurate than an autoencoder baseline.
arXiv Detail & Related papers (2023-09-08T02:19:14Z) - Differentially Flat Learning-based Model Predictive Control Using a
Stability, State, and Input Constraining Safety Filter [10.52705437098686]
Learning-based optimal control algorithms control unknown systems using past trajectory data and a learned model of the system dynamics.
We present a novel nonlinear controller that exploits differential flatness to achieve similar performance to state-of-the-art learning-based controllers.
arXiv Detail & Related papers (2023-07-20T02:42:23Z) - Towards Data-driven LQR with KoopmanizingFlows [8.133902705930327]
We propose a novel framework for learning linear time-invariant (LTI) models for a class of continuous-time non-autonomous nonlinear dynamics.
We learn a finite representation of the Koopman operator that is linear in controls while concurrently learning meaningful lifting coordinates.
arXiv Detail & Related papers (2022-01-27T17:02:03Z) - Neural Koopman Lyapunov Control [0.0]
We propose a framework to identify and construct stabilizable bilinear control systems and its associated observables from data.
Our proposed approach provides provable guarantees of global stability for the nonlinear control systems with unknown dynamics.
arXiv Detail & Related papers (2022-01-13T17:38:09Z) - Finite-time System Identification and Adaptive Control in Autoregressive
Exogenous Systems [79.67879934935661]
We study the problem of system identification and adaptive control of unknown ARX systems.
We provide finite-time learning guarantees for the ARX systems under both open-loop and closed-loop data collection.
arXiv Detail & Related papers (2021-08-26T18:00:00Z) - Improving Input-Output Linearizing Controllers for Bipedal Robots via
Reinforcement Learning [85.13138591433635]
The main drawbacks of input-output linearizing controllers are the need for precise dynamics models and not being able to account for input constraints.
In this paper, we address both challenges for the specific case of bipedal robot control by the use of reinforcement learning techniques.
arXiv Detail & Related papers (2020-04-15T18:15:49Z) - Logarithmic Regret Bound in Partially Observable Linear Dynamical
Systems [91.43582419264763]
We study the problem of system identification and adaptive control in partially observable linear dynamical systems.
We present the first model estimation method with finite-time guarantees in both open and closed-loop system identification.
We show that AdaptOn is the first algorithm that achieves $textpolylogleft(Tright)$ regret in adaptive control of unknown partially observable linear dynamical systems.
arXiv Detail & Related papers (2020-03-25T06:00:33Z) - Adaptive Control and Regret Minimization in Linear Quadratic Gaussian
(LQG) Setting [91.43582419264763]
We propose LqgOpt, a novel reinforcement learning algorithm based on the principle of optimism in the face of uncertainty.
LqgOpt efficiently explores the system dynamics, estimates the model parameters up to their confidence interval, and deploys the controller of the most optimistic model.
arXiv Detail & Related papers (2020-03-12T19:56:38Z) - Improper Learning for Non-Stochastic Control [78.65807250350755]
We consider the problem of controlling a possibly unknown linear dynamical system with adversarial perturbations, adversarially chosen convex loss functions, and partially observed states.
Applying online descent to this parametrization yields a new controller which attains sublinear regret vs. a large class of closed-loop policies.
Our bounds are the first in the non-stochastic control setting that compete with emphall stabilizing linear dynamical controllers.
arXiv Detail & Related papers (2020-01-25T02:12:48Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.