On Training and Evaluation of Neural Network Approaches for Model
  Predictive Control
        - URL: http://arxiv.org/abs/2005.04112v1
- Date: Fri, 8 May 2020 15:37:55 GMT
- Title: On Training and Evaluation of Neural Network Approaches for Model
  Predictive Control
- Authors: Rebecka Winqvist, Arun Venkitaraman, Bo Wahlberg
- Abstract summary: This paper is a framework for training and evaluation of Model Predictive Control (MPC) implemented using constrained neural networks.
The motivation is to replace real-time optimization in safety critical feedback control systems with learnt mappings in the form of neural networks with optimization layers.
- Score: 9.8918553325509
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract:   The contribution of this paper is a framework for training and evaluation of
Model Predictive Control (MPC) implemented using constrained neural networks.
Recent studies have proposed to use neural networks with differentiable convex
optimization layers to implement model predictive controllers. The motivation
is to replace real-time optimization in safety critical feedback control
systems with learnt mappings in the form of neural networks with optimization
layers. Such mappings take as the input the state vector and predict the
control law as the output. The learning takes place using training data
generated from off-line MPC simulations. However, a general framework for
characterization of learning approaches in terms of both model validation and
efficient training data generation is lacking in literature. In this paper, we
take the first steps towards developing such a coherent framework. We discuss
how the learning problem has similarities with system identification, in
particular input design, model structure selection and model validation. We
consider the study of neural network architectures in PyTorch with the explicit
MPC constraints implemented as a differentiable optimization layer using CVXPY.
We propose an efficient approach of generating MPC input samples subject to the
MPC model constraints using a hit-and-run sampler. The corresponding true
outputs are generated by solving the MPC offline using OSOP. We propose
different metrics to validate the resulting approaches. Our study further aims
to explore the advantages of incorporating domain knowledge into the network
structure from a training and evaluation perspective. Different model
structures are numerically tested using the proposed framework in order to
obtain more insights in the properties of constrained neural networks based
MPC.
 
      
        Related papers
        - Sample-Efficient Reinforcement Learning of Koopman eNMPC [42.72938925647165]
 Reinforcement learning can be used to tune data-driven (economic) nonlinear model predictive controllers ((e)NMPCs) for optimal performance in a specific control task.
We combine a model-based RL algorithm with our published method that turns Koopman (e)NMPCs into automatically differentiable policies.
 arXiv  Detail & Related papers  (2025-03-24T15:35:16Z)
- Reservoir computing for system identification and predictive control   with limited data [3.1484174280822845]
 We assess the ability of RNN variants to both learn the dynamics of benchmark control systems and serve as surrogate models for model predictive control (MPC)
We find that echo state networks (ESNs) have a variety of benefits over competing architectures, namely reductions in computational complexity, longer valid prediction times, and reductions in cost of the MPC objective function.
 arXiv  Detail & Related papers  (2024-10-23T21:59:07Z)
- Dropout MPC: An Ensemble Neural MPC Approach for Systems with Learned   Dynamics [0.0]
 We propose a novel sampling-based ensemble neural MPC algorithm that employs the Monte-Carlo dropout technique on the learned system model.
The method aims in general at uncertain systems with complex dynamics, where models derived from first principles are hard to infer.
 arXiv  Detail & Related papers  (2024-06-04T17:15:25Z)
- Efficient and Flexible Neural Network Training through Layer-wise   Feedback Propagation [49.44309457870649]
 Layer-wise Feedback feedback (LFP) is a novel training principle for neural network-like predictors.<n>LFP decomposes a reward to individual neurons based on their respective contributions.<n>Our method then implements a greedy reinforcing approach helpful parts of the network and weakening harmful ones.
 arXiv  Detail & Related papers  (2023-08-23T10:48:28Z)
- End-to-End Reinforcement Learning of Koopman Models for Economic   Nonlinear Model Predictive Control [45.84205238554709]
 We present a method for reinforcement learning of Koopman surrogate models for optimal performance as part of (e)NMPC.
We show that the end-to-end trained models outperform those trained using system identification in (e)NMPC.
 arXiv  Detail & Related papers  (2023-08-03T10:21:53Z)
- Model Predictive Control via On-Policy Imitation Learning [28.96122879515294]
 We develop new sample complexity results and performance guarantees for data-driven Model Predictive Control.
Our algorithm uses the structure of constrained linear MPC, and our analysis uses the properties of the explicit MPC solution to theoretically bound the number of online MPC trajectories needed to achieve optimal performance.
 arXiv  Detail & Related papers  (2022-10-17T16:06:06Z)
- Fitting a Directional Microstructure Model to Diffusion-Relaxation MRI
  Data with Self-Supervised Machine Learning [2.8167227950959206]
 Self-supervised machine learning is emerging as an attractive alternative to supervised learning.
In this paper, we demonstrate self-supervised machine learning model fitting for a directional microstructural model.
Our approach shows clear improvements in parameter estimation and computational time, compared to standard non-linear least squares fitting.
 arXiv  Detail & Related papers  (2022-10-05T15:51:39Z)
- Learning to Learn with Generative Models of Neural Network Checkpoints [71.06722933442956]
 We construct a dataset of neural network checkpoints and train a generative model on the parameters.
We find that our approach successfully generates parameters for a wide range of loss prompts.
We apply our method to different neural network architectures and tasks in supervised and reinforcement learning.
 arXiv  Detail & Related papers  (2022-09-26T17:59:58Z)
- Real-time Neural-MPC: Deep Learning Model Predictive Control for
  Quadrotors and Agile Robotic Platforms [59.03426963238452]
 We present Real-time Neural MPC, a framework to efficiently integrate large, complex neural network architectures as dynamics models within a model-predictive control pipeline.
We show the feasibility of our framework on real-world problems by reducing the positional tracking error by up to 82% when compared to state-of-the-art MPC approaches without neural network dynamics.
 arXiv  Detail & Related papers  (2022-03-15T09:38:15Z)
- Gone Fishing: Neural Active Learning with Fisher Embeddings [55.08537975896764]
 There is an increasing need for active learning algorithms that are compatible with deep neural networks.
This article introduces BAIT, a practical representation of tractable, and high-performing active learning algorithm for neural networks.
 arXiv  Detail & Related papers  (2021-06-17T17:26:31Z)
- Edge-assisted Democratized Learning Towards Federated Analytics [67.44078999945722]
 We show the hierarchical learning structure of the proposed edge-assisted democratized learning mechanism, namely Edge-DemLearn.
We also validate Edge-DemLearn as a flexible model training mechanism to build a distributed control and aggregation methodology in regions.
 arXiv  Detail & Related papers  (2020-12-01T11:46:03Z)
- Belief Propagation Reloaded: Learning BP-Layers for Labeling Problems [83.98774574197613]
 We take one of the simplest inference methods, a truncated max-product Belief  propagation, and add what is necessary to make it a proper component of a deep learning model.
This BP-Layer can be used as the final or an intermediate block in convolutional neural networks (CNNs)
The model is applicable to a range of dense prediction problems, is well-trainable and provides parameter-efficient and robust solutions in stereo, optical flow and semantic segmentation.
 arXiv  Detail & Related papers  (2020-03-13T13:11:35Z)
- Information Theoretic Model Predictive Q-Learning [64.74041985237105]
 We present a novel theoretical connection between information theoretic MPC and entropy regularized RL.
We develop a Q-learning algorithm that can leverage biased models.
 arXiv  Detail & Related papers  (2019-12-31T00:29:22Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
       
     
           This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.