Beyond Occam's Razor in System Identification: Double-Descent when
Modeling Dynamics
- URL: http://arxiv.org/abs/2012.06341v1
- Date: Fri, 11 Dec 2020 13:34:56 GMT
- Title: Beyond Occam's Razor in System Identification: Double-Descent when
Modeling Dynamics
- Authors: Ant\^onio H. Ribeiro, Johannes N. Hendriks, Adrian G. Wills, Thomas B.
Sch\"on
- Abstract summary: System identification aims to build models of dynamical systems from data.
It is typically observed that model validation performance follows a U-shaped curve as the model complexity increases.
Recent developments in machine learning and statistics have observed situations where a "double-descent" curve subsumes this U-shaped model-performance curve.
- Score: 0.0
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: System identification aims to build models of dynamical systems from data.
Traditionally, choosing the model requires the designer to balance between two
goals of conflicting nature; the model must be rich enough to capture the
system dynamics, but not so flexible that it learns spurious random effects
from the dataset. It is typically observed that model validation performance
follows a U-shaped curve as the model complexity increases. Recent developments
in machine learning and statistics, however, have observed situations where a
"double-descent" curve subsumes this U-shaped model-performance curve. With a
second decrease in performance occurring beyond the point where the model has
reached the capacity of interpolating - i.e., (near) perfectly fitting - the
training data. To the best of our knowledge, however, such phenomena have not
been studied within the context of the identification of dynamic systems. The
present paper aims to answer the question: "Can such a phenomenon also be
observed when estimating parameters of dynamic systems?" We show the answer is
yes, verifying such behavior experimentally both for artificially generated and
real-world datasets.
Related papers
- Learning Latent Dynamics via Invariant Decomposition and
(Spatio-)Temporal Transformers [0.6767885381740952]
We propose a method for learning dynamical systems from high-dimensional empirical data.
We focus on the setting in which data are available from multiple different instances of a system.
We study behaviour through simple theoretical analyses and extensive experiments on synthetic and real-world datasets.
arXiv Detail & Related papers (2023-06-21T07:52:07Z) - Pseudo-Hamiltonian system identification [0.0]
We consider systems that can be modelled as first-order ordinary differential equations.
We are able to learn the analytic terms of internal dynamics even if the model is trained on data where the system is affected by unknown damping and external disturbances.
arXiv Detail & Related papers (2023-05-09T15:22:05Z) - Capturing dynamical correlations using implicit neural representations [85.66456606776552]
We develop an artificial intelligence framework which combines a neural network trained to mimic simulated data from a model Hamiltonian with automatic differentiation to recover unknown parameters from experimental data.
In doing so, we illustrate the ability to build and train a differentiable model only once, which then can be applied in real-time to multi-dimensional scattering data.
arXiv Detail & Related papers (2023-04-08T07:55:36Z) - Physics-Inspired Temporal Learning of Quadrotor Dynamics for Accurate
Model Predictive Trajectory Tracking [76.27433308688592]
Accurately modeling quadrotor's system dynamics is critical for guaranteeing agile, safe, and stable navigation.
We present a novel Physics-Inspired Temporal Convolutional Network (PI-TCN) approach to learning quadrotor's system dynamics purely from robot experience.
Our approach combines the expressive power of sparse temporal convolutions and dense feed-forward connections to make accurate system predictions.
arXiv Detail & Related papers (2022-06-07T13:51:35Z) - Capturing Actionable Dynamics with Structured Latent Ordinary
Differential Equations [68.62843292346813]
We propose a structured latent ODE model that captures system input variations within its latent representation.
Building on a static variable specification, our model learns factors of variation for each input to the system, thus separating the effects of the system inputs in the latent space.
arXiv Detail & Related papers (2022-02-25T20:00:56Z) - Learning continuous models for continuous physics [94.42705784823997]
We develop a test based on numerical analysis theory to validate machine learning models for science and engineering applications.
Our results illustrate how principled numerical analysis methods can be coupled with existing ML training/testing methodologies to validate models for science and engineering applications.
arXiv Detail & Related papers (2022-02-17T07:56:46Z) - Leveraging the structure of dynamical systems for data-driven modeling [111.45324708884813]
We consider the impact of the training set and its structure on the quality of the long-term prediction.
We show how an informed design of the training set, based on invariants of the system and the structure of the underlying attractor, significantly improves the resulting models.
arXiv Detail & Related papers (2021-12-15T20:09:20Z) - Learning Low-Dimensional Quadratic-Embeddings of High-Fidelity Nonlinear
Dynamics using Deep Learning [9.36739413306697]
Learning dynamical models from data plays a vital role in engineering design, optimization, and predictions.
We use deep learning to identify low-dimensional embeddings for high-fidelity dynamical systems.
arXiv Detail & Related papers (2021-11-25T10:09:00Z) - Using scientific machine learning for experimental bifurcation analysis
of dynamic systems [2.204918347869259]
This study focuses on training universal differential equation (UDE) models for physical nonlinear dynamical systems with limit cycles.
We consider examples where training data is generated by numerical simulations, whereas we also employ the proposed modelling concept to physical experiments.
We use both neural networks and Gaussian processes as universal approximators alongside the mechanistic models to give a critical assessment of the accuracy and robustness of the UDE modelling approach.
arXiv Detail & Related papers (2021-10-22T15:43:03Z) - Using Data Assimilation to Train a Hybrid Forecast System that Combines
Machine-Learning and Knowledge-Based Components [52.77024349608834]
We consider the problem of data-assisted forecasting of chaotic dynamical systems when the available data is noisy partial measurements.
We show that by using partial measurements of the state of the dynamical system, we can train a machine learning model to improve predictions made by an imperfect knowledge-based model.
arXiv Detail & Related papers (2021-02-15T19:56:48Z) - Bridging the Gap: Machine Learning to Resolve Improperly Modeled
Dynamics [4.940323406667406]
We present a data-driven modeling strategy to overcome improperly modeled dynamics for systems exhibiting complex-temporal behaviors.
We propose a Deep Learning framework to resolve the differences between the true dynamics of the system and the dynamics given by a model of the system that is either inaccurately or inadequately described.
arXiv Detail & Related papers (2020-08-23T04:57:02Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.