Synthetic data generation for system identification: leveraging
knowledge transfer from similar systems
- URL: http://arxiv.org/abs/2403.05164v1
- Date: Fri, 8 Mar 2024 09:09:15 GMT
- Title: Synthetic data generation for system identification: leveraging
knowledge transfer from similar systems
- Authors: Dario Piga, Matteo Rufolo, Gabriele Maroni, Manas Mejari, Marco
Forgione
- Abstract summary: This paper introduces a novel approach for the generation of synthetic data, aimed at enhancing model generalization and robustness in scenarios characterized by data scarcity.
Synthetic data is generated through a pre-trained meta-model that describes a broad class of systems to which the system of interest is assumed to belong.
The efficacy of the approach is shown through a numerical example that highlights the advantages of integrating synthetic data into the system identification process.
- Score: 0.3749861135832073
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: This paper addresses the challenge of overfitting in the learning of
dynamical systems by introducing a novel approach for the generation of
synthetic data, aimed at enhancing model generalization and robustness in
scenarios characterized by data scarcity. Central to the proposed methodology
is the concept of knowledge transfer from systems within the same class.
Specifically, synthetic data is generated through a pre-trained meta-model that
describes a broad class of systems to which the system of interest is assumed
to belong. Training data serves a dual purpose: firstly, as input to the
pre-trained meta model to discern the system's dynamics, enabling the
prediction of its behavior and thereby generating synthetic output sequences
for new input sequences; secondly, in conjunction with synthetic data, to
define the loss function used for model estimation. A validation dataset is
used to tune a scalar hyper-parameter balancing the relative importance of
training and synthetic data in the definition of the loss function. The same
validation set can be also used for other purposes, such as early stopping
during the training, fundamental to avoid overfitting in case of small-size
training datasets. The efficacy of the approach is shown through a numerical
example that highlights the advantages of integrating synthetic data into the
system identification process.
Related papers
- InVAErt networks: a data-driven framework for model synthesis and
identifiability analysis [0.0]
inVAErt is a framework for data-driven analysis and synthesis of physical systems.
It uses a deterministic decoder to represent the forward and inverse maps, a normalizing flow to capture the probabilistic distribution of system outputs, and a variational encoder to learn a compact latent representation for the lack of bijectivity between inputs and outputs.
arXiv Detail & Related papers (2023-07-24T07:58:18Z) - Learning Latent Dynamics via Invariant Decomposition and
(Spatio-)Temporal Transformers [0.6767885381740952]
We propose a method for learning dynamical systems from high-dimensional empirical data.
We focus on the setting in which data are available from multiple different instances of a system.
We study behaviour through simple theoretical analyses and extensive experiments on synthetic and real-world datasets.
arXiv Detail & Related papers (2023-06-21T07:52:07Z) - Learning Dynamical Systems by Leveraging Data from Similar Systems [1.3053088049015455]
We consider the problem of learning the dynamics of a linear system when one has access to data generated by an auxiliary system.
We show that the auxiliary data can help to reduce the intrinsic system identification error due to noise.
arXiv Detail & Related papers (2023-02-08T21:27:16Z) - Capturing Actionable Dynamics with Structured Latent Ordinary
Differential Equations [68.62843292346813]
We propose a structured latent ODE model that captures system input variations within its latent representation.
Building on a static variable specification, our model learns factors of variation for each input to the system, thus separating the effects of the system inputs in the latent space.
arXiv Detail & Related papers (2022-02-25T20:00:56Z) - Supervised DKRC with Images for Offline System Identification [77.34726150561087]
Modern dynamical systems are becoming increasingly non-linear and complex.
There is a need for a framework to model these systems in a compact and comprehensive representation for prediction and control.
Our approach learns these basis functions using a supervised learning approach.
arXiv Detail & Related papers (2021-09-06T04:39:06Z) - Using Data Assimilation to Train a Hybrid Forecast System that Combines
Machine-Learning and Knowledge-Based Components [52.77024349608834]
We consider the problem of data-assisted forecasting of chaotic dynamical systems when the available data is noisy partial measurements.
We show that by using partial measurements of the state of the dynamical system, we can train a machine learning model to improve predictions made by an imperfect knowledge-based model.
arXiv Detail & Related papers (2021-02-15T19:56:48Z) - Model-Based Deep Learning [155.063817656602]
Signal processing, communications, and control have traditionally relied on classical statistical modeling techniques.
Deep neural networks (DNNs) use generic architectures which learn to operate from data, and demonstrate excellent performance.
We are interested in hybrid techniques that combine principled mathematical models with data-driven systems to benefit from the advantages of both approaches.
arXiv Detail & Related papers (2020-12-15T16:29:49Z) - A Novel Anomaly Detection Algorithm for Hybrid Production Systems based
on Deep Learning and Timed Automata [73.38551379469533]
DAD:DeepAnomalyDetection is a new approach for automatic model learning and anomaly detection in hybrid production systems.
It combines deep learning and timed automata for creating behavioral model from observations.
The algorithm has been applied to few data sets including two from real systems and has shown promising results.
arXiv Detail & Related papers (2020-10-29T08:27:43Z) - How Training Data Impacts Performance in Learning-based Control [67.7875109298865]
This paper derives an analytical relationship between the density of the training data and the control performance.
We formulate a quality measure for the data set, which we refer to as $rho$-gap.
We show how the $rho$-gap can be applied to a feedback linearizing control law.
arXiv Detail & Related papers (2020-05-25T12:13:49Z) - On the Estimation of Complex Circuits Functional Failure Rate by Machine
Learning Techniques [0.16311150636417257]
De-Rating or Vulnerability Factors are a major feature of failure analysis efforts mandated by today's Functional Safety requirements.
New approach is proposed which uses Machine Learning to estimate the Functional De-Rating of individual flip-flops.
arXiv Detail & Related papers (2020-02-18T15:18:31Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.