A toolkit for data-driven discovery of governing equations in high-noise regimes
- URL: http://arxiv.org/abs/2111.04870v1
- Date: Mon, 8 Nov 2021 23:32:11 GMT
- Title: A toolkit for data-driven discovery of governing equations in high-noise regimes
- Authors: Charles B. Delahunt and J. Nathan Kutz
- Abstract summary: We consider the data-driven discovery of governing equations from time-series data in the limit of high noise.
We offer two primary contributions, both focused on noisy data acquired from a system x' = f(x).
- Score: 5.025654873456756
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: We consider the data-driven discovery of governing equations from time-series
data in the limit of high noise. The algorithms developed describe an extensive
toolkit of methods for circumventing the deleterious effects of noise in the
context of the sparse identification of nonlinear dynamics (SINDy) framework.
We offer two primary contributions, both focused on noisy data acquired from a
system x' = f(x). First, we propose, for use in high-noise settings, an
extensive toolkit of critically enabling extensions for the SINDy regression
method, to progressively cull functionals from an over-complete library and
yield a set of sparse equations that regress to the derivative x'. These
innovations can extract sparse governing equations and coefficients from
high-noise time-series data (e.g., 300% added noise). For example, the method discovers
the correct sparse libraries for the Lorenz system, with median coefficient-estimate
errors of 1%-3% at 50% noise, 6%-8% at 100% noise, and 23%-25% at 300% noise.
The enabling modules in the toolkit are combined
into a single method, but the individual modules can be tactically applied in
other equation discovery methods (SINDy or not) to improve results on
high-noise data. Second, we propose a technique, applicable to any model
discovery method based on x' = f(x), to assess the accuracy of a discovered
model in the context of non-unique solutions due to noisy data. Currently, this
non-uniqueness can obscure a discovered model's accuracy and thus a discovery
method's effectiveness. The technique uses linear dependencies
among functionals to transform a discovered model into the equivalent form
closest to the true model, enabling a more faithful assessment of a discovered
model's accuracy.
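
For orientation, the regression that the toolkit's culling modules extend is sequentially thresholded least squares (STLSQ) over a candidate library. The sketch below is a minimal, generic STLSQ loop in NumPy; it is the standard SINDy baseline, not the paper's toolkit, and the library, threshold, and iteration count are illustrative choices.

```python
import numpy as np

def poly_library(X):
    """Illustrative quadratic library in (x, y, z); the paper's toolkit
    starts from a much larger, over-complete set of functionals."""
    x, y, z = X.T
    cols = [np.ones_like(x), x, y, z, x * x, x * y, x * z, y * y, y * z, z * z]
    return np.stack(cols, axis=1)

def stlsq(theta, dxdt, threshold=0.1, n_iter=10):
    """Sequentially thresholded least squares: the basic culling loop that
    SINDy-style methods build on.

    theta : (n_samples, n_features) library evaluated on the trajectory
    dxdt  : (n_samples, n_states) estimated derivatives x'
    Returns xi with theta @ xi ~ dxdt, sparsified by thresholding.
    """
    xi, *_ = np.linalg.lstsq(theta, dxdt, rcond=None)
    for _ in range(n_iter):
        small = np.abs(xi) < threshold            # cull weak functionals
        xi[small] = 0.0
        for k in range(dxdt.shape[1]):            # refit survivors per state
            keep = ~small[:, k]
            if keep.any():
                xi[keep, k], *_ = np.linalg.lstsq(
                    theta[:, keep], dxdt[:, k], rcond=None)
    return xi
```

On noisy data the derivatives dxdt must themselves be estimated (e.g., by smoothing the time series first), which is precisely where high-noise extensions of this basic loop become necessary.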
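
The second contribution can be sketched as a projection: when library functionals are linearly dependent on the data, a coefficient vector can be shifted along the null space of the library matrix without changing the fitted model, and the shift that lands nearest the true coefficients gives the equivalent form against which accuracy should be judged. This is a hedged sketch assuming an SVD-based null space and an L2 notion of "closest"; the abstract does not spell out the paper's exact procedure.

```python
import numpy as np

def closest_equivalent(xi, xi_true, theta, tol=1e-8):
    """Shift discovered coefficients xi along null(theta) so that
    theta @ xi is unchanged on the data while xi moves as close as
    possible (in L2) to the true coefficients xi_true."""
    # Orthonormal null-space basis: right singular vectors whose singular
    # values are numerically zero (plus any beyond rank when p > n_samples).
    _, s, vt = np.linalg.svd(theta, full_matrices=True)
    null_rows = np.ones(vt.shape[0], dtype=bool)
    null_rows[: len(s)] = s < tol * s.max()    # rows past len(s) stay null
    N = vt[null_rows].T                        # (n_features, null dimension)
    if N.size == 0:
        return xi                              # no dependencies: model unique
    # N has orthonormal columns, so the minimizer of
    # ||xi + N @ z - xi_true|| is z = N.T @ (xi_true - xi).
    return xi + N @ (N.T @ (xi_true - xi))
```

Comparing xi_true against closest_equivalent(xi, ...) rather than against the raw xi prevents a harmless null-space offset from masquerading as model error.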
Related papers
- Accelerated zero-order SGD under high-order smoothness and overparameterized regime [79.85163929026146]
We present a novel gradient-free algorithm to solve convex optimization problems.
Such problems are encountered in medicine, physics, and machine learning.
We provide convergence guarantees for the proposed algorithm under both types of noise.
arXiv Detail & Related papers (2024-11-21T10:26:17Z)
- Equation Discovery with Bayesian Spike-and-Slab Priors and Efficient Kernels [57.46832672991433]
We propose a novel equation discovery method based on Kernel learning and BAyesian Spike-and-Slab priors (KBASS).
We use kernel regression to estimate the target function, which is flexible, expressive, and more robust to data sparsity and noise.
We develop an expectation-propagation expectation-maximization algorithm for efficient posterior inference and function estimation.
arXiv Detail & Related papers (2023-10-09T03:55:09Z)
- Score-based Diffusion Models in Function Space [140.792362459734]
Diffusion models have recently emerged as a powerful framework for generative modeling.
We introduce a mathematically rigorous framework called Denoising Diffusion Operators (DDOs) for training diffusion models in function space.
We show that the corresponding discretized algorithm generates accurate samples at a fixed cost independent of the data resolution.
arXiv Detail & Related papers (2023-02-14T23:50:53Z)
- Improving the Robustness of Summarization Models by Detecting and Removing Input Noise [50.27105057899601]
We present a large empirical study quantifying the sometimes severe loss in performance from different types of input noise for a range of datasets and model sizes.
We propose a lightweight method for detecting and removing such noise in the input during model inference without requiring any training, auxiliary models, or even prior knowledge of the type of noise.
arXiv Detail & Related papers (2022-12-20T00:33:11Z)
- Learning from aggregated data with a maximum entropy model [73.63512438583375]
We show how a new model, similar to a logistic regression, may be learned from aggregated data alone by approximating the unobserved feature distribution with a maximum entropy hypothesis.
We present empirical evidence on several public datasets that the model learned this way can achieve performance comparable to that of a logistic model trained on the full, unaggregated data.
arXiv Detail & Related papers (2022-10-05T09:17:27Z)
- An end-to-end deep learning approach for extracting stochastic dynamical systems with $\alpha$-stable Lévy noise [5.815325960286111]
In this work, we identify dynamical systems driven by $\alpha$-stable Lévy noise from only random pairwise data.
Our innovations include: (1) designing a deep learning approach to learn both drift and diffusion terms for Lévy-induced noise across all values of $\alpha$, (2) learning complex multiplicative noise without the usual restriction to small noise intensities, and (3) proposing a complete end-to-end framework for system identification.
arXiv Detail & Related papers (2022-01-31T10:51:25Z)
- Learning Summary Statistics for Bayesian Inference with Autoencoders [58.720142291102135]
We use the inner dimension of deep neural network-based autoencoders as summary statistics.
To create an incentive for the encoder to encode all the parameter-related information but not the noise, we give the decoder access to explicit or implicit information that has been used to generate the training data.
arXiv Detail & Related papers (2022-01-28T12:00:31Z)
- Learning Dynamics from Noisy Measurements using Deep Learning with a Runge-Kutta Constraint [9.36739413306697]
We discuss a methodology to learn differential equations from noisy and sparsely sampled measurements.
The main innovation of our methodology is the integration of deep neural networks with a classical numerical integration method (a minimal sketch of this coupling appears after this list).
arXiv Detail & Related papers (2021-09-23T15:43:45Z)
- Automatic Differentiation to Simultaneously Identify Nonlinear Dynamics and Extract Noise Probability Distributions from Data [4.996878640124385]
SINDy is a framework for the discovery of parsimonious dynamic models and equations from time-series data.
We develop a variant of the SINDy algorithm that integrates automatic differentiation and recent time-stepping constraints motivated by Rudy et al.
We show the method can identify a diversity of probability distributions including Gaussian, uniform, Gamma, and Rayleigh.
arXiv Detail & Related papers (2020-09-12T23:52:25Z)
- Weak SINDy For Partial Differential Equations [0.0]
We extend our Weak SINDy (WSINDy) framework to the setting of partial differential equations (PDEs).
The elimination of pointwise derivative approximations via the weak form enables effective machine-precision recovery of model coefficients from noise-free data.
We demonstrate WSINDy's robustness, speed, and accuracy on several challenging PDEs (a weak-form sketch for the ODE case appears after this list).
arXiv Detail & Related papers (2020-07-06T16:03:51Z)
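
The Runge-Kutta-constrained entry above couples a learned vector field to a classical integrator. As a rough illustration of that coupling, and not the paper's actual architecture or loss (the summary does not specify them), one can score a candidate right-hand side f by how well an RK4 step maps each measured state to the next:

```python
import numpy as np

def rk4_step(f, x, dt):
    """One classical fourth-order Runge-Kutta step of the vector field f."""
    k1 = f(x)
    k2 = f(x + 0.5 * dt * k1)
    k3 = f(x + 0.5 * dt * k2)
    k4 = f(x + dt * k3)
    return x + (dt / 6.0) * (k1 + 2 * k2 + 2 * k3 + k4)

def rk4_residual(f, X, dt):
    """Mean squared mismatch between successive measurements X[i+1] and the
    RK4 prediction from X[i]; minimizing this over f's parameters (with an
    autodiff framework, if f is a neural network) enforces the constraint."""
    pred = np.stack([rk4_step(f, x, dt) for x in X[:-1]])
    return float(np.mean((pred - X[1:]) ** 2))
```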
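
The Weak SINDy entry removes pointwise derivative estimates via the weak form. For the ODE case x' = Theta(x) xi, integrating against a test function phi that vanishes at the ends of its support and integrating by parts gives (integral of phi * Theta(x) dt) xi = -(integral of phi' * x dt), so no derivative of the noisy data is ever taken. A minimal sketch, assuming uniform sampling, illustrative bump test functions, and simple Riemann-sum quadrature (WSINDy's actual test functions and quadrature differ):

```python
import numpy as np

def weak_form_system(t, X, theta_fn, n_test=20):
    """Build the weak-form system G @ xi ~ b for x' = Theta(x) @ xi.

    Integration by parts against bump test functions phi removes all
    pointwise derivative estimates of the data X.
    """
    theta = theta_fn(X)                         # (n_samples, n_features)
    dt = t[1] - t[0]                            # assumes uniform sampling
    centers = np.linspace(t[0], t[-1], n_test + 2)[1:-1]
    width = (t[-1] - t[0]) / (n_test + 1)
    G, b = [], []
    for c in centers:
        u = np.clip((t - c) / width, -1.0, 1.0)
        phi = (1.0 - u**2) ** 2                 # smooth bump, zero off-window
        dphi = -4.0 * u * (1.0 - u**2) / width  # exact derivative of phi
        G.append(dt * (phi[:, None] * theta).sum(axis=0))
        b.append(-dt * (dphi[:, None] * X).sum(axis=0))
    xi, *_ = np.linalg.lstsq(np.array(G), np.array(b), rcond=None)
    return xi
```

In practice G and b would be fed to a sparsifying solver such as the STLSQ loop sketched earlier, rather than plain least squares.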
This list is automatically generated from the titles and abstracts of the papers on this site.
The site does not guarantee the quality of this information and is not responsible for any consequences of its use.