Data-based Discovery of Governing Equations
- URL: http://arxiv.org/abs/2012.06036v2
- Date: Mon, 21 Dec 2020 17:23:37 GMT
- Title: Data-based Discovery of Governing Equations
- Authors: Waad Subber, Piyush Pandita, Sayan Ghosh, Genghis Khan, Liping Wang,
Roger Ghanem
- Abstract summary: We propose a Data-based Physics Discovery (DPD) framework for automatic discovery of governing equations from observed data.
We demonstrate the performance of the proposed framework on a real-world application in the aerospace industry.
- Score: 1.574365819926238
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Most common mechanistic models are traditionally presented in mathematical
forms to explain a given physical phenomenon. Machine learning algorithms, on
the other hand, provide a mechanism to map the input data to output without
explicitly describing the underlying physical process that generated the data.
We propose a Data-based Physics Discovery (DPD) framework for automatic
discovery of governing equations from observed data. Without a prior definition
of the model structure, first a free-form of the equation is discovered, and
then calibrated and validated against the available data. In addition to the
observed data, the DPD framework can utilize available prior physical models,
and domain expert feedback. When prior models are available, the DPD framework
can discover an additive or multiplicative correction term represented
symbolically. The correction term can be a function of the existing input
variable to the prior model, or a newly introduced variable. In case a prior
model is not available, the DPD framework discovers a new data-based standalone
model governing the observations. We demonstrate the performance of the
proposed framework on a real-world application in the aerospace industry.
Related papers
- No Equations Needed: Learning System Dynamics Without Relying on Closed-Form ODEs [56.78271181959529]
This paper proposes a conceptual shift to modeling low-dimensional dynamical systems by departing from the traditional two-step modeling process.
Instead of first discovering a closed-form equation and then analyzing it, our approach, direct semantic modeling, predicts the semantic representation of the dynamical system.
Our approach not only simplifies the modeling pipeline but also enhances the transparency and flexibility of the resulting models.
arXiv Detail & Related papers (2025-01-30T18:36:48Z) - Modeling and Discovering Direct Causes for Predictive Models [0.0]
We introduce a causal modeling framework that captures the input-output behavior of predictive models.
The framework enables us to define and identify features that directly cause the predictions.
We show two assumptions under which the direct causes can be discovered from data, one of which further simplifies the discovery process.
arXiv Detail & Related papers (2024-12-03T22:25:42Z) - SymbolFit: Automatic Parametric Modeling with Symbolic Regression [1.2662552408022727]
We introduce SymbolFit, a framework that automates parametric modeling by using symbolic regression to perform a machine-search for functions that fit the data.
Our approach is demonstrated in data analysis applications in high-energy physics experiments at the CERN Large Hadron Collider.
arXiv Detail & Related papers (2024-11-15T00:09:37Z) - Influence Functions for Scalable Data Attribution in Diffusion Models [52.92223039302037]
Diffusion models have led to significant advancements in generative modelling.
Yet their widespread adoption poses challenges regarding data attribution and interpretability.
We develop an influence functions framework to address these challenges.
arXiv Detail & Related papers (2024-10-17T17:59:02Z) - Data Shapley in One Training Run [88.59484417202454]
Data Shapley provides a principled framework for attributing data's contribution within machine learning contexts.
Existing approaches require re-training models on different data subsets, which is computationally intensive.
This paper introduces In-Run Data Shapley, which addresses these limitations by offering scalable data attribution for a target model of interest.
arXiv Detail & Related papers (2024-06-16T17:09:24Z) - Synthetic location trajectory generation using categorical diffusion
models [50.809683239937584]
Diffusion models (DPMs) have rapidly evolved to be one of the predominant generative models for the simulation of synthetic data.
We propose using DPMs for the generation of synthetic individual location trajectories (ILTs) which are sequences of variables representing physical locations visited by individuals.
arXiv Detail & Related papers (2024-02-19T15:57:39Z) - Discovering Interpretable Physical Models using Symbolic Regression and
Discrete Exterior Calculus [55.2480439325792]
We propose a framework that combines Symbolic Regression (SR) and Discrete Exterior Calculus (DEC) for the automated discovery of physical models.
DEC provides building blocks for the discrete analogue of field theories, which are beyond the state-of-the-art applications of SR to physical problems.
We prove the effectiveness of our methodology by re-discovering three models of Continuum Physics from synthetic experimental data.
arXiv Detail & Related papers (2023-10-10T13:23:05Z) - $Φ$-DVAE: Physics-Informed Dynamical Variational Autoencoders for Unstructured Data Assimilation [3.2873782624127843]
We develop a physics-informed dynamical variational autoencoder ($Phi$-DVAE) to embed diverse data streams into time-evolving physical systems.
Our approach combines a standard, possibly nonlinear, filter for the latent state-space model and a VAE, to assimilate the unstructured data into the latent dynamical system.
A variational Bayesian framework is used for the joint estimation of the encoding, latent states, and unknown system parameters.
arXiv Detail & Related papers (2022-09-30T17:34:48Z) - Mixed Effects Neural ODE: A Variational Approximation for Analyzing the
Dynamics of Panel Data [50.23363975709122]
We propose a probabilistic model called ME-NODE to incorporate (fixed + random) mixed effects for analyzing panel data.
We show that our model can be derived using smooth approximations of SDEs provided by the Wong-Zakai theorem.
We then derive Evidence Based Lower Bounds for ME-NODE, and develop (efficient) training algorithms.
arXiv Detail & Related papers (2022-02-18T22:41:51Z) - Closed-form discovery of structural errors in models of chaotic systems
by integrating Bayesian sparse regression and data assimilation [0.0]
We introduce a framework named MEDIDA: Model Error Discovery with Interpretability and Data Assimilation.
In MEDIDA, first the model error is estimated from differences between the observed states and model-predicted states.
If observations are noisy, a data assimilation technique such as ensemble Kalman filter (EnKF) is first used to provide a noise-free analysis state of the system.
Finally, an equation-discovery technique, such as the relevance vector machine (RVM), a sparsity-promoting Bayesian method, is used to identify an interpretable, parsimonious, closed
arXiv Detail & Related papers (2021-10-01T17:19:28Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.