Estimating the Distribution of Parameters in Differential Equations with Repeated Cross-Sectional Data
- URL: http://arxiv.org/abs/2404.14873v1
- Date: Tue, 23 Apr 2024 10:01:43 GMT
- Title: Estimating the Distribution of Parameters in Differential Equations with Repeated Cross-Sectional Data
- Authors: Hyeontae Jo, Sung Woong Cho, Hyung Ju Hwang,
- Abstract summary: In economy, politics, and biology, observation data points in the time series are often independently obtained.
Traditional methods for parameter estimation in differential equations have limitations in estimating the shape of parameter distributions.
We introduce a novel method, Estimation of.
EPD, providing accurate distribution of parameters without loss of data information.
- Score: 5.79648227233365
- License: http://creativecommons.org/licenses/by-nc-nd/4.0/
- Abstract: Differential equations are pivotal in modeling and understanding the dynamics of various systems, offering insights into their future states through parameter estimation fitted to time series data. In fields such as economy, politics, and biology, the observation data points in the time series are often independently obtained (i.e., Repeated Cross-Sectional (RCS) data). With RCS data, we found that traditional methods for parameter estimation in differential equations, such as using mean values of time trajectories or Gaussian Process-based trajectory generation, have limitations in estimating the shape of parameter distributions, often leading to a significant loss of data information. To address this issue, we introduce a novel method, Estimation of Parameter Distribution (EPD), providing accurate distribution of parameters without loss of data information. EPD operates in three main steps: generating synthetic time trajectories by randomly selecting observed values at each time point, estimating parameters of a differential equation that minimize the discrepancy between these trajectories and the true solution of the equation, and selecting the parameters depending on the scale of discrepancy. We then evaluated the performance of EPD across several models, including exponential growth, logistic population models, and target cell-limited models with delayed virus production, demonstrating its superiority in capturing the shape of parameter distributions. Furthermore, we applied EPD to real-world datasets, capturing various shapes of parameter distributions rather than a normal distribution. These results effectively address the heterogeneity within systems, marking a substantial progression in accurately modeling systems using RCS data.
Related papers
- High-Dimensional Differential Parameter Inference in Exponential Family using Time Score Matching [13.263382678154253]
Instead of estimating a high-dimensional model at each time, we learn the differential parameter, i.e., the time derivative of the parameter.
Our methodology effectively infers differential structures in high-dimensional graphical models, verified on simulated and real-world datasets.
arXiv Detail & Related papers (2024-10-14T15:49:27Z) - On the Trajectory Regularity of ODE-based Diffusion Sampling [79.17334230868693]
Diffusion-based generative models use differential equations to establish a smooth connection between a complex data distribution and a tractable prior distribution.
In this paper, we identify several intriguing trajectory properties in the ODE-based sampling process of diffusion models.
arXiv Detail & Related papers (2024-05-18T15:59:41Z) - Diffusion posterior sampling for simulation-based inference in tall data settings [53.17563688225137]
Simulation-based inference ( SBI) is capable of approximating the posterior distribution that relates input parameters to a given observation.
In this work, we consider a tall data extension in which multiple observations are available to better infer the parameters of the model.
We compare our method to recently proposed competing approaches on various numerical experiments and demonstrate its superiority in terms of numerical stability and computational cost.
arXiv Detail & Related papers (2024-04-11T09:23:36Z) - DynGMA: a robust approach for learning stochastic differential equations from data [13.858051019755283]
We introduce novel approximations to the transition density of the parameterized SDE.
Our method exhibits superior accuracy compared to baseline methods in learning the fully unknown drift diffusion functions.
It is capable of handling data with low time resolution and variable, even uncontrollable, time step sizes.
arXiv Detail & Related papers (2024-02-22T12:09:52Z) - Synthetic location trajectory generation using categorical diffusion
models [50.809683239937584]
Diffusion models (DPMs) have rapidly evolved to be one of the predominant generative models for the simulation of synthetic data.
We propose using DPMs for the generation of synthetic individual location trajectories (ILTs) which are sequences of variables representing physical locations visited by individuals.
arXiv Detail & Related papers (2024-02-19T15:57:39Z) - Towards Theoretical Understandings of Self-Consuming Generative Models [56.84592466204185]
This paper tackles the emerging challenge of training generative models within a self-consuming loop.
We construct a theoretical framework to rigorously evaluate how this training procedure impacts the data distributions learned by future models.
We present results for kernel density estimation, delivering nuanced insights such as the impact of mixed data training on error propagation.
arXiv Detail & Related papers (2024-02-19T02:08:09Z) - A Geometric Perspective on Diffusion Models [57.27857591493788]
We inspect the ODE-based sampling of a popular variance-exploding SDE.
We establish a theoretical relationship between the optimal ODE-based sampling and the classic mean-shift (mode-seeking) algorithm.
arXiv Detail & Related papers (2023-05-31T15:33:16Z) - Benign Overfitting in Time Series Linear Model with
Over-Parameterization [5.68558935178946]
We develop a theory for excess risk of the estimator under multiple dependence types.
We show that the convergence rate of risks with short-memory processes is identical to that of cases with independent data.
arXiv Detail & Related papers (2022-04-18T15:26:58Z) - Mixed Effects Neural ODE: A Variational Approximation for Analyzing the
Dynamics of Panel Data [50.23363975709122]
We propose a probabilistic model called ME-NODE to incorporate (fixed + random) mixed effects for analyzing panel data.
We show that our model can be derived using smooth approximations of SDEs provided by the Wong-Zakai theorem.
We then derive Evidence Based Lower Bounds for ME-NODE, and develop (efficient) training algorithms.
arXiv Detail & Related papers (2022-02-18T22:41:51Z) - Coarse-grained and emergent distributed parameter systems from data [0.6117371161379209]
We derivation of PDEs from computation system data.
In particular, we focus here on the use of manifold learning techniques.
We demonstrate each approach through an established PDE example.
arXiv Detail & Related papers (2020-11-16T18:02:01Z) - Data-Space Inversion Using a Recurrent Autoencoder for Time-Series
Parameterization [0.0]
We develop and evaluate a new approach for data parameterization in data-space inversion (DSI)
The new parameterization uses a recurrent autoencoder (RAE) for dimension reduction, and a long-term memory (LSTM) network to represent flow-rate time series.
The RAE-based parameterization is clearly useful in DSI, and it may also find application in other subsurface flow problems.
arXiv Detail & Related papers (2020-04-30T19:17:58Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.