Mixed Effects Neural ODE: A Variational Approximation for Analyzing the
Dynamics of Panel Data
- URL: http://arxiv.org/abs/2202.09463v1
- Date: Fri, 18 Feb 2022 22:41:51 GMT
- Title: Mixed Effects Neural ODE: A Variational Approximation for Analyzing the
Dynamics of Panel Data
- Authors: Jurijs Nazarovs, Rudrasis Chakraborty, Songwong Tasneeyapant, Sathya
N. Ravi, Vikas Singh
- Abstract summary: We propose a probabilistic model called ME-NODE to incorporate (fixed + random) mixed effects for analyzing panel data.
We show that our model can be derived using smooth approximations of SDEs provided by the Wong-Zakai theorem.
We then derive Evidence Based Lower Bounds for ME-NODE, and develop (efficient) training algorithms.
- Score: 50.23363975709122
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Panel data involving longitudinal measurements of the same set of
participants taken over multiple time points is common in studies to understand
childhood development and disease modeling. Deep hybrid models that marry the
predictive power of neural networks with physical simulators such as
differential equations, are starting to drive advances in such applications.
The task of modeling not just the observations but the hidden dynamics that are
captured by the measurements poses interesting statistical/computational
questions. We propose a probabilistic model called ME-NODE to incorporate
(fixed + random) mixed effects for analyzing such panel data. We show that our
model can be derived using smooth approximations of SDEs provided by the
Wong-Zakai theorem. We then derive Evidence Based Lower Bounds for ME-NODE, and
develop (efficient) training algorithms using MC based sampling methods and
numerical ODE solvers. We demonstrate ME-NODE's utility on tasks spanning the
spectrum from simulations and toy data to real longitudinal 3D imaging data
from an Alzheimer's disease (AD) study, and study its performance in terms of
accuracy of reconstruction for interpolation, uncertainty estimates and
personalized prediction.
Related papers
- Trajectory Flow Matching with Applications to Clinical Time Series Modeling [77.58277281319253]
Trajectory Flow Matching (TFM) trains a Neural SDE in a simulation-free manner, bypassing backpropagation through the dynamics.
We demonstrate improved performance on three clinical time series datasets in terms of absolute performance and uncertainty prediction.
arXiv Detail & Related papers (2024-10-28T15:54:50Z) - Diffusion posterior sampling for simulation-based inference in tall data settings [53.17563688225137]
Simulation-based inference ( SBI) is capable of approximating the posterior distribution that relates input parameters to a given observation.
In this work, we consider a tall data extension in which multiple observations are available to better infer the parameters of the model.
We compare our method to recently proposed competing approaches on various numerical experiments and demonstrate its superiority in terms of numerical stability and computational cost.
arXiv Detail & Related papers (2024-04-11T09:23:36Z) - Synthetic location trajectory generation using categorical diffusion
models [50.809683239937584]
Diffusion models (DPMs) have rapidly evolved to be one of the predominant generative models for the simulation of synthetic data.
We propose using DPMs for the generation of synthetic individual location trajectories (ILTs) which are sequences of variables representing physical locations visited by individuals.
arXiv Detail & Related papers (2024-02-19T15:57:39Z) - Towards Theoretical Understandings of Self-Consuming Generative Models [56.84592466204185]
This paper tackles the emerging challenge of training generative models within a self-consuming loop.
We construct a theoretical framework to rigorously evaluate how this training procedure impacts the data distributions learned by future models.
We present results for kernel density estimation, delivering nuanced insights such as the impact of mixed data training on error propagation.
arXiv Detail & Related papers (2024-02-19T02:08:09Z) - Predicting Multi-Joint Kinematics of the Upper Limb from EMG Signals
Across Varied Loads with a Physics-Informed Neural Network [0.0]
The PINN model is constructed by combining a feed-forward Artificial Neural Network (ANN) with a joint torque model.
The training dataset for the PINN model comprises EMG and time data collected from four different subjects.
The results demonstrated strong correlations of 58% to 83% in joint angle prediction.
arXiv Detail & Related papers (2023-11-28T16:55:11Z) - Capturing dynamical correlations using implicit neural representations [85.66456606776552]
We develop an artificial intelligence framework which combines a neural network trained to mimic simulated data from a model Hamiltonian with automatic differentiation to recover unknown parameters from experimental data.
In doing so, we illustrate the ability to build and train a differentiable model only once, which then can be applied in real-time to multi-dimensional scattering data.
arXiv Detail & Related papers (2023-04-08T07:55:36Z) - Gradient-based Bayesian Experimental Design for Implicit Models using
Mutual Information Lower Bounds [20.393359858407162]
We introduce a framework for Bayesian experimental design (BED) with implicit models, where the data-generating distribution is intractable but sampling from it is still possible.
In order to find optimal experimental designs for such models, our approach maximises mutual information lower bounds that are parametrised by neural networks.
By training a neural network on sampled data, we simultaneously update network parameters and designs using gradient-ascent.
arXiv Detail & Related papers (2021-05-10T13:59:25Z) - Data-driven Identification of 2D Partial Differential Equations using
extracted physical features [0.0]
We propose an ML method to discover the terms involved in the equation from two-dimensional data.
This idea provides us with the ability to discover 2D equations with time derivatives of different orders, and also to identify new underlying physics on which the model has not been trained.
The results indicate robustness of the features extracted based on prior knowledge in comparison to automatically detected features by a Three-dimensional Convolutional Neural Network (3D CNN) given the same amounts of data.
arXiv Detail & Related papers (2020-10-20T21:06:50Z) - DeepCOVIDNet: An Interpretable Deep Learning Model for Predictive
Surveillance of COVID-19 Using Heterogeneous Features and their Interactions [2.30238915794052]
We propose a deep learning model to forecast the range of increase in COVID-19 infected cases in future days.
Using data collected from various sources, we estimate the range of increase in infected cases seven days into the future for all U.S. counties.
arXiv Detail & Related papers (2020-07-31T23:37:38Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.