Counterfactual Outcome Prediction using Structured State Space Model
- URL: http://arxiv.org/abs/2305.09207v1
- Date: Tue, 16 May 2023 06:32:43 GMT
- Title: Counterfactual Outcome Prediction using Structured State Space Model
- Authors: Vishal Purohit
- Abstract summary: We compare the performance of two models: Treatment Effect Neural Controlled Differential Equation (TE-CDE) and structured state space model (S4Model)
S4Model is more efficient at modeling long-range dependencies and easier to train.
Our results suggest that the state space model may be a promising approach for counterfactual outcome prediction in longitudinal data.
- Score: 0.0
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Counterfactual outcome prediction in longitudinal data has recently gained
attention due to its potential applications in healthcare and social sciences.
In this paper, we explore the use of the state space model, a popular sequence
model, for this task. Specifically, we compare the performance of two models:
Treatment Effect Neural Controlled Differential Equation (TE-CDE) and
structured state space model (S4Model). While TE-CDE uses controlled
differential equations to address time-dependent confounding, it suffers from
optimization issues and slow training. In contrast, S4Model is more efficient
at modeling long-range dependencies and easier to train. We evaluate the models
on a simulated lung tumor growth dataset and find that S4Model outperforms
TE-CDE with 1.63x reduction in per epoch training time and 10x better
normalized mean squared error. Additionally, S4Model is more stable during
training and less sensitive to weight initialization than TE-CDE. Our results
suggest that the state space model may be a promising approach for
counterfactual outcome prediction in longitudinal data, with S4Model offering a
more efficient and effective alternative to TE-CDE.
Related papers
- Trajectory Flow Matching with Applications to Clinical Time Series Modeling [77.58277281319253]
Trajectory Flow Matching (TFM) trains a Neural SDE in a simulation-free manner, bypassing backpropagation through the dynamics.
We demonstrate improved performance on three clinical time series datasets in terms of absolute performance and uncertainty prediction.
arXiv Detail & Related papers (2024-10-28T15:54:50Z) - On conditional diffusion models for PDE simulations [53.01911265639582]
We study score-based diffusion models for forecasting and assimilation of sparse observations.
We propose an autoregressive sampling approach that significantly improves performance in forecasting.
We also propose a new training strategy for conditional score-based models that achieves stable performance over a range of history lengths.
arXiv Detail & Related papers (2024-10-21T18:31:04Z) - DPOT: Auto-Regressive Denoising Operator Transformer for Large-Scale PDE Pre-Training [87.90342423839876]
We present a new auto-regressive denoising pre-training strategy, which allows for more stable and efficient pre-training on PDE data.
We train our PDE foundation model with up to 0.5B parameters on 10+ PDE datasets with more than 100k trajectories.
arXiv Detail & Related papers (2024-03-06T08:38:34Z) - Model Compression Method for S4 with Diagonal State Space Layers using Balanced Truncation [0.0]
We propose to use the balanced truncation, a prevalent model reduction technique in control theory, applied specifically to DSS layers in pre-trained S4 model as a novel model compression method.
Numerical experiments demonstrate that our trained models combined with the balanced truncation surpass conventionally trained models with Skew-HiPPO.
arXiv Detail & Related papers (2024-02-25T05:22:45Z) - Learning Space-Time Continuous Neural PDEs from Partially Observed
States [13.01244901400942]
We introduce a grid-independent model learning partial differential equations (PDEs) from noisy and partial observations on irregular grids.
We propose a space-time continuous latent neural PDE model with an efficient probabilistic framework and a novel design encoder for improved data efficiency and grid independence.
arXiv Detail & Related papers (2023-07-09T06:53:59Z) - Delay-SDE-net: A deep learning approach for time series modelling with
memory and uncertainty estimates [0.0]
This paper presents the Delay-SDE-net, a neural network model based on delay differential equations (SDDEs)
The use of SDDEs with multiple delays as modelling makes it a suitable model for time series with memory effects.
We derive the theoretical error of the Delay-SDE-net and analyze the convergence rate numerically.
arXiv Detail & Related papers (2023-03-14T14:31:38Z) - Deep Latent State Space Models for Time-Series Generation [68.45746489575032]
We propose LS4, a generative model for sequences with latent variables evolving according to a state space ODE.
Inspired by recent deep state space models (S4), we achieve speedups by leveraging a convolutional representation of LS4.
We show that LS4 significantly outperforms previous continuous-time generative models in terms of marginal distribution, classification, and prediction scores on real-world datasets.
arXiv Detail & Related papers (2022-12-24T15:17:42Z) - STAR: Sparse Transformer-based Action Recognition [61.490243467748314]
This work proposes a novel skeleton-based human action recognition model with sparse attention on the spatial dimension and segmented linear attention on the temporal dimension of data.
Experiments show that our model can achieve comparable performance while utilizing much less trainable parameters and achieve high speed in training and inference.
arXiv Detail & Related papers (2021-07-15T02:53:11Z) - Closed-form Continuous-Depth Models [99.40335716948101]
Continuous-depth neural models rely on advanced numerical differential equation solvers.
We present a new family of models, termed Closed-form Continuous-depth (CfC) networks, that are simple to describe and at least one order of magnitude faster.
arXiv Detail & Related papers (2021-06-25T22:08:51Z) - GraphTCN: Spatio-Temporal Interaction Modeling for Human Trajectory
Prediction [5.346782918364054]
We propose a novel CNN-based spatial-temporal graph framework GraphCNT to support more efficient and accurate trajectory predictions.
In contrast to conventional models, both the spatial and temporal modeling of our model are computed within each local time window.
Our model achieves better performance in terms of both efficiency and accuracy as compared with state-of-the-art models on various trajectory prediction benchmark datasets.
arXiv Detail & Related papers (2020-03-16T12:56:12Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.