Interpretable Water Level Forecaster with Spatiotemporal Causal
Attention Mechanisms
- URL: http://arxiv.org/abs/2303.00515v6
- Date: Thu, 29 Jun 2023 06:00:27 GMT
- Title: Interpretable Water Level Forecaster with Spatiotemporal Causal
Attention Mechanisms
- Authors: Sunghcul Hong, Yunjin Choi and Jong-June Jeon
- Abstract summary: This work proposes a neuraltemporal model with a transformer exploiting a causal relationship based on prior knowledge.
We use the Han River dataset from 2016 to compare 2021, and confirm that our model provides an interpretable and consistent model with prior knowledge.
- Score: 0.0
- License: http://creativecommons.org/licenses/by-nc-sa/4.0/
- Abstract: Forecasting the water level of the Han River is essential to control traffic
and avoid natural disasters. The stream flow of the Han River is affected by
various and intricately connected factors. Thus, a simple forecasting machine
frequently fails to capture its serial pattern. On the other hand, a complex
predictive model loses the interpretability of the model output. This work
proposes a neural network model with a novel transformer exploiting a causal
relationship based on prior knowledge. The transformer consists of
spatiotemporal attention weight that describes the spatial and temporal
causation with multilayer networks with masking. Our model has two
distinguished advantages against the existing spatiotemporal forecasting
models. First, the model allows the heterogeneous predictors for each site such
that a flexible regression is applicable to the causal network. Next, the model
is adapted to partially identified causal structures. As a result, we have
relaxed the constraints of the applicable causal network through our model. In
real data analysis, we use the Han River dataset from 2016 to 2021, compare the
proposed model with deep learning models, and confirm that our model provides
an interpretable and consistent model with prior knowledge, such as a
seasonality arising from the tidal force. Furthermore, in prediction
performance, our model is better than or competitive with the state-of-the-art
models.
Related papers
- Power Failure Cascade Prediction using Graph Neural Networks [4.667031410586657]
We propose a flow-free model that predicts grid states at every generation of a cascade process given an initial contingency and power injection values.
We show that the proposed model reduces the computational time by almost two orders of magnitude.
arXiv Detail & Related papers (2024-04-24T18:45:50Z) - Graph neural network-based surrogate modelling for real-time hydraulic prediction of urban drainage networks [1.8073031015436376]
Physics-based models are computationally time-consuming and infeasible for real-time scenarios of urban drainage networks.
Fully-connected neural networks (NNs) are potential surrogate models, but may suffer from low interpretability and efficiency in fitting complex targets.
This work proposes a GNN-based surrogate of the flow routing model for the hydraulic prediction problem of drainage networks.
arXiv Detail & Related papers (2024-04-16T07:08:04Z) - Improving Adversarial Transferability via Model Alignment [25.43899674478279]
We introduce a novel model alignment technique aimed at improving a given source model's ability in generating transferable adversarial perturbations.
Experiments on the ImageNet dataset, using a variety of model architectures, demonstrate that perturbations generated from aligned source models exhibit significantly higher transferability.
arXiv Detail & Related papers (2023-11-30T12:15:49Z) - Mining Causality from Continuous-time Dynamics Models: An Application to
Tsunami Forecasting [22.434845478979604]
We propose a mechanism for mining causal structures from continuous-time models.
We train models to capture the causal structure by enforcing sparsity in the weights of the input layers of the dynamics models.
We apply our method to a real-world problem, namely tsunami forecasting, where the exact causal-structures are difficult to characterize.
arXiv Detail & Related papers (2022-10-10T18:53:13Z) - On the Generalization and Adaption Performance of Causal Models [99.64022680811281]
Differentiable causal discovery has proposed to factorize the data generating process into a set of modules.
We study the generalization and adaption performance of such modular neural causal models.
Our analysis shows that the modular neural causal models outperform other models on both zero and few-shot adaptation in low data regimes.
arXiv Detail & Related papers (2022-06-09T17:12:32Z) - An advanced spatio-temporal convolutional recurrent neural network for
storm surge predictions [73.4962254843935]
We study the capability of artificial neural network models to emulate storm surge based on the storm track/size/intensity history.
This study presents a neural network model that can predict storm surge, informed by a database of synthetic storm simulations.
arXiv Detail & Related papers (2022-04-18T23:42:18Z) - Churn Reduction via Distillation [54.5952282395487]
We show an equivalence between training with distillation using the base model as the teacher and training with an explicit constraint on the predictive churn.
We then show that distillation performs strongly for low churn training against a number of recent baselines.
arXiv Detail & Related papers (2021-06-04T18:03:31Z) - Anomaly Detection of Time Series with Smoothness-Inducing Sequential
Variational Auto-Encoder [59.69303945834122]
We present a Smoothness-Inducing Sequential Variational Auto-Encoder (SISVAE) model for robust estimation and anomaly detection of time series.
Our model parameterizes mean and variance for each time-stamp with flexible neural networks.
We show the effectiveness of our model on both synthetic datasets and public real-world benchmarks.
arXiv Detail & Related papers (2021-02-02T06:15:15Z) - Firearm Detection via Convolutional Neural Networks: Comparing a
Semantic Segmentation Model Against End-to-End Solutions [68.8204255655161]
Threat detection of weapons and aggressive behavior from live video can be used for rapid detection and prevention of potentially deadly incidents.
One way for achieving this is through the use of artificial intelligence and, in particular, machine learning for image analysis.
We compare a traditional monolithic end-to-end deep learning model and a previously proposed model based on an ensemble of simpler neural networks detecting fire-weapons via semantic segmentation.
arXiv Detail & Related papers (2020-12-17T15:19:29Z) - Generative Temporal Difference Learning for Infinite-Horizon Prediction [101.59882753763888]
We introduce the $gamma$-model, a predictive model of environment dynamics with an infinite probabilistic horizon.
We discuss how its training reflects an inescapable tradeoff between training-time and testing-time compounding errors.
arXiv Detail & Related papers (2020-10-27T17:54:12Z) - Causal Inference with Deep Causal Graphs [0.0]
Parametric causal modelling techniques rarely provide functionality for counterfactual estimation.
Deep Causal Graphs is an abstract specification of the required functionality for a neural network to model causal distributions.
We demonstrate its expressive power in modelling complex interactions and showcase applications to machine learning explainability and fairness.
arXiv Detail & Related papers (2020-06-15T13:03:33Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.