Regression modelling of spatiotemporal extreme U.S. wildfires via
partially-interpretable neural networks
- URL: http://arxiv.org/abs/2208.07581v4
- Date: Thu, 7 Mar 2024 17:23:43 GMT
- Title: Regression modelling of spatiotemporal extreme U.S. wildfires via
partially-interpretable neural networks
- Authors: Jordan Richards and Rapha\"el Huser
- Abstract summary: We propose a new methodological framework for performing extreme quantile regression using artificial neutral networks.
We unify linear, and additive, regression methodology with deep learning to create partially-interpretable neural networks.
- Score: 0.0
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Risk management in many environmental settings requires an understanding of
the mechanisms that drive extreme events. Useful metrics for quantifying such
risk are extreme quantiles of response variables conditioned on predictor
variables that describe, e.g., climate, biosphere and environmental states.
Typically these quantiles lie outside the range of observable data and so, for
estimation, require specification of parametric extreme value models within a
regression framework. Classical approaches in this context utilise linear or
additive relationships between predictor and response variables and suffer in
either their predictive capabilities or computational efficiency; moreover,
their simplicity is unlikely to capture the truly complex structures that lead
to the creation of extreme wildfires. In this paper, we propose a new
methodological framework for performing extreme quantile regression using
artificial neutral networks, which are able to capture complex non-linear
relationships and scale well to high-dimensional data. The "black box" nature
of neural networks means that they lack the desirable trait of interpretability
often favoured by practitioners; thus, we unify linear, and additive,
regression methodology with deep learning to create partially-interpretable
neural networks that can be used for statistical inference but retain high
prediction accuracy. To complement this methodology, we further propose a novel
point process model for extreme values which overcomes the finite
lower-endpoint problem associated with the generalised extreme value class of
distributions. Efficacy of our unified framework is illustrated on U.S.
wildfire data with a high-dimensional predictor set and we illustrate vast
improvements in predictive performance over linear and spline-based regression
techniques.
Related papers
- Variational Bayesian Bow tie Neural Networks with Shrinkage [0.276240219662896]
We build a relaxed version of the standard feed-forward rectified neural network.
We employ Polya-Gamma data augmentation tricks to render a conditionally linear and Gaussian model.
We derive a variational inference algorithm that avoids distributional assumptions and independence across layers.
arXiv Detail & Related papers (2024-11-17T17:36:30Z) - Structured Radial Basis Function Network: Modelling Diversity for
Multiple Hypotheses Prediction [51.82628081279621]
Multi-modal regression is important in forecasting nonstationary processes or with a complex mixture of distributions.
A Structured Radial Basis Function Network is presented as an ensemble of multiple hypotheses predictors for regression problems.
It is proved that this structured model can efficiently interpolate this tessellation and approximate the multiple hypotheses target distribution.
arXiv Detail & Related papers (2023-09-02T01:27:53Z) - Advancing Counterfactual Inference through Nonlinear Quantile Regression [77.28323341329461]
We propose a framework for efficient and effective counterfactual inference implemented with neural networks.
The proposed approach enhances the capacity to generalize estimated counterfactual outcomes to unseen data.
Empirical results conducted on multiple datasets offer compelling support for our theoretical assertions.
arXiv Detail & Related papers (2023-06-09T08:30:51Z) - Adaptive Conditional Quantile Neural Processes [9.066817971329899]
Conditional Quantile Neural Processes (CQNPs) are a new member of the neural processes family.
We introduce an extension of quantile regression where the model learns to focus on estimating informative quantiles.
Experiments with real and synthetic datasets demonstrate substantial improvements in predictive performance.
arXiv Detail & Related papers (2023-05-30T06:19:19Z) - Neural Networks for Extreme Quantile Regression with an Application to Forecasting of Flood Risk [0.0]
We propose the EQRN model that combines tools from neural networks and extreme value theory.
We apply this method to forecast flood risk in the Swiss Aare catchment.
arXiv Detail & Related papers (2022-08-16T08:02:49Z) - DeepBayes -- an estimator for parameter estimation in stochastic
nonlinear dynamical models [11.917949887615567]
We propose DeepBayes estimators that leverage the power of deep recurrent neural networks in learning an estimator.
The deep recurrent neural network architectures can be trained offline and ensure significant time savings during inference.
We demonstrate the applicability of our proposed method on different example models and perform detailed comparisons with state-of-the-art approaches.
arXiv Detail & Related papers (2022-05-04T18:12:17Z) - Probabilistic AutoRegressive Neural Networks for Accurate Long-range
Forecasting [6.295157260756792]
We introduce the Probabilistic AutoRegressive Neural Networks (PARNN)
PARNN is capable of handling complex time series data exhibiting non-stationarity, nonlinearity, non-seasonality, long-range dependence, and chaotic patterns.
We evaluate the performance of PARNN against standard statistical, machine learning, and deep learning models, including Transformers, NBeats, and DeepAR.
arXiv Detail & Related papers (2022-04-01T17:57:36Z) - The Interplay Between Implicit Bias and Benign Overfitting in Two-Layer
Linear Networks [51.1848572349154]
neural network models that perfectly fit noisy data can generalize well to unseen test data.
We consider interpolating two-layer linear neural networks trained with gradient flow on the squared loss and derive bounds on the excess risk.
arXiv Detail & Related papers (2021-08-25T22:01:01Z) - Interpretable Social Anchors for Human Trajectory Forecasting in Crowds [84.20437268671733]
We propose a neural network-based system to predict human trajectory in crowds.
We learn interpretable rule-based intents, and then utilise the expressibility of neural networks to model scene-specific residual.
Our architecture is tested on the interaction-centric benchmark TrajNet++.
arXiv Detail & Related papers (2021-05-07T09:22:34Z) - Firearm Detection via Convolutional Neural Networks: Comparing a
Semantic Segmentation Model Against End-to-End Solutions [68.8204255655161]
Threat detection of weapons and aggressive behavior from live video can be used for rapid detection and prevention of potentially deadly incidents.
One way for achieving this is through the use of artificial intelligence and, in particular, machine learning for image analysis.
We compare a traditional monolithic end-to-end deep learning model and a previously proposed model based on an ensemble of simpler neural networks detecting fire-weapons via semantic segmentation.
arXiv Detail & Related papers (2020-12-17T15:19:29Z) - Unlabelled Data Improves Bayesian Uncertainty Calibration under
Covariate Shift [100.52588638477862]
We develop an approximate Bayesian inference scheme based on posterior regularisation.
We demonstrate the utility of our method in the context of transferring prognostic models of prostate cancer across globally diverse populations.
arXiv Detail & Related papers (2020-06-26T13:50:19Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.