Feature Importance in a Deep Learning Climate Emulator
- URL: http://arxiv.org/abs/2108.13203v1
- Date: Fri, 27 Aug 2021 13:36:42 GMT
- Title: Feature Importance in a Deep Learning Climate Emulator
- Authors: Wei Xu, Xihaier Luo, Yihui Ren, Ji Hwan Park, Shinjae Yoo,
Balasubramanya T. Nadiga
- Abstract summary: We present a study using a class of post-hoc local explanation methods i.e., feature importance methods for "understanding" a deep learning (DL) emulator of climate.
We consider a multiple-input-output variations emulator that uses a encoderNet-decoder architecture and is trained to predict interannual variations of sea surface temperature (SST) at 1, 6, and 9 month lead times.
- Score: 10.48891954541828
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: We present a study using a class of post-hoc local explanation methods i.e.,
feature importance methods for "understanding" a deep learning (DL) emulator of
climate. Specifically, we consider a multiple-input-single-output emulator that
uses a DenseNet encoder-decoder architecture and is trained to predict
interannual variations of sea surface temperature (SST) at 1, 6, and 9 month
lead times using the preceding 36 months of (appropriately filtered) SST data.
First, feature importance methods are employed for individual predictions to
spatio-temporally identify input features that are important for model
prediction at chosen geographical regions and chosen prediction lead times. In
a second step, we also examine the behavior of feature importance in a
generalized sense by considering an aggregation of the importance heatmaps over
training samples. We find that: 1) the climate emulator's prediction at any
geographical location depends dominantly on a small neighborhood around it; 2)
the longer the prediction lead time, the further back the "importance" extends;
and 3) to leading order, the temporal decay of "importance" is independent of
geographical location. An ablation experiment is adopted to verify the
findings. From the perspective of climate dynamics, these findings suggest a
dominant role for local processes and a negligible role for remote
teleconnections at the spatial and temporal scales we consider. From the
perspective of network architecture, the spatio-temporal relations between the
inputs and outputs we find suggest potential model refinements. We discuss
further extensions of our methods, some of which we are considering in ongoing
work.
Related papers
- Causal Representation Learning in Temporal Data via Single-Parent Decoding [66.34294989334728]
Scientific research often seeks to understand the causal structure underlying high-level variables in a system.
Scientists typically collect low-level measurements, such as geographically distributed temperature readings.
We propose a differentiable method, Causal Discovery with Single-parent Decoding, that simultaneously learns the underlying latents and a causal graph over them.
arXiv Detail & Related papers (2024-10-09T15:57:50Z) - Generating Fine-Grained Causality in Climate Time Series Data for Forecasting and Anomaly Detection [67.40407388422514]
We design a conceptual fine-grained causal model named TBN Granger Causality.
Second, we propose an end-to-end deep generative model called TacSas, which discovers TBN Granger Causality in a generative manner.
We test TacSas on climate benchmark ERA5 for climate forecasting and the extreme weather benchmark of NOAA for extreme weather alerts.
arXiv Detail & Related papers (2024-08-08T06:47:21Z) - Predicting unobserved climate time series data at distant areas via spatial correlation using reservoir computing [5.008022566317949]
This study focuses on a prediction of climatic elements, specifically near-surface temperature and pressure, at a target location apart from a data observation point.
Our results show that the accuracy of the predictions degrades with the distance between the observation and target locations.
arXiv Detail & Related papers (2024-06-05T08:39:10Z) - Spatial-temporal Forecasting for Regions without Observations [13.805203053973772]
We study spatial-temporal forecasting for a region of interest without any historical observations.
We propose a model named STSM for the task.
Our key insight is to learn from the locations that resemble those in the region of interest.
arXiv Detail & Related papers (2024-01-19T06:26:05Z) - Parallel Reasoning Network for Human-Object Interaction Detection [53.422076419484945]
We propose a new transformer-based method named Parallel Reasoning Network(PR-Net)
PR-Net constructs two independent predictors for instance-level localization and relation-level understanding.
Our PR-Net has achieved competitive results on HICO-DET and V-COCO benchmarks.
arXiv Detail & Related papers (2023-01-09T17:00:34Z) - TempSAL -- Uncovering Temporal Information for Deep Saliency Prediction [64.63645677568384]
We introduce a novel saliency prediction model that learns to output saliency maps in sequential time intervals.
Our approach locally modulates the saliency predictions by combining the learned temporal maps.
Our code will be publicly available on GitHub.
arXiv Detail & Related papers (2023-01-05T22:10:16Z) - Joint Forecasting of Panoptic Segmentations with Difference Attention [72.03470153917189]
We study a new panoptic segmentation forecasting model that jointly forecasts all object instances in a scene.
We evaluate the proposed model on the Cityscapes and AIODrive datasets.
arXiv Detail & Related papers (2022-04-14T17:59:32Z) - Building Autocorrelation-Aware Representations for Fine-Scale
Spatiotemporal Prediction [1.2862507359003323]
We present a novel deep learning architecture that incorporates theories of spatial statistics into neural networks.
DeepLATTE contains an autocorrelation-guided semi-supervised learning strategy to enforce both local autocorrelation patterns and global autocorrelation trends.
We conduct a demonstration of DeepLATTE using publicly available data for an important public health topic, air quality prediction in a well-fitting, complex physical environment.
arXiv Detail & Related papers (2021-12-10T03:21:19Z) - Latent State Inference in a Spatiotemporal Generative Model [3.7525506486107267]
We focus on temperature and weather processes, including wave propagation dynamics, for which we assume that universal causes apply throughout space and time.
A recently introduced DIsed Stemporal graph graph artificial Neural Architecture (DISTANA) is used and enhanced to learn such processes.
We show that DISTANA, when combined with a retrospective latent state inference principle called active tuning, can reliably derive location-respective hidden causal factors.
arXiv Detail & Related papers (2020-09-21T12:59:40Z) - Evaluation of Local Explanation Methods for Multivariate Time Series
Forecasting [0.21094707683348418]
Local interpretability is important in determining why a model makes particular predictions.
Despite the recent focus on AI interpretability, there has been a lack of research in local interpretability methods for time series forecasting.
arXiv Detail & Related papers (2020-09-18T21:15:28Z) - Value-driven Hindsight Modelling [68.658900923595]
Value estimation is a critical component of the reinforcement learning (RL) paradigm.
Model learning can make use of the rich transition structure present in sequences of observations, but this approach is usually not sensitive to the reward function.
We develop an approach for representation learning in RL that sits in between these two extremes.
This provides tractable prediction targets that are directly relevant for a task, and can thus accelerate learning the value function.
arXiv Detail & Related papers (2020-02-19T18:10:20Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.