Weather2vec: Representation Learning for Causal Inference with Non-Local
Confounding in Air Pollution and Climate Studies
- URL: http://arxiv.org/abs/2209.12316v1
- Date: Sun, 25 Sep 2022 20:40:19 GMT
- Title: Weather2vec: Representation Learning for Causal Inference with Non-Local
Confounding in Air Pollution and Climate Studies
- Authors: Mauricio Tec, James Scott, Corwin Zigler
- Abstract summary: Estimating the causal effects of a spatially-varying intervention may be subject to non-local confounding (NLC)
This paper first formalizes NLC using the potential outcomes framework, providing a comparison with the related phenomenon of causal interference.
Then, it proposes a broadly applicable framework, termed "weather2vec", that uses the theory of balancing scores to learn representations of non-local information.
- Score: 3.0616624345970975
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Estimating the causal effects of a spatially-varying intervention on a
spatially-varying outcome may be subject to non-local confounding (NLC), a
phenomenon that can bias estimates when the treatments and outcomes of a given
unit are dictated in part by the covariates of other nearby units. In
particular, NLC is a challenge for evaluating the effects of environmental
policies and climate events on health-related outcomes such as air pollution
exposure. This paper first formalizes NLC using the potential outcomes
framework, providing a comparison with the related phenomenon of causal
interference. Then, it proposes a broadly applicable framework, termed
"weather2vec", that uses the theory of balancing scores to learn
representations of non-local information into a scalar or vector defined for
each observational unit, which is subsequently used to adjust for confounding
in conjunction with causal inference methods. The framework is evaluated in a
simulation study and two case studies on air pollution where the weather is an
(inherently regional) known confounder.
Related papers
- Spatio-temporal Multivariate Cluster Evolution Analysis for Detecting and Tracking Climate Impacts [0.0]
This paper presents a novel and efficient unsupervised data-driven approach for detecting statistically-significant impacts.
We demonstrate that the proposed approach is capable of detecting known post-eruption impacts/events.
We additionally describe a methodology for extracting meaningful sequences of post-eruption impacts/events by using NLP.
arXiv Detail & Related papers (2024-10-21T22:13:09Z) - Causal Representation Learning in Temporal Data via Single-Parent Decoding [66.34294989334728]
Scientific research often seeks to understand the causal structure underlying high-level variables in a system.
Scientists typically collect low-level measurements, such as geographically distributed temperature readings.
We propose a differentiable method, Causal Discovery with Single-parent Decoding, that simultaneously learns the underlying latents and a causal graph over them.
arXiv Detail & Related papers (2024-10-09T15:57:50Z) - Variable importance measure for spatial machine learning models with application to air pollution exposure prediction [2.633085745593072]
The objective is to predict air pollution exposures for study subjects at locations without data in order to optimize our ability to learn about health effects of air pollution.
We tackle these challenges in two datasets: sulfur (S) from regulatory United States national PM2.5 sub-species data and ultrafine particles (UFP) from a new Seattle-area traffic-related air pollution dataset.
Our key contribution is a leave-one-out approach for variable importance that leads to interpretable and comparable measures for a broad class of models.
arXiv Detail & Related papers (2024-06-04T05:51:36Z) - Smoke and Mirrors in Causal Downstream Tasks [59.90654397037007]
This paper looks at the causal inference task of treatment effect estimation, where the outcome of interest is recorded in high-dimensional observations.
We compare 6 480 models fine-tuned from state-of-the-art visual backbones, and find that the sampling and modeling choices significantly affect the accuracy of the causal estimate.
Our results suggest that future benchmarks should carefully consider real downstream scientific questions, especially causal ones.
arXiv Detail & Related papers (2024-05-27T13:26:34Z) - Estimating Direct and Indirect Causal Effects of Spatiotemporal Interventions in Presence of Spatial Interference [0.46052594866569146]
We first extend the concept of spatial interference in case of time-varying treatment outcomes by extending the potential outcome framework under the assumption of no unmeasured confounding.
We then propose our deep learning based potential outcome model fortemporal causal inference.
We utilize latent factor modeling to reduce interference due to time-varying confounding while leveraging the power of U-Net architecture to capture global spatial interference in data over time.
arXiv Detail & Related papers (2024-05-13T20:39:27Z) - Robust detection and attribution of climate change under interventions [4.344839102717429]
Fingerprints are key tools in climate change detection and attribution (D&A)
We propose a direct D&A approach based on supervised learning to extract fingerprints that lead to robust predictions.
Our study shows that incorporating robustness constraints against relevant interventions may significantly benefit detection and attribution of climate change.
arXiv Detail & Related papers (2022-12-09T15:13:40Z) - Neighborhood Adaptive Estimators for Causal Inference under Network
Interference [152.4519491244279]
We consider the violation of the classical no-interference assumption, meaning that the treatment of one individuals might affect the outcomes of another.
To make interference tractable, we consider a known network that describes how interference may travel.
We study estimators for the average direct treatment effect on the treated in such a setting.
arXiv Detail & Related papers (2022-12-07T14:53:47Z) - Multi-scale Digital Twin: Developing a fast and physics-informed
surrogate model for groundwater contamination with uncertain climate models [53.44486283038738]
Climate change exacerbates the long-term soil management problem of groundwater contamination.
We develop a physics-informed machine learning surrogate model using U-Net enhanced Fourier Neural Contaminated (PDENO)
In parallel, we develop a convolutional autoencoder combined with climate data to reduce the dimensionality of climatic region similarities across the United States.
arXiv Detail & Related papers (2022-11-20T06:46:35Z) - Spatiotemporal modeling of European paleoclimate using doubly sparse
Gaussian processes [61.31361524229248]
We build on recent scale sparsetemporal GPs to reduce the computational burden.
We successfully employ such a doubly sparse GP to construct a probabilistic model of paleoclimate.
arXiv Detail & Related papers (2022-11-15T14:15:04Z) - Using an expert deviation carrying the knowledge of climate data in
usual clustering algorithms [0.0]
We identify an algorithm-temporal using clustering analysis on wind speed and cumulative rainfall datasets.
We show that using the L2 norm in conventional clustering methods can induce undesirable effects.
We propose to replace Euclidean distanceL (2) by a dissimilarity measure named Expert Cluster Deviation (ED)
arXiv Detail & Related papers (2020-06-10T01:42:40Z) - A Survey on Causal Inference [64.45536158710014]
Causal inference is a critical research topic across many domains, such as statistics, computer science, education, public policy and economics.
Various causal effect estimation methods for observational data have sprung up.
arXiv Detail & Related papers (2020-02-05T21:35:29Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.