Regional climate risk assessment from climate models using probabilistic machine learning
- URL: http://arxiv.org/abs/2412.08079v2
- Date: Mon, 16 Jun 2025 21:56:55 GMT
- Title: Regional climate risk assessment from climate models using probabilistic machine learning
- Authors: Zhong Yi Wan, Ignacio Lopez-Gomez, Robert Carver, Tapio Schneider, John Anderson, Fei Sha, Leonardo Zepeda-Núñez,
- Abstract summary: GenFocal is a general-purpose, end-to-end generative framework for complex climate processes interacting at finetemporal scales.<n>It more accurately assesses extreme risk in the current climate than leading approaches.<n>GenFocal shows compelling results consistent with the literature on projecting climate impact on decadal timescales.
- Score: 12.737495484442443
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Accurate, actionable climate information at km scales is crucial for robust natural hazard risk assessment and infrastructure planning. Simulating climate at these resolutions remains intractable, forcing reliance on downscaling: either physics-based or statistical methods that transform climate simulations from coarse to impact-relevant resolutions. One major challenge for downscaling is to comprehensively capture the interdependency among climate processes of interest, a prerequisite for representing climate hazards. However, current approaches either lack the desired scalability or are bespoke to specific types of hazards. We introduce GenFocal, a computationally efficient, general-purpose, end-to-end generative framework that gives rise to full probabilistic characterizations of complex climate processes interacting at fine spatiotemporal scales. GenFocal more accurately assesses extreme risk in the current climate than leading approaches, including one used in the US 5th National Climate Assessment. It produces plausible tracks of tropical cyclones, providing accurate statistics of their genesis and evolution, even when they are absent from the corresponding climate simulations. GenFocal also shows compelling results that are consistent with the literature on projecting climate impact on decadal timescales. GenFocal revolutionizes how climate simulations can be efficiently augmented with observations and harnessed to enable future climate impact assessments at the spatiotemporal scales relevant to local and regional communities. We believe this work establishes genAI as an effective paradigm for modeling complex, high-dimensional multivariate statistical correlations that have deterred precise quantification of climate risks associated with hazards such as wildfires, extreme heat, tropical cyclones, and flooding; thereby enabling the evaluation of adaptation strategies.
Related papers
- ClimateBench-M: A Multi-Modal Climate Data Benchmark with a Simple Generative Method [61.76389719956301]
We contribute a multi-modal climate benchmark, i.e., ClimateBench-M, which aligns time series climate data from ERA5, extreme weather events data from NOAA, and satellite image data from NASA.<n>Under each data modality, we also propose a simple but strong generative method that could produce competitive performance in weather forecasting, thunderstorm alerts, and crop segmentation tasks.
arXiv Detail & Related papers (2025-04-10T02:22:23Z) - Fast Solvers for Discrete Diffusion Models: Theory and Applications of High-Order Algorithms [31.42317398879432]
Current inference approaches mainly fall into two categories: exact simulation and approximate methods such as $tau$-leaping.
In this work, we advance the latter category by tailoring the first extension of high-order numerical inference schemes to discrete diffusion models.
We rigorously analyze the proposed schemes and establish the second-order accuracy of the $theta$-trapezoidal method in KL divergence.
arXiv Detail & Related papers (2025-02-01T00:25:21Z) - A Generative Framework for Probabilistic, Spatiotemporally Coherent Downscaling of Climate Simulation [23.504915709396204]
We present a novel generative framework that uses a score-based diffusion model trained on high-resolution reanalysis data to capture the statistical properties of local weather dynamics.<n>We demonstrate that the model generates spatially and temporally coherent weather dynamics that align with global climate output.
arXiv Detail & Related papers (2024-12-19T19:47:35Z) - Causal Invariance Learning via Efficient Optimization of a Nonconvex Objective [12.423111378195667]
We propose a novel method for finding causal relationships among variables.
We show that our method converges to a standard gradient method.
Our algorithm avoids exhaustive searches, making it especially when the number of covariants is large.
arXiv Detail & Related papers (2024-12-16T15:11:02Z) - On conditional diffusion models for PDE simulations [53.01911265639582]
We study score-based diffusion models for forecasting and assimilation of sparse observations.
We propose an autoregressive sampling approach that significantly improves performance in forecasting.
We also propose a new training strategy for conditional score-based models that achieves stable performance over a range of history lengths.
arXiv Detail & Related papers (2024-10-21T18:31:04Z) - Causal Deciphering and Inpainting in Spatio-Temporal Dynamics via Diffusion Model [45.45700202300292]
CaPaint aims to identify causal regions in data and endow model with causal reasoning ability in a two-stage process.
By using a fine-tuned unconditional Diffusion Probabilistic Model (DDPM) as the generative prior, we in-fill the masks defined as environmental parts.
Experiments conducted on five real-world ST benchmarks demonstrate that integrating the CaPaint concept allows models to achieve improvements ranging from 4.3% to 77.3%.
arXiv Detail & Related papers (2024-09-29T08:18:50Z) - Efficient Localized Adaptation of Neural Weather Forecasting: A Case Study in the MENA Region [62.09891513612252]
We focus on limited-area modeling and train our model specifically for localized region-level downstream tasks.
We consider the MENA region due to its unique climatic challenges, where accurate localized weather forecasting is crucial for managing water resources, agriculture and mitigating the impacts of extreme weather events.
Our study aims to validate the effectiveness of integrating parameter-efficient fine-tuning (PEFT) methodologies, specifically Low-Rank Adaptation (LoRA) and its variants, to enhance forecast accuracy, as well as training speed, computational resource utilization, and memory efficiency in weather and climate modeling for specific regions.
arXiv Detail & Related papers (2024-09-11T19:31:56Z) - Capturing Climatic Variability: Using Deep Learning for Stochastic Downscaling [0.0]
Adapting to the changing climate requires accurate local climate information.
Capturing variability while downscaling is crucial for estimating uncertainty and characterising extreme events.
We propose approaches to improve the calibration of GANs in three ways.
arXiv Detail & Related papers (2024-05-31T03:04:10Z) - Comparing Data-Driven and Mechanistic Models for Predicting Phenology in
Deciduous Broadleaf Forests [47.285748922842444]
We train a deep neural network to predict a phenological index from meteorological time series.
We find that this approach outperforms traditional process-based models.
arXiv Detail & Related papers (2024-01-08T15:29:23Z) - Precipitation Downscaling with Spatiotemporal Video Diffusion [19.004369237435437]
This work extends recent video diffusion models to precipitation super-resolution.
We use a deterministic downscaler followed by a temporally-conditioned diffusion model to capture noise characteristics and high-frequency patterns.
Our analysis, capturing CRPS, MSE, precipitation distributions, and qualitative aspects using California and the Himalayas, establishes our method as a new standard for data-driven precipitation downscaling.
arXiv Detail & Related papers (2023-12-11T02:38:07Z) - Combining deep generative models with extreme value theory for synthetic
hazard simulation: a multivariate and spatially coherent approach [0.0]
Generative adversarial networks (GANs) are well-suited to such a problem due to their ability to implicitly learn the distribution of data in high-dimensional settings.
We employ a GAN to model the dependence structure for daily maximum wind speed, significant wave height, and total precipitation over the Bay of Bengal.
Once trained, the model can be used to efficiently generate thousands of realistic compound hazard events.
arXiv Detail & Related papers (2023-11-30T12:55:51Z) - CMIP X-MOS: Improving Climate Models with Extreme Model Output
Statistics [40.517778024431244]
We introduce Extreme Model Output Statistics (X-MOS) to improve predictions of natural disaster risks.
This approach utilizes deep regression techniques to precisely map CMIP model outputs to real measurements obtained from weather stations.
In contrast to previous research, our study places a strong emphasis on enhancing the estimation of the tails of future climate parameter distributions.
arXiv Detail & Related papers (2023-10-24T13:18:53Z) - Quantifying predictive uncertainty of aphasia severity in stroke patients with sparse heteroscedastic Bayesian high-dimensional regression [47.1405366895538]
Sparse linear regression methods for high-dimensional data commonly assume that residuals have constant variance, which can be violated in practice.
This paper proposes estimating high-dimensional heteroscedastic linear regression models using a heteroscedastic partitioned empirical Bayes Expectation Conditional Maximization algorithm.
arXiv Detail & Related papers (2023-09-15T22:06:29Z) - Debias Coarsely, Sample Conditionally: Statistical Downscaling through
Optimal Transport and Probabilistic Diffusion Models [15.623456909553786]
We introduce a two-stage probabilistic framework for statistical downscaling using unpaired data.
We demonstrate the utility of the proposed approach on one- and two-dimensional fluid flow problems.
arXiv Detail & Related papers (2023-05-24T23:40:23Z) - ClimaX: A foundation model for weather and climate [51.208269971019504]
ClimaX is a deep learning model for weather and climate science.
It can be pre-trained with a self-supervised learning objective on climate datasets.
It can be fine-tuned to address a breadth of climate and weather tasks.
arXiv Detail & Related papers (2023-01-24T23:19:01Z) - Multi-scale Digital Twin: Developing a fast and physics-informed
surrogate model for groundwater contamination with uncertain climate models [53.44486283038738]
Climate change exacerbates the long-term soil management problem of groundwater contamination.
We develop a physics-informed machine learning surrogate model using U-Net enhanced Fourier Neural Contaminated (PDENO)
In parallel, we develop a convolutional autoencoder combined with climate data to reduce the dimensionality of climatic region similarities across the United States.
arXiv Detail & Related papers (2022-11-20T06:46:35Z) - Distributionally Robust Models with Parametric Likelihood Ratios [123.05074253513935]
Three simple ideas allow us to train models with DRO using a broader class of parametric likelihood ratios.
We find that models trained with the resulting parametric adversaries are consistently more robust to subpopulation shifts when compared to other DRO approaches.
arXiv Detail & Related papers (2022-04-13T12:43:12Z) - Increasing the accuracy and resolution of precipitation forecasts using
deep generative models [3.8073142980733]
We train a conditional Generative Adversarial Network -- coined CorrectorGAN -- to produce ensembles of high-resolution, bias-corrected forecasts.
CorrectorGAN, once trained, produces predictions in seconds on a single machine.
Results raise exciting questions about the necessity of regional models, and whether data-driven downscaling and correction methods can be transferred to data-poor regions.
arXiv Detail & Related papers (2022-03-23T09:45:12Z) - Analysis of the Evolution of Parametric Drivers of High-End Sea-Level
Hazards [0.0]
We use random forests to examine the parametric drivers of future climate risk and how the relative importances of those drivers change over time.
We find that the equilibrium climate sensitivity and a factor that scales the effect of aerosols on radiative forcing are consistently the most important climate model parametric uncertainties throughout the 2020 to 2150 interval for both low and high radiative forcing scenarios.
arXiv Detail & Related papers (2021-06-11T01:50:16Z) - A Twin Neural Model for Uplift [59.38563723706796]
Uplift is a particular case of conditional treatment effect modeling.
We propose a new loss function defined by leveraging a connection with the Bayesian interpretation of the relative risk.
We show our proposed method is competitive with the state-of-the-art in simulation setting and on real data from large scale randomized experiments.
arXiv Detail & Related papers (2021-05-11T16:02:39Z) - Autoregressive Score Matching [113.4502004812927]
We propose autoregressive conditional score models (AR-CSM) where we parameterize the joint distribution in terms of the derivatives of univariable log-conditionals (scores)
For AR-CSM models, this divergence between data and model distributions can be computed and optimized efficiently, requiring no expensive sampling or adversarial training.
We show with extensive experimental results that it can be applied to density estimation on synthetic data, image generation, image denoising, and training latent variable models with implicit encoders.
arXiv Detail & Related papers (2020-10-24T07:01:24Z) - Dynamical Landscape and Multistability of a Climate Model [64.467612647225]
We find a third intermediate stable state in one of the two climate models we consider.
The combination of our approaches allows to identify how the negative feedback of ocean heat transport and entropy production drastically change the topography of Earth's climate.
arXiv Detail & Related papers (2020-10-20T15:31:38Z) - HECT: High-Dimensional Ensemble Consistency Testing for Climate Models [1.7587442088965226]
Climate models play a crucial role in understanding the effect of environmental changes on climate to help mitigate climate risks and inform decisions.
Large global climate models such as the Community Earth System Model (CESM), are very complex with millions of lines of code describing interactions of the atmosphere, land, oceans, and ice.
Our work uses probabilistics like tree-based algorithms and deep neural networks to perform a statistically rigorous goodness-of-fit test of high-dimensional and man-made data.
arXiv Detail & Related papers (2020-10-08T15:16:16Z) - Improving Maximum Likelihood Training for Text Generation with Density
Ratio Estimation [51.091890311312085]
We propose a new training scheme for auto-regressive sequence generative models, which is effective and stable when operating at large sample space encountered in text generation.
Our method stably outperforms Maximum Likelihood Estimation and other state-of-the-art sequence generative models in terms of both quality and diversity.
arXiv Detail & Related papers (2020-07-12T15:31:24Z) - Instability, Computational Efficiency and Statistical Accuracy [101.32305022521024]
We develop a framework that yields statistical accuracy based on interplay between the deterministic convergence rate of the algorithm at the population level, and its degree of (instability) when applied to an empirical object based on $n$ samples.
We provide applications of our general results to several concrete classes of models, including Gaussian mixture estimation, non-linear regression models, and informative non-response models.
arXiv Detail & Related papers (2020-05-22T22:30:52Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.