Vision Transformers for Multi-Variable Climate Downscaling: Emulating Regional Climate Models with a Shared Encoder and Multi-Decoder Architecture
- URL: http://arxiv.org/abs/2506.22447v1
- Date: Thu, 12 Jun 2025 11:48:41 GMT
- Title: Vision Transformers for Multi-Variable Climate Downscaling: Emulating Regional Climate Models with a Shared Encoder and Multi-Decoder Architecture
- Authors: Fabio Merizzi, Harilaos Loukos,
- Abstract summary: We propose a multi-task, multi-variable Vision Transformer (ViT) architecture with a shared encoder and variable-specific decoders (1EMD)<n>We show that our multi-variable approach achieves positive cross-variable knowledge transfer and consistently outperforms single-variable baselines trained under identical conditions.
- Score: 0.0
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Global Climate Models (GCMs) are critical for simulating large-scale climate dynamics, but their coarse spatial resolution limits their applicability in regional studies. Regional Climate Models (RCMs) refine this through dynamic downscaling, albeit at considerable computational cost and with limited flexibility. While deep learning has emerged as an efficient data-driven alternative, most existing studies have focused on single-variable models that downscale one variable at a time. This approach can lead to limited contextual awareness, redundant computation, and lack of cross-variable interaction. Our study addresses these limitations by proposing a multi-task, multi-variable Vision Transformer (ViT) architecture with a shared encoder and variable-specific decoders (1EMD). The proposed architecture jointly predicts three key climate variables: surface temperature (tas), wind speed (sfcWind), and 500 hPa geopotential height (zg500), directly from GCM-resolution inputs, emulating RCM-scale downscaling over Europe. We show that our multi-variable approach achieves positive cross-variable knowledge transfer and consistently outperforms single-variable baselines trained under identical conditions, while also improving computational efficiency. These results demonstrate the effectiveness of multi-variable modeling for high-resolution climate downscaling.
Related papers
- World Model-Based Learning for Long-Term Age of Information Minimization in Vehicular Networks [53.98633183204453]
In this paper, a novel world model-based learning framework is proposed to minimize packet-completeness-aware age of information (CAoI) in a vehicular network.<n>A world model framework is proposed to jointly learn a dynamic model of the mmWave V2X environment and use it to imagine trajectories for learning how to perform link scheduling.<n>In particular, the long-term policy is learned in differentiable imagined trajectories instead of environment interactions.
arXiv Detail & Related papers (2025-05-03T06:23:18Z) - Paraformer: Parameterization of Sub-grid Scale Processes Using Transformers [6.622831012413507]
We propose a "memory-aware" Transformer-based model on ClimSim, the largest dataset ever created for climate parameterization.<n>Our results demonstrate that the proposed model successfully captures the complex non-linear dependencies in the sub-grid scale variables.
arXiv Detail & Related papers (2024-12-21T20:21:52Z) - Dynamical-generative downscaling of climate model ensembles [13.376226374728917]
We propose a novel approach combining dynamical downscaling with generative artificial intelligence to reduce the cost and improve the uncertainty estimates of downscaled climate projections.
In our framework, an RCM dynamically downscales ESM output to an intermediate resolution, followed by a generative diffusion model that further refines the resolution to the target scale.
arXiv Detail & Related papers (2024-10-02T17:31:01Z) - Evaluating the transferability potential of deep learning models for climate downscaling [16.30722178785489]
We evaluate the efficacy of training deep learning downscaling models on multiple climate datasets to learn more robust and transferable representations.
We assess the spatial, variable, and product transferability of downscaling models experimentally, to understand the generalizability of these different architecture types.
arXiv Detail & Related papers (2024-07-17T12:10:24Z) - Multi-Stream Cellular Test-Time Adaptation of Real-Time Models Evolving in Dynamic Environments [53.79708667153109]
Smart objects, notably autonomous vehicles, face challenges in critical local computations due to limited resources.
We propose a novel Multi-Stream Cellular Test-Time Adaptation setup where models adapt on the fly to a dynamic environment divided into cells.
We validate our methodology in the context of autonomous vehicles navigating across cells defined based on location and weather conditions.
arXiv Detail & Related papers (2024-04-27T15:00:57Z) - Transferability and explainability of deep learning emulators for
regional climate model projections: Perspectives for future applications [0.4821250031784094]
Regional climate models (RCMs) are essential tools for simulating and studying regional climate variability and change.
Deep learning models have been introduced as a cost-effective and promising alternative that requires only short RCM simulations to train the models.
This paper considers the two different emulation approaches proposed in the literature (PP and MOS)
We find that both approaches are able to emulate certain climatological properties of RCMs for different periods and scenarios (soft transferability) but the consistency of the emulation functions differ between approaches.
arXiv Detail & Related papers (2023-11-01T00:44:39Z) - Multi-variable Hard Physical Constraints for Climate Model Downscaling [17.402215838651557]
Global Climate Models (GCMs) are the primary tool to simulate climate evolution and assess the impacts of climate change.
They often operate at a coarse spatial resolution that limits their accuracy in reproducing local-scale phenomena.
This study investigates the scope of this problem and, through an application on temperature, lays the foundation for a framework introducing multi-variable hard constraints.
arXiv Detail & Related papers (2023-08-02T11:42:02Z) - ClimaX: A foundation model for weather and climate [51.208269971019504]
ClimaX is a deep learning model for weather and climate science.
It can be pre-trained with a self-supervised learning objective on climate datasets.
It can be fine-tuned to address a breadth of climate and weather tasks.
arXiv Detail & Related papers (2023-01-24T23:19:01Z) - Multi-scale Digital Twin: Developing a fast and physics-informed
surrogate model for groundwater contamination with uncertain climate models [53.44486283038738]
Climate change exacerbates the long-term soil management problem of groundwater contamination.
We develop a physics-informed machine learning surrogate model using U-Net enhanced Fourier Neural Contaminated (PDENO)
In parallel, we develop a convolutional autoencoder combined with climate data to reduce the dimensionality of climatic region similarities across the United States.
arXiv Detail & Related papers (2022-11-20T06:46:35Z) - DeepClimGAN: A High-Resolution Climate Data Generator [60.59639064716545]
Earth system models (ESMs) are often used to generate future projections of climate change scenarios.
As a compromise, emulators are substantially less expensive but may not have all of the complexity of an ESM.
Here we demonstrate the use of a conditional generative adversarial network (GAN) to act as an ESM emulator.
arXiv Detail & Related papers (2020-11-23T20:13:37Z) - Learning to Continuously Optimize Wireless Resource In Episodically
Dynamic Environment [55.91291559442884]
This work develops a methodology that enables data-driven methods to continuously learn and optimize in a dynamic environment.
We propose to build the notion of continual learning into the modeling process of learning wireless systems.
Our design is based on a novel min-max formulation which ensures certain fairness" across different data samples.
arXiv Detail & Related papers (2020-11-16T08:24:34Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.