Searth Transformer: A Transformer Architecture Incorporating Earth's Geospheric Physical Priors for Global Mid-Range Weather Forecasting
- URL: http://arxiv.org/abs/2601.09467v1
- Date: Wed, 14 Jan 2026 13:20:17 GMT
- Title: Searth Transformer: A Transformer Architecture Incorporating Earth's Geospheric Physical Priors for Global Mid-Range Weather Forecasting
- Authors: Tianye Li, Qi Liu, Hao Li, Lei Chen, Wencong Cheng, Fei Zheng, Xiangao Xia, Ya Wang, Gang Huang, Weiwei Wang, Xuan Tong, Ziqing Zu, Yi Fang, Shenming Fu, Jiang Jiang, Haochen Li, Mingxing Li, Jiangjiang Xia,
- Abstract summary: YanTian is a global medium-range weather forecasting model.<n>It achieves higher accuracy than the high-resolution forecast of the European Centre for Medium-Range Weather Forecasts.<n>It requires roughly 200 times lower computational cost than standard autoregressive fine-tuning.
- Score: 23.666628869892737
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Accurate global medium-range weather forecasting is fundamental to Earth system science. Most existing Transformer-based forecasting models adopt vision-centric architectures that neglect the Earth's spherical geometry and zonal periodicity. In addition, conventional autoregressive training is computationally expensive and limits forecast horizons due to error accumulation. To address these challenges, we propose the Shifted Earth Transformer (Searth Transformer), a physics-informed architecture that incorporates zonal periodicity and meridional boundaries into window-based self-attention for physically consistent global information exchange. We further introduce a Relay Autoregressive (RAR) fine-tuning strategy that enables learning long-range atmospheric evolution under constrained memory and computational budgets. Based on these methods, we develop YanTian, a global medium-range weather forecasting model. YanTian achieves higher accuracy than the high-resolution forecast of the European Centre for Medium-Range Weather Forecasts and performs competitively with state-of-the-art AI models at one-degree resolution, while requiring roughly 200 times lower computational cost than standard autoregressive fine-tuning. Furthermore, YanTian attains a longer skillful forecast lead time for Z500 (10.3 days) than HRES (9 days). Beyond weather forecasting, this work establishes a robust algorithmic foundation for predictive modeling of complex global-scale geophysical circulation systems, offering new pathways for Earth system science.
Related papers
- A Space-Time Transformer for Precipitation Forecasting [38.87144329787491]
SaTformer is a video transformer that skillfully forecasts extreme precipitation from satellite radiances.<n>We reformulate precipitation regression into a classification problem, and employ a class-weighted loss to address label imbalances.<n>Our model scored first place on the NeurIPS Weather4Cast 2025 Cumulative Rainfall challenge.
arXiv Detail & Related papers (2025-11-14T09:10:31Z) - OneForecast: A Universal Framework for Global and Regional Weather Forecasting [67.61381313555091]
We propose a global-regional nested weather forecasting framework (OneForecast) based on graph neural networks.<n>By combining a dynamic system perspective with multi-grid theory, we construct a multi-scale graph structure and densify the target region.<n>We introduce an adaptive messaging mechanism, using dynamic gating units, to deeply integrate node and edge features for more accurate extreme event forecasting.
arXiv Detail & Related papers (2025-02-01T06:49:16Z) - Resolution-Agnostic Transformer-based Climate Downscaling [0.0]
This study introduces a cost-efficient downscaling method using a pretrained Earth Vision Transformer (Earth ViT) model.<n>It performs well without additional training, demonstrating its ability to generalize across different resolutions.<n>Ultimately, this method could provide more comprehensive estimates of potential future changes in key climate variables.
arXiv Detail & Related papers (2024-11-22T07:32:11Z) - FengWu-W2S: A deep learning model for seamless weather-to-subseasonal forecast of global atmosphere [53.22497376154084]
We propose FengWu-Weather to Subseasonal (FengWu-W2S), which builds on the FengWu global weather forecast model and incorporates an ocean-atmosphere-land coupling structure along with a diverse perturbation strategy.
Our hindcast results demonstrate that FengWu-W2S reliably predicts atmospheric conditions out to 3-6 weeks ahead, enhancing predictive capabilities for global surface air temperature, precipitation, geopotential height and intraseasonal signals such as the Madden-Julian Oscillation (MJO) and North Atlantic Oscillation (NAO)
Our ablation experiments on forecast error growth from daily to seasonal timescales reveal potential
arXiv Detail & Related papers (2024-11-15T13:44:37Z) - Super Resolution On Global Weather Forecasts [0.1747623282473278]
Group seeks to improve upon existing deep learning based forecasting methods by increasing spatial resolutions of global weather predictions.
Specifically, we are interested in performing super resolution (SR) on GraphCast temperature predictions by increasing the global precision from 1 degree of accuracy to 0.5 degrees.
arXiv Detail & Related papers (2024-09-17T19:07:13Z) - MambaDS: Near-Surface Meteorological Field Downscaling with Topography Constrained Selective State Space Modeling [68.69647625472464]
Downscaling, a crucial task in meteorological forecasting, enables the reconstruction of high-resolution meteorological states for target regions.
Previous downscaling methods lacked tailored designs for meteorology and encountered structural limitations.
We propose a novel model called MambaDS, which enhances the utilization of multivariable correlations and topography information.
arXiv Detail & Related papers (2024-08-20T13:45:49Z) - Efficient Subseasonal Weather Forecast using Teleconnection-informed
Transformers [29.33938664834226]
Subseasonal forecasting is pivotal for agriculture, water resource management, and early warning of disasters.
Recent advances in machine learning have revolutionized weather forecasting by achieving competitive predictive skills to numerical models.
However, training such foundation models requires thousands of GPU days, which causes substantial carbon emissions.
arXiv Detail & Related papers (2024-01-31T14:27:35Z) - FengWu-GHR: Learning the Kilometer-scale Medium-range Global Weather
Forecasting [56.73502043159699]
This work presents FengWu-GHR, the first data-driven global weather forecasting model running at the 0.09$circ$ horizontal resolution.
It introduces a novel approach that opens the door for operating ML-based high-resolution forecasts by inheriting prior knowledge from a low-resolution model.
The hindcast of weather prediction in 2022 indicates that FengWu-GHR is superior to the IFS-HRES.
arXiv Detail & Related papers (2024-01-28T13:23:25Z) - Towards an end-to-end artificial intelligence driven global weather forecasting system [57.5191940978886]
We present an AI-based data assimilation model, i.e., Adas, for global weather variables.
We demonstrate that Adas can assimilate global observations to produce high-quality analysis, enabling the system operate stably for long term.
We are the first to apply the methods to real-world scenarios, which is more challenging and has considerable practical application potential.
arXiv Detail & Related papers (2023-12-18T09:05:28Z) - Residual Corrective Diffusion Modeling for Km-scale Atmospheric Downscaling [58.456404022536425]
State of the art for physical hazard prediction from weather and climate requires expensive km-scale numerical simulations driven by coarser resolution global inputs.
Here, a generative diffusion architecture is explored for downscaling such global inputs to km-scale, as a cost-effective machine learning alternative.
The model is trained to predict 2km data from a regional weather model over Taiwan, conditioned on a 25km global reanalysis.
arXiv Detail & Related papers (2023-09-24T19:57:22Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.