MeteorPred: A Meteorological Multimodal Large Model and Dataset for Severe Weather Event Prediction
- URL: http://arxiv.org/abs/2508.06859v1
- Date: Sat, 09 Aug 2025 06:54:41 GMT
- Title: MeteorPred: A Meteorological Multimodal Large Model and Dataset for Severe Weather Event Prediction
- Authors: Shuo Tang, Jian Xu, Jiadong Zhang, Yi Chen, Qizhao Jin, Lingdong Shen, Chenglin Liu, Shiming Xiang,
- Abstract summary: We introduce MP-Bench, the first large-scale temporal multimodal dataset for severe weather events prediction.<n>On top of this dataset, we develop a multimodal large model (MMLM) that directly ingests 4D meteorological inputs.<n> MMLM performs exceptionally well across multiple tasks, highlighting its effectiveness in severe weather understanding.
- Score: 44.09814361538848
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Timely and accurate severe weather warnings are critical for disaster mitigation. However, current forecasting systems remain heavily reliant on manual expert interpretation, introducing subjectivity and significant operational burdens. With the rapid development of AI technologies, the end-to-end "AI weather station" is gradually emerging as a new trend in predicting severe weather events. Three core challenges impede the development of end-to-end AI severe weather system: (1) scarcity of severe weather event samples; (2) imperfect alignment between high-dimensional meteorological data and textual warnings; (3) existing multimodal language models are unable to handle high-dimensional meteorological data and struggle to fully capture the complex dependencies across temporal sequences, vertical pressure levels, and spatial dimensions. To address these challenges, we introduce MP-Bench, the first large-scale temporal multimodal dataset for severe weather events prediction, comprising 421,363 pairs of raw multi-year meteorological data and corresponding text caption, covering a wide range of severe weather scenarios across China. On top of this dataset, we develop a meteorology multimodal large model (MMLM) that directly ingests 4D meteorological inputs. In addition, it is designed to accommodate the unique characteristics of 4D meteorological data flow, incorporating three plug-and-play adaptive fusion modules that enable dynamic feature extraction and integration across temporal sequences, vertical pressure layers, and spatial dimensions. Extensive experiments on MP-Bench demonstrate that MMLM performs exceptionally well across multiple tasks, highlighting its effectiveness in severe weather understanding and marking a key step toward realizing automated, AI-driven weather forecasting systems. Our source code and dataset will be made publicly available.
Related papers
- OneForecast: A Universal Framework for Global and Regional Weather Forecasting [67.61381313555091]
We propose a global-regional nested weather forecasting framework (OneForecast) based on graph neural networks.<n>By combining a dynamic system perspective with multi-grid theory, we construct a multi-scale graph structure and densify the target region.<n>We introduce an adaptive messaging mechanism, using dynamic gating units, to deeply integrate node and edge features for more accurate extreme event forecasting.
arXiv Detail & Related papers (2025-02-01T06:49:16Z) - FATE: Focal-modulated Attention Encoder for Multivariate Time-series Forecasting [0.0]
Climate stands as one of the most pressing global challenges of the twenty-first century, with far-reaching consequences such as rising sea levels, melting glaciers, and increasingly extreme weather patterns.<n> Accurate forecasting is critical for monitoring these phenomena and supporting mitigation strategies.<n>Recent data-driven models for time-series forecasting, including CNNs, RNNs, and attention-based transformers, have shown promise but struggle with dependencies and limited parallelization.<n>In this work, we present Modulated Attention Focal (FATE) for reliable time-series forecasting.
arXiv Detail & Related papers (2024-08-21T04:40:18Z) - FuXi Weather: A data-to-forecast machine learning system for global weather [13.052716094161886]
FuXi Weather is a machine learning weather forecasting system that assimilates data from multiple satellites.
FuXi Weather consistently outperforms ECMWF HRES in observation-sparse regions, such as central Africa.
arXiv Detail & Related papers (2024-08-10T07:42:01Z) - WeatherQA: Can Multimodal Language Models Reason about Severe Weather? [45.43764278625153]
Severe convective weather events, such as hail, tornadoes, and thunderstorms, often occur quickly yet cause significant damage, costing billions of dollars every year.
This highlights the importance of forecasting severe weather threats hours in advance to better prepare meteorologists and residents in at-risk areas.
We introduce WeatherQA, the first multimodal dataset designed for machines to reason about complex combinations of weather parameters and predict severe weather in real-world scenarios.
arXiv Detail & Related papers (2024-06-17T05:23:18Z) - WeatherFormer: A Pretrained Encoder Model for Learning Robust Weather Representations from Small Datasets [0.5735035463793009]
WeatherFormer is a transformer encoder-based model designed to learn robust weather features from minimal observations.
WeatherFormer was pretrained on a large pretraining dataset comprised of 39 years of satellite measurements across the Americas.
arXiv Detail & Related papers (2024-05-22T17:43:46Z) - Generalizing Weather Forecast to Fine-grained Temporal Scales via Physics-AI Hybrid Modeling [55.13352174687475]
This paper proposes a physics-AI hybrid model (i.e., WeatherGFT) which generalizes weather forecasts to finer-grained temporal scales beyond training dataset.<n>Specifically, we employ a carefully designed PDE kernel to simulate physical evolution on a small time scale.<n>We also introduce a lead time-aware training framework to promote the generalization of the model at different lead times.
arXiv Detail & Related papers (2024-05-22T16:21:02Z) - Towards an end-to-end artificial intelligence driven global weather forecasting system [57.5191940978886]
We present an AI-based data assimilation model, i.e., Adas, for global weather variables.
We demonstrate that Adas can assimilate global observations to produce high-quality analysis, enabling the system operate stably for long term.
We are the first to apply the methods to real-world scenarios, which is more challenging and has considerable practical application potential.
arXiv Detail & Related papers (2023-12-18T09:05:28Z) - A Distributed Approach to Meteorological Predictions: Addressing Data
Imbalance in Precipitation Prediction Models through Federated Learning and
GANs [0.0]
classification of weather data involves categorizing meteorological phenomena into classes, thereby facilitating nuanced analyses and precise predictions.
It's imperative that classification algorithms proficiently navigate challenges such as data imbalances.
Data augmentation techniques can improve the model's accuracy in classifying rare but critical weather events.
arXiv Detail & Related papers (2023-10-19T21:28:20Z) - Prompt Federated Learning for Weather Forecasting: Toward Foundation
Models on Meteorological Data [37.549578998407675]
To tackle the global climate challenge, it urgently needs to develop a collaborative platform for comprehensive weather forecasting on large-scale meteorological data.
This paper develops a foundation model across regions of understanding complex meteorological data and providing weather forecasting.
A novel prompt learning mechanism has been adopted to satisfy low-resourced sensors' communication and computational constraints.
arXiv Detail & Related papers (2023-01-22T16:47:05Z) - Pangu-Weather: A 3D High-Resolution Model for Fast and Accurate Global
Weather Forecast [91.9372563527801]
We present Pangu-Weather, a deep learning based system for fast and accurate global weather forecast.
For the first time, an AI-based method outperforms state-of-the-art numerical weather prediction (NWP) methods in terms of accuracy.
Pangu-Weather supports a wide range of downstream forecast scenarios, including extreme weather forecast and large-member ensemble forecast in real-time.
arXiv Detail & Related papers (2022-11-03T17:19:43Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.