IndiaWeatherBench: A Dataset and Benchmark for Data-Driven Regional Weather Forecasting over India
- URL: http://arxiv.org/abs/2509.00653v1
- Date: Sun, 31 Aug 2025 01:25:49 GMT
- Title: IndiaWeatherBench: A Dataset and Benchmark for Data-Driven Regional Weather Forecasting over India
- Authors: Tung Nguyen, Harkanwar Singh, Nilay Naharas, Lucas Bandarkar, Aditya Grover,
- Abstract summary: IndiaWeatherBench is a benchmark for data-driven regional weather forecasting focused on the Indian subcontinent.<n>We implement and evaluate a range of models across diverse architectures, including UNets, Transformers, and Graph-based networks.<n>We open-source all raw and preprocessed datasets, model implementations, and pipelines to promote accessibility and future development.
- Score: 37.53463011903496
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Regional weather forecasting is a critical problem for localized climate adaptation, disaster mitigation, and sustainable development. While machine learning has shown impressive progress in global weather forecasting, regional forecasting remains comparatively underexplored. Existing efforts often use different datasets and experimental setups, limiting fair comparison and reproducibility. We introduce IndiaWeatherBench, a comprehensive benchmark for data-driven regional weather forecasting focused on the Indian subcontinent. IndiaWeatherBench provides a curated dataset built from high-resolution regional reanalysis products, along with a suite of deterministic and probabilistic metrics to facilitate consistent training and evaluation. To establish strong baselines, we implement and evaluate a range of models across diverse architectures, including UNets, Transformers, and Graph-based networks, as well as different boundary conditioning strategies and training objectives. While focused on India, IndiaWeatherBench is easily extensible to other geographic regions. We open-source all raw and preprocessed datasets, model implementations, and evaluation pipelines to promote accessibility and future development. We hope IndiaWeatherBench will serve as a foundation for advancing regional weather forecasting research. Code is available at https://github.com/tung-nd/IndiaWeatherBench.
Related papers
- Learning Regional Monsoon Patterns with a Multimodal Attention U-Net [3.134311415077079]
We present a multimodal deep learning framework for high-resolution precipitation classification.<n>We curate a new 1 km resolution dataset for five Indian states.<n>Our framework consistently outperforms unimodal baselines and existing deep learning methods.
arXiv Detail & Related papers (2025-09-27T11:48:30Z) - OneForecast: A Universal Framework for Global and Regional Weather Forecasting [42.49580040971529]
We propose a global-regional nested weather forecasting framework (OneForecast) based on graph neural networks.<n>By combining a dynamic system perspective with multi-grid theory, we construct a multi-scale graph structure and densify the target region.<n>We introduce an adaptive messaging mechanism, using dynamic gating units, to deeply integrate node and edge features for more accurate extreme event forecasting.
arXiv Detail & Related papers (2025-02-01T06:49:16Z) - PredBench: Benchmarking Spatio-Temporal Prediction across Diverse Disciplines [86.36060279469304]
We introduce PredBench, a benchmark tailored for the holistic evaluation of prediction-temporal networks.
This benchmark integrates 12 widely adopted methods with diverse datasets across multiple application domains.
Its multi-dimensional evaluation framework broadens the analysis with a comprehensive set of metrics.
arXiv Detail & Related papers (2024-07-11T11:51:36Z) - Day-ahead regional solar power forecasting with hierarchical temporal
convolutional neural networks using historical power generation and weather
data [2.9884358130293056]
We propose two deep-learning-based regional forecasting methods that can effectively leverage both types of time series ( aggregated and individual) with weather data in a region.
The proposed work is evaluated using a large dataset collected over a year from 101 locations across Western Australia to provide a day ahead forecast.
arXiv Detail & Related papers (2024-03-04T00:09:07Z) - SCTc-TE: A Comprehensive Formulation and Benchmark for Temporal Event Forecasting [63.01035584154509]
We develop a fully automated pipeline and construct a large-scale dataset named MidEast-TE from about 0.6 million news articles.
This dataset focuses on the cooperation and conflict events among countries mainly in the MidEast region from 2015 to 2022.
We propose a novel method LoGo that is able to take advantage of both Local and Global contexts for SCTc-TE forecasting.
arXiv Detail & Related papers (2023-12-02T07:40:21Z) - Federated Prompt Learning for Weather Foundation Models on Devices [37.88417074427373]
On-device intelligence for weather forecasting uses local deep learning models to analyze weather patterns without centralized cloud computing.
This paper propose Federated Prompt Learning for Weather Foundation Models on Devices (FedPoD)
FedPoD enables devices to obtain highly customized models while maintaining communication efficiency.
arXiv Detail & Related papers (2023-05-23T16:59:20Z) - ClimaX: A foundation model for weather and climate [51.208269971019504]
ClimaX is a deep learning model for weather and climate science.
It can be pre-trained with a self-supervised learning objective on climate datasets.
It can be fine-tuned to address a breadth of climate and weather tasks.
arXiv Detail & Related papers (2023-01-24T23:19:01Z) - Pangu-Weather: A 3D High-Resolution Model for Fast and Accurate Global
Weather Forecast [91.9372563527801]
We present Pangu-Weather, a deep learning based system for fast and accurate global weather forecast.
For the first time, an AI-based method outperforms state-of-the-art numerical weather prediction (NWP) methods in terms of accuracy.
Pangu-Weather supports a wide range of downstream forecast scenarios, including extreme weather forecast and large-member ensemble forecast in real-time.
arXiv Detail & Related papers (2022-11-03T17:19:43Z) - SubseasonalClimateUSA: A Dataset for Subseasonal Forecasting and
Benchmarking [20.442879707675115]
SubseasonalClimateUSA is a curated dataset for training and benchmarking subseasonal forecasting models in the United States.
We use this dataset to benchmark a diverse suite of models, including operational dynamical models, classical meteorological baselines, and ten state-of-the-art machine learning and deep learning-based methods from the literature.
arXiv Detail & Related papers (2021-09-21T18:42:10Z) - WeatherBench: A benchmark dataset for data-driven weather forecasting [17.76377510880905]
We present a benchmark dataset for data-driven medium-range weather forecasting.
We provide data derived from the ERA5 archive that has been processed to facilitate the use in machine learning models.
We provide baseline scores from simple linear regression techniques, deep learning models, as well as purely physical forecasting models.
arXiv Detail & Related papers (2020-02-02T19:20:46Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.