A Meta-Analysis of Solar Forecasting Based on Skill Score
- URL: http://arxiv.org/abs/2208.10536v2
- Date: Wed, 12 Apr 2023 07:57:50 GMT
- Title: A Meta-Analysis of Solar Forecasting Based on Skill Score
- Authors: Thi Ngoc Nguyen and Felix M\"usgens
- Abstract summary: We conduct the first comprehensive meta-analysis of deterministic solar forecasting based on skill score.
A database of 4,687 points was built and analyzed.
More improvement is observed for intra-hour and intra-day than day-ahead forecasts.
- Score: 0.0
- License: http://creativecommons.org/licenses/by-sa/4.0/
- Abstract: We conduct the first comprehensive meta-analysis of deterministic solar
forecasting based on skill score, screening 1,447 papers from Google Scholar
and reviewing the full texts of 320 papers for data extraction. A database of
4,687 points was built and analyzed with multivariate adaptive regression
spline modelling, partial dependence plots, and linear regression. The marginal
impacts on skill score of ten factors were quantified. The analysis shows the
non-linearity and complex interaction between variables in the database.
Forecast horizon has a central impact and dominates other factors' impacts.
Therefore, the analysis of solar forecasts should be done separately for each
horizon. Climate zone variables have statistically significant correlation with
skill score. Regarding inputs, historical data and spatial temporal information
are highly helpful. For intra-day, sky and satellite images show the most
importance. For day-ahead, numerical weather predictions and locally measured
meteorological data are very efficient. All forecast models were compared.
Ensemble-hybrid models achieve the most accurate forecasts for all horizons.
Hybrid models show superiority for intra-hour while image-based methods are the
most efficient for intra-day forecasts. More training data can enhance skill
score. However, over-fitting is observed when there is too much training data
(longer than 2000 days). There has been a substantial improvement in solar
forecast accuracy, especially in recent years. More improvement is observed for
intra-hour and intra-day than day-ahead forecasts. By controlling for the key
differences between forecasts, including location variables, our findings can
be applied globally.
Related papers
- Scaling Laws of Global Weather Models [57.27583619011988]
We investigate the relationship between model performance (validation loss) and three key factors: model size, dataset size, and compute budget.<n>Across a range of models, we find that Aurora exhibits the strongest data-scaling behavior.<n>Our compute-optimal analysis indicates that, under fixed compute budgets, allocating resources to longer training durations yields greater performance gains than increasing model size.
arXiv Detail & Related papers (2026-02-26T12:57:38Z) - Scaling Open-Ended Reasoning to Predict the Future [56.672065928345525]
We train language models to make predictions on open-ended forecasting questions.<n>To scale up training data, we synthesize novel forecasting questions from global events reported in daily news.<n>We find calibration improvements from forecasting training generalize across popular benchmarks.
arXiv Detail & Related papers (2025-12-31T18:59:51Z) - A visual big data system for the prediction of weather-related variables: Jordan-Spain case study [0.0]
We propose a visual big data system that is designed to deal with high amounts of weather-related data.<n>The proposed system collects open data and loads them onto a local database fusing them at different levels of temporal and spatial aggregation.<n>The system has been assessed in terms of usability and predictive performance, obtaining an overall normalized mean squared error value of 0.00013.
arXiv Detail & Related papers (2025-10-24T05:59:08Z) - The Living Forecast: Evolving Day-Ahead Predictions into Intraday Reality [0.688204255655161]
This paper introduces a Bayesian updating mechanism that converts fully probabilistic day-ahead forecasts into intraday forecasts without retraining or re-inference.<n>Experiments on household electricity consumption and photovoltaic generation datasets demonstrate that the proposed method improves forecast accuracy up to 25% across likelihood-, sample-, quantile-, and point-based metrics.
arXiv Detail & Related papers (2025-10-14T08:25:20Z) - S$^2$Transformer: Scalable Structured Transformers for Global Station Weather Forecasting [67.93713728260646]
Existing time series forecasting methods often ignore or unidirectionally model spatial correlation when conducting large-scale global station forecasting.<n>This contradicts the nature underlying observations of the global weather system limiting forecast performance.<n>We propose a novel Structured Spatial Attention in this paper.<n>It partitions the spatial graph into a set of subgraphs and instantiates Intra-subgraph Attention to learn local spatial correlation within each subgraph.<n>It aggregates nodes into subgraph representations for message passing among the subgraphs via Inter-subgraph Attention -- considering both spatial proximity and global correlation.
arXiv Detail & Related papers (2025-09-10T05:33:28Z) - Appa: Bending Weather Dynamics with Latent Diffusion Models for Global Data Assimilation [4.430758443755128]
Appa is a score-based data assimilation model producing global atmospheric trajectories at 0.25-degree resolution and 1-hour intervals.
Our results establish latent score-based data assimilation as a promising foundation for future global atmospheric modeling systems.
arXiv Detail & Related papers (2025-04-25T22:14:29Z) - WeatherReal: A Benchmark Based on In-Situ Observations for Evaluating Weather Models [11.016845506758841]
We introduce WeatherReal, a novel benchmark dataset for weather forecasting derived from global near-surface in-situ observations.
This paper details the sources and processing methodologies underlying the dataset, and illustrates the advantage of in-situ observations in capturing hyper-local and extreme weather.
Our work aims to advance the AI-based weather forecasting research towards a more application-focused and operation-ready approach.
arXiv Detail & Related papers (2024-09-14T08:53:46Z) - ReAugment: Model Zoo-Guided RL for Few-Shot Time Series Augmentation and Forecasting [74.00765474305288]
We present a pilot study on using reinforcement learning (RL) for time series data augmentation.<n>Our method, ReAugment, tackles three critical questions: which parts of the training set should be augmented, how the augmentation should be performed, and what advantages RL brings to the process.
arXiv Detail & Related papers (2024-09-10T07:34:19Z) - Predicting unobserved climate time series data at distant areas via spatial correlation using reservoir computing [5.008022566317949]
This study focuses on a prediction of climatic elements, specifically near-surface temperature and pressure, at a target location apart from a data observation point.
Our results show that the accuracy of the predictions degrades with the distance between the observation and target locations.
arXiv Detail & Related papers (2024-06-05T08:39:10Z) - Site-specific Deterministic Temperature and Humidity Forecasts with Explainable and Reliable Machine Learning [0.0]
Recent developments in machine learning have prompted increasing interest in applying ML as a novel approach towards this problem.
We develop a working ML framework, named 'Multi-SiteBoost' and initial testing results show a significant improvement compared with gridded values from bias-corrected NWP models.
arXiv Detail & Related papers (2024-04-04T09:12:13Z) - Confidence and Dispersity Speak: Characterising Prediction Matrix for
Unsupervised Accuracy Estimation [51.809741427975105]
This work aims to assess how well a model performs under distribution shifts without using labels.
We use the nuclear norm that has been shown to be effective in characterizing both properties.
We show that the nuclear norm is more accurate and robust in accuracy than existing methods.
arXiv Detail & Related papers (2023-02-02T13:30:48Z) - ClimaX: A foundation model for weather and climate [51.208269971019504]
ClimaX is a deep learning model for weather and climate science.
It can be pre-trained with a self-supervised learning objective on climate datasets.
It can be fine-tuned to address a breadth of climate and weather tasks.
arXiv Detail & Related papers (2023-01-24T23:19:01Z) - GraphCast: Learning skillful medium-range global weather forecasting [107.40054095223779]
We introduce a machine learning-based method called "GraphCast", which can be trained directly from reanalysis data.
It predicts hundreds of weather variables, over 10 days at 0.25 degree resolution globally, in under one minute.
We show that GraphCast significantly outperforms the most accurate operational deterministic systems on 90% of 1380 verification targets.
arXiv Detail & Related papers (2022-12-24T18:15:39Z) - Pangu-Weather: A 3D High-Resolution Model for Fast and Accurate Global
Weather Forecast [91.9372563527801]
We present Pangu-Weather, a deep learning based system for fast and accurate global weather forecast.
For the first time, an AI-based method outperforms state-of-the-art numerical weather prediction (NWP) methods in terms of accuracy.
Pangu-Weather supports a wide range of downstream forecast scenarios, including extreme weather forecast and large-member ensemble forecast in real-time.
arXiv Detail & Related papers (2022-11-03T17:19:43Z) - Physics Informed Shallow Machine Learning for Wind Speed Prediction [66.05661813632568]
We analyze a massive dataset of wind measured from anemometers located at 10 m height in 32 locations in Italy.
We train supervised learning algorithms using the past history of wind to predict its value at a future time.
We find that the optimal design as well as its performance vary with the location.
arXiv Detail & Related papers (2022-04-01T14:55:10Z) - CAMul: Calibrated and Accurate Multi-view Time-Series Forecasting [70.54920804222031]
We propose a general probabilistic multi-view forecasting framework CAMul.
It can learn representations and uncertainty from diverse data sources.
It integrates the knowledge and uncertainty from each data view in a dynamic context-specific manner.
We show that CAMul outperforms other state-of-art probabilistic forecasting models by over 25% in accuracy and calibration.
arXiv Detail & Related papers (2021-09-15T17:13:47Z) - WeatherBench: A benchmark dataset for data-driven weather forecasting [17.76377510880905]
We present a benchmark dataset for data-driven medium-range weather forecasting.
We provide data derived from the ERA5 archive that has been processed to facilitate the use in machine learning models.
We provide baseline scores from simple linear regression techniques, deep learning models, as well as purely physical forecasting models.
arXiv Detail & Related papers (2020-02-02T19:20:46Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.