A Study on Monthly Marine Heatwave Forecasts in New Zealand: An Investigation of Imbalanced Regression Loss Functions with Neural Network Models
- URL: http://arxiv.org/abs/2502.13495v1
- Date: Wed, 19 Feb 2025 07:27:51 GMT
- Title: A Study on Monthly Marine Heatwave Forecasts in New Zealand: An Investigation of Imbalanced Regression Loss Functions with Neural Network Models
- Authors: Ding Ning, Varvara Vetrova, Sébastien Delaux, Rachael Tappenden, Karin R. Bryan, Yun Sing Koh,
- Abstract summary: Marine heatwaves (MHWs) are extreme ocean-temperature events with significant impacts on marine ecosystems and related industries.
Forecasting MHWs presents a challenging imbalanced regression task due to the rarity of extreme temperature anomalies in comparison to more frequent moderate conditions.
We use a fully-connected neural network and compare standard and specialized regression loss functions, including the mean squared error (MSE), the mean absolute error (MAE), the Huber, the weighted MSE, the focal-R, the balanced MSE, and a proposed scaling-weighted MSE.
- Score: 3.5638052139155105
- License:
- Abstract: Marine heatwaves (MHWs) are extreme ocean-temperature events with significant impacts on marine ecosystems and related industries. Accurate forecasts (one to six months ahead) of MHWs would aid in mitigating these impacts. However, forecasting MHWs presents a challenging imbalanced regression task due to the rarity of extreme temperature anomalies in comparison to more frequent moderate conditions. In this study, we examine monthly MHW forecasts for 12 locations around New Zealand. We use a fully-connected neural network and compare standard and specialized regression loss functions, including the mean squared error (MSE), the mean absolute error (MAE), the Huber, the weighted MSE, the focal-R, the balanced MSE, and a proposed scaling-weighted MSE. Results show that (i) short lead times (one month) are considerably more predictable than three- and six-month leads, (ii) models trained with the standard MSE or MAE losses excel at forecasting average conditions but struggle to capture extremes, and (iii) specialized loss functions such as the balanced MSE and our scaling-weighted MSE substantially improve forecasting of MHW and suspected MHW events. These findings underscore the importance of tailored loss functions for imbalanced regression, particularly in forecasting rare but impactful events such as MHWs.
Related papers
- Improved Forecasts of Global Extreme Marine Heatwaves Through a Physics-guided Data-driven Approach [6.881917151193729]
Marine heatwaves (MHWs) have a profound impact on marine ecosystems.
We create a novel deep learning neural network that is capable of accurate 10-day MHW forecasting.
Our framework has significantly higher accuracy and requires fewer computational resources.
arXiv Detail & Related papers (2024-12-20T03:47:56Z) - Advancing Marine Heatwave Forecasts: An Integrated Deep Learning Approach [3.8713566366330325]
Extreme climate phenomenon heatwaves (MHWs) pose significant challenges to marine ecosystems and industries.
This study introduces an integrated deep learning approach to forecast short-to-long-term MHWs on a global scale.
arXiv Detail & Related papers (2024-11-19T06:11:52Z) - Machine learning models for daily rainfall forecasting in Northern Tropical Africa using tropical wave predictors [0.0]
Numerical weather prediction (NWP) models often underperform compared to simpler climatology-based precipitation forecasts in northern tropical Africa.
This study uses two machine-learning models--gamma regression and a convolutional neural network (CNN)--trained on tropical waves (TWs) to predict daily rainfall during the July-September monsoon season.
arXiv Detail & Related papers (2024-08-29T08:36:22Z) - Validating Deep Learning Weather Forecast Models on Recent High-Impact Extreme Events [0.1747623282473278]
We compare machine learning weather prediction models and ECMWF's high-resolution forecast system.
We find that ML weather prediction models locally achieve similar accuracy to HRES on the record-shattering Pacific Northwest heatwave.
We also highlight structural differences in how the errors of HRES and the ML models build up to that event.
arXiv Detail & Related papers (2024-04-26T18:18:25Z) - ExtremeCast: Boosting Extreme Value Prediction for Global Weather Forecast [57.6987191099507]
We introduce Exloss, a novel loss function that performs asymmetric optimization and highlights extreme values to obtain accurate extreme weather forecast.
We also introduce ExBooster, which captures the uncertainty in prediction outcomes by employing multiple random samples.
Our solution can achieve state-of-the-art performance in extreme weather prediction, while maintaining the overall forecast accuracy comparable to the top medium-range forecast models.
arXiv Detail & Related papers (2024-02-02T10:34:13Z) - Long-term drought prediction using deep neural networks based on geospatial weather data [75.38539438000072]
High-quality drought forecasting up to a year in advance is critical for agriculture planning and insurance.
We tackle drought data by introducing an end-to-end approach that adopts a systematic end-to-end approach.
Key findings are the exceptional performance of a Transformer model, EarthFormer, in making accurate short-term (up to six months) forecasts.
arXiv Detail & Related papers (2023-09-12T13:28:06Z) - Forecast reconciliation for vaccine supply chain optimization [61.13962963550403]
Vaccine supply chain optimization can benefit from hierarchical time series forecasting.
Forecasts of different hierarchy levels become incoherent when higher levels do not match the sum of the lower levels forecasts.
We tackle the vaccine sale forecasting problem by modeling sales data from GSK between 2010 and 2021 as a hierarchical time series.
arXiv Detail & Related papers (2023-05-02T14:34:34Z) - Probable Domain Generalization via Quantile Risk Minimization [90.15831047587302]
Domain generalization seeks predictors which perform well on unseen test distributions.
We propose a new probabilistic framework for DG where the goal is to learn predictors that perform well with high probability.
arXiv Detail & Related papers (2022-07-20T14:41:09Z) - When in Doubt: Neural Non-Parametric Uncertainty Quantification for
Epidemic Forecasting [70.54920804222031]
Most existing forecasting models disregard uncertainty quantification, resulting in mis-calibrated predictions.
Recent works in deep neural models for uncertainty-aware time-series forecasting also have several limitations.
We model the forecasting task as a probabilistic generative process and propose a functional neural process model called EPIFNP.
arXiv Detail & Related papers (2021-06-07T18:31:47Z) - Physics-Informed Gaussian Process Regression for Probabilistic States
Estimation and Forecasting in Power Grids [67.72249211312723]
Real-time state estimation and forecasting is critical for efficient operation of power grids.
PhI-GPR is presented and used for forecasting and estimating the phase angle, angular speed, and wind mechanical power of a three-generator power grid system.
We demonstrate that the proposed PhI-GPR method can accurately forecast and estimate both observed and unobserved states.
arXiv Detail & Related papers (2020-10-09T14:18:31Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.