Leveraging AI multimodal geospatial foundation models for improved near-real-time flood mapping at a global scale
- URL: http://arxiv.org/abs/2512.02055v1
- Date: Thu, 27 Nov 2025 19:04:01 GMT
- Title: Leveraging AI multimodal geospatial foundation models for improved near-real-time flood mapping at a global scale
- Authors: Mirela G. Tulbure, Julio Caineta, Mark Broich, Mollie D. Gaines, Philippe Rufin, Leon-Friedrich Thomas, Hamed Alemohammad, Jan Hemmerling, Patrick Hostert,
- Abstract summary: Floods are among the most damaging weather-related hazards, and in 2024, the warmest year on record, extreme flood events affected communities across five continents.<n>Recent Geospatial Foundation Models (GFMs) offer improved generalizability through large-scale self-supervised pretraining.<n>We fine-tune ESA-IBM's TerraMind for flood extent mapping using FloodsNet.
- Score: 0.44993939572253855
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Floods are among the most damaging weather-related hazards, and in 2024, the warmest year on record, extreme flood events affected communities across five continents. Earth observation (EO) satellites provide critical, frequent coverage for mapping inundation, yet operational accuracy depends heavily on labeled datasets and model generalization. Recent Geospatial Foundation Models (GFMs), such as ESA-IBM's TerraMind, offer improved generalizability through large-scale self-supervised pretraining, but their performance on diverse global flood events remains poorly understood. We fine-tune TerraMind for flood extent mapping using FloodsNet, a harmonized multimodal dataset containing co-located Sentinel-1 (Synthetic Aperture Radar, SAR data) and Sentinel-2 (optical) imagery for 85 flood events worldwide. We tested four configurations (base vs. large models; frozen vs. unfrozen backbones) and compared against the TerraMind Sen1Floods11 example and a U-Net trained on both FloodsNet and Sen1Floods11. The base-unfrozen configuration provided the best balance of accuracy, precision, and recall at substantially lower computational cost than the large model. The large unfrozen model achieved the highest recall. Models trained on FloodsNet outperformed the Sen1Floods11-trained example in recall with similar overall accuracy. U-Net achieved higher recall than all GFM configurations, though with slightly lower accuracy and precision. Our results demonstrate that integrating multimodal optical and SAR data and fine-tuning a GFM can enhance near-real-time flood mapping. This study provides one of the first global-scale evaluations of a GFM for flood segmentation, highlighting both its potential and current limitations for climate adaptation and disaster resilience.
Related papers
- Predictive Modeling of Flood-Prone Areas Using SAR and Environmental Variables [0.0]
Flooding is one of the most destructive natural hazards worldwide, posing serious risks to ecosystems, infrastructure, and human livelihoods.<n>This study combines Synthetic Aperture Radar (SAR) imagery with environmental and hydrological data to model flood susceptibility in the River Nyando watershed, western Kenya.
arXiv Detail & Related papers (2025-12-06T16:24:10Z) - Riverine Flood Prediction and Early Warning in Mountainous Regions using Artificial Intelligence [0.9320657506524149]
This study uses the Kabul River between Pakistan and Afghanistan as a case study to reflect the complications of flood forecasting in transboundary basins.<n> utilizing satellite-based climatic data, this study applied numerous advanced machine-learning and deep learning models to predict daily and multi-step river flow.<n>The LSTM network outperformed other models, achieving the highest R2 value of 0.96 and the lowest RMSE value of 140.96 m3/sec.
arXiv Detail & Related papers (2025-05-24T11:10:09Z) - OneForecast: A Universal Framework for Global and Regional Weather Forecasting [67.61381313555091]
We propose a global-regional nested weather forecasting framework (OneForecast) based on graph neural networks.<n>By combining a dynamic system perspective with multi-grid theory, we construct a multi-scale graph structure and densify the target region.<n>We introduce an adaptive messaging mechanism, using dynamic gating units, to deeply integrate node and edge features for more accurate extreme event forecasting.
arXiv Detail & Related papers (2025-02-01T06:49:16Z) - BRIGHT: A globally distributed multimodal building damage assessment dataset with very-high-resolution for all-weather disaster response [50.76124284445902]
Building damage assessment (BDA) is an essential capability in the aftermath of a disaster to reduce human casualties.<n>Recent research focuses on the development of AI models to achieve accurate mapping of unseen disaster events.<n>We present a BDA dataset using veRy-hIGH-resoluTion optical and SAR imagery (BRIGHT) to support AI-based all-weather disaster response.
arXiv Detail & Related papers (2025-01-10T14:57:18Z) - Residual Corrective Diffusion Modeling for Km-scale Atmospheric Downscaling [58.456404022536425]
State of the art for physical hazard prediction from weather and climate requires expensive km-scale numerical simulations driven by coarser resolution global inputs.
Here, a generative diffusion architecture is explored for downscaling such global inputs to km-scale, as a cost-effective machine learning alternative.
The model is trained to predict 2km data from a regional weather model over Taiwan, conditioned on a 25km global reanalysis.
arXiv Detail & Related papers (2023-09-24T19:57:22Z) - Long-term drought prediction using deep neural networks based on geospatial weather data [75.38539438000072]
High-quality drought forecasting up to a year in advance is critical for agriculture planning and insurance.
We tackle drought data by introducing an end-to-end approach that adopts a systematic end-to-end approach.
Key findings are the exceptional performance of a Transformer model, EarthFormer, in making accurate short-term (up to six months) forecasts.
arXiv Detail & Related papers (2023-09-12T13:28:06Z) - Automating global landslide detection with heterogeneous ensemble
deep-learning classification [44.99833362998488]
Landslides threaten infrastructure, including roads, railways, buildings, and human life.
Hazard-based spatial planning and early warning systems are cost-effective strategies to reduce the risk to society from landslides.
Deep learning models have recently been applied for landside mapping using medium- to high-resolution satellite images as input.
arXiv Detail & Related papers (2023-09-12T10:56:16Z) - AI Increases Global Access to Reliable Flood Forecasts [6.498815455353096]
Floods are one of the most common natural disasters, with a disproportionate impact in developing countries that often lack dense streamflow gauge networks.
Using AI, we achieve reliability in predicting extreme riverine events in ungauged watersheds at up to a 5-day lead time.
This work highlights a need for increasing the availability of hydrological data to continue to improve global access to reliable flood warnings.
arXiv Detail & Related papers (2023-07-30T01:49:21Z) - Rapid Flood Inundation Forecast Using Fourier Neural Operator [77.30160833875513]
Flood inundation forecast provides critical information for emergency planning before and during flood events.
High-resolution hydrodynamic modeling has become more accessible in recent years, however, predicting flood extents at the street and building levels in real-time is still computationally demanding.
We present a hybrid process-based and data-driven machine learning (ML) approach for flood extent and inundation depth prediction.
arXiv Detail & Related papers (2023-07-29T22:49:50Z) - DAM-Net: Global Flood Detection from SAR Imagery Using Differential
Attention Metric-Based Vision Transformers [22.885444177106873]
Detection of flooded areas using high-resolution synthetic aperture radar (SAR) imagery is a critical task with applications in crisis and disaster management.
To address this issue, we propose a novel differential attention metric-based network (DAM-Net) in this study.
The DAM-Net comprises two key components: a weight-sharing Siamese backbone to obtain multi-scale change features of multi-temporal images and tokens containing high-level semantic information of water-body changes.
arXiv Detail & Related papers (2023-06-01T14:12:33Z) - Inferring the past: a combined CNN-LSTM deep learning framework to fuse
satellites for historical inundation mapping [3.753635296366263]
We develop a combined CNN--LSTM deep learning framework to fuse Sentinel-1 derived fractional flooded area with MODIS data in order to infer historical floods over Bangladesh.
The model is applied to historical MODIS data to infer the past 20 years of inundation extents over Bangladesh and compared to a thresholding algorithm and a physical model.
arXiv Detail & Related papers (2023-05-01T03:11:42Z) - Towards Daily High-resolution Inundation Observations using Deep
Learning and EO [0.0]
Constantly remote sensing presents a cost-effective solution for synoptic flood monitoring.
Satellites do offer timely inundation information when they cover an ongoing flood event, but they are limited by their resolution in terms of their ability to monitor flood evolution at various scales.
Data from satellites, such as the Copernicus Sentinels, which have high spatial and low temporal resolution, with data from NASA SMAP and GPM missions could yield high-resolution flood inundation at a daily scale.
arXiv Detail & Related papers (2022-08-10T14:04:50Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.