Assessment of a new GeoAI foundation model for flood inundation mapping
- URL: http://arxiv.org/abs/2309.14500v4
- Date: Fri, 3 Nov 2023 21:58:45 GMT
- Title: Assessment of a new GeoAI foundation model for flood inundation mapping
- Authors: Wenwen Li, Hyunho Lee, Sizhe Wang, Chia-Yu Hsu, Samantha T. Arundel
- Abstract summary: This paper evaluates the performance of the first-of-its-kind geospatial foundation model, IBM-NASA's Prithvi, to support a crucial geospatial analysis task: flood inundation mapping.
A benchmark dataset, Sen1Floods11, is used in the experiments, and the models' predictability, generalizability, and transferability are evaluated.
Results show the good transferability of the Prithvi model, highlighting its performance advantages in segmenting flooded areas in previously unseen regions.
- Score: 4.312965283062856
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Vision foundation models are a new frontier in Geospatial Artificial
Intelligence (GeoAI), an interdisciplinary research area that applies and
extends AI for geospatial problem solving and geographic knowledge discovery,
because of their potential to enable powerful image analysis by learning and
extracting important image features from vast amounts of geospatial data. This
paper evaluates the performance of the first-of-its-kind geospatial foundation
model, IBM-NASA's Prithvi, to support a crucial geospatial analysis task: flood
inundation mapping. This model is compared with convolutional neural network
and vision transformer-based architectures in terms of mapping accuracy for
flooded areas. A benchmark dataset, Sen1Floods11, is used in the experiments,
and the models' predictability, generalizability, and transferability are
evaluated based on both a test dataset and a dataset that is completely unseen
by the model. Results show the good transferability of the Prithvi model,
highlighting its performance advantages in segmenting flooded areas in
previously unseen regions. The findings also indicate areas for improvement for
the Prithvi model in terms of adopting multi-scale representation learning,
developing more end-to-end pipelines for high-level image analysis tasks, and
offering more flexibility in terms of input data bands.
Related papers
- Geospatial foundation models for image analysis: evaluating and enhancing NASA-IBM Prithvi's domain adaptability [3.7899026023232136]
This paper evaluates the recently released NASA-IBM GFM Prithvi for its predictive performance on high-level image analysis tasks.
Prithvi is one of the first open-source GFMs trained on time-series of high-resolution remote sensing imagery.
arXiv Detail & Related papers (2024-08-31T15:51:23Z) - Swarm Intelligence in Geo-Localization: A Multi-Agent Large Vision-Language Model Collaborative Framework [51.26566634946208]
We introduce smileGeo, a novel visual geo-localization framework.
By inter-agent communication, smileGeo integrates the inherent knowledge of these agents with additional retrieved information.
Results show that our approach significantly outperforms current state-of-the-art methods.
arXiv Detail & Related papers (2024-08-21T03:31:30Z) - Towards Vision-Language Geo-Foundation Model: A Survey [65.70547895998541]
Vision-Language Foundation Models (VLFMs) have made remarkable progress on various multimodal tasks.
This paper thoroughly reviews VLGFMs, summarizing and analyzing recent developments in the field.
arXiv Detail & Related papers (2024-06-13T17:57:30Z) - ImplicitTerrain: a Continuous Surface Model for Terrain Data Analysis [14.013976303831313]
ImplicitTerrain is an implicit neural representation (INR) approach for modeling high-resolution terrain continuously and differentiably.
Our experiments demonstrate superior surface fitting accuracy, effective topological feature retrieval, and various topographical feature extraction.
arXiv Detail & Related papers (2024-05-31T23:05:34Z) - Segment Anything Model Can Not Segment Anything: Assessing AI Foundation
Model's Generalizability in Permafrost Mapping [19.307294875969827]
This paper introduces AI foundation models and their defining characteristics.
We evaluate the performance of large AI vision models, especially Meta's Segment Anything Model (SAM)
The results show that although promising, SAM still has room for improvement to support AI-augmented terrain mapping.
arXiv Detail & Related papers (2024-01-16T19:10:09Z) - Innovative Horizons in Aerial Imagery: LSKNet Meets DiffusionDet for
Advanced Object Detection [55.2480439325792]
We present an in-depth evaluation of an object detection model that integrates the LSKNet backbone with the DiffusionDet head.
The proposed model achieves a mean average precision (MAP) of approximately 45.7%, which is a significant improvement.
This advancement underscores the effectiveness of the proposed modifications and sets a new benchmark in aerial image analysis.
arXiv Detail & Related papers (2023-11-21T19:49:13Z) - On the Opportunities and Challenges of Foundation Models for Geospatial
Artificial Intelligence [39.86997089245117]
Foundations models (FMs) can be adapted to a wide range of downstream tasks by fine-tuning, few-shot, or zero-shot learning.
We propose that one of the major challenges of developing a FM for GeoAI is to address the multimodality nature of geospatial tasks.
arXiv Detail & Related papers (2023-04-13T19:50:17Z) - GeoNet: Benchmarking Unsupervised Adaptation across Geographies [71.23141626803287]
We study the problem of geographic robustness and make three main contributions.
First, we introduce a large-scale dataset GeoNet for geographic adaptation.
Second, we hypothesize that the major source of domain shifts arise from significant variations in scene context.
Third, we conduct an extensive evaluation of several state-of-the-art unsupervised domain adaptation algorithms and architectures.
arXiv Detail & Related papers (2023-03-27T17:59:34Z) - Towards Geospatial Foundation Models via Continual Pretraining [22.825065739563296]
We propose a novel paradigm for building highly effective foundation models with minimal resource cost and carbon impact.
We first construct a compact yet diverse dataset from multiple sources to promote feature diversity, which we term GeoPile.
Then, we investigate the potential of continual pretraining from large-scale ImageNet-22k models and propose a multi-objective continual pretraining paradigm.
arXiv Detail & Related papers (2023-02-09T07:39:02Z) - A General Purpose Neural Architecture for Geospatial Systems [142.43454584836812]
We present a roadmap towards the construction of a general-purpose neural architecture (GPNA) with a geospatial inductive bias.
We envision how such a model may facilitate cooperation between members of the community.
arXiv Detail & Related papers (2022-11-04T09:58:57Z) - Structured Landmark Detection via Topology-Adapting Deep Graph Learning [75.20602712947016]
We present a new topology-adapting deep graph learning approach for accurate anatomical facial and medical landmark detection.
The proposed method constructs graph signals leveraging both local image features and global shape features.
Experiments are conducted on three public facial image datasets (WFLW, 300W, and COFW-68) as well as three real-world X-ray medical datasets (Cephalometric (public), Hand and Pelvis)
arXiv Detail & Related papers (2020-04-17T11:55:03Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.