GeoViT: A Versatile Vision Transformer Architecture for Geospatial Image
Analysis
- URL: http://arxiv.org/abs/2311.14301v1
- Date: Fri, 24 Nov 2023 06:22:38 GMT
- Title: GeoViT: A Versatile Vision Transformer Architecture for Geospatial Image
Analysis
- Authors: Madhav Khirwar, Ankur Narang
- Abstract summary: We introduce GeoViT, a compact vision transformer model adept in processing satellite imagery for multimodal segmentation.
We attain superior accuracy in estimating power generation rates, fuel type, plume coverage for CO2, and high-resolution NO2 concentration mapping.
- Score: 2.1647301294759624
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Greenhouse gases are pivotal drivers of climate change, necessitating precise
quantification and source identification to foster mitigation strategies. We
introduce GeoViT, a compact vision transformer model adept in processing
satellite imagery for multimodal segmentation, classification, and regression
tasks targeting CO2 and NO2 emissions. Leveraging GeoViT, we attain superior
accuracy in estimating power generation rates, fuel type, plume coverage for
CO2, and high-resolution NO2 concentration mapping, surpassing previous
state-of-the-art models while significantly reducing model size. GeoViT
demonstrates the efficacy of vision transformer architectures in harnessing
satellite-derived data for enhanced GHG emission insights, proving instrumental
in advancing climate change monitoring and emission regulation efforts
globally.
Related papers
- Enhancing Carbon Emission Reduction Strategies using OCO and ICOS data [40.572754656757475]
We propose a methodology to enhance local CO2 monitoring by integrating satellite data from the Orbiting Carbon Observatories (OCO-2 and OCO-3) with ground level observations from the Integrated Carbon Observation System (ICOS) and weather data from the ECMWF Reanalysis v5 (ERA5)
We employ weighted K-nearest neighbor (KNN) with machine learning models to predict ground level CO2 from satellite measurements, achieving a Root Mean Squared Error of 3.92 ppm.
arXiv Detail & Related papers (2024-10-05T21:23:58Z) - Machine Learning for Methane Detection and Quantification from Space -- A survey [49.7996292123687]
Methane (CH_4) is a potent anthropogenic greenhouse gas, contributing 86 times more to global warming than Carbon Dioxide (CO_2) over 20 years.
This work expands existing information on operational methane point source detection sensors in the Short-Wave Infrared (SWIR) bands.
It reviews the state-of-the-art for traditional as well as Machine Learning (ML) approaches.
arXiv Detail & Related papers (2024-08-27T15:03:20Z) - Fine-tuning of Geospatial Foundation Models for Aboveground Biomass Estimation [2.3429628556845405]
Fine-tuning of a geospatial foundation model to estimate above-ground biomass has comparable performance to a U-Net trained from scratch.
We also explore the transfer-learning capabilities of the models by fine-tuning on satellite imagery with sparse labels from different eco-regions in Brazil.
arXiv Detail & Related papers (2024-06-28T12:54:10Z) - Automatic Graph Topology-Aware Transformer [50.2807041149784]
We build a comprehensive graph Transformer search space with the micro-level and macro-level designs.
EGTAS evolves graph Transformer topologies at the macro level and graph-aware strategies at the micro level.
We demonstrate the efficacy of EGTAS across a range of graph-level and node-level tasks.
arXiv Detail & Related papers (2024-05-30T07:44:31Z) - Gasformer: A Transformer-based Architecture for Segmenting Methane Emissions from Livestock in Optical Gas Imaging [0.0]
Methane emissions from livestock, particularly cattle, significantly contribute to climate change.
We introduce Gasformer, a novel semantic segmentation architecture for detecting low-flow rate methane emissions from livestock.
We present two unique datasets captured with a FLIR GF77 OGI camera.
arXiv Detail & Related papers (2024-04-16T18:38:23Z) - GeoFormer: A Vision and Sequence Transformer-based Approach for
Greenhouse Gas Monitoring [2.1647301294759624]
We introduce a compact model that combines a vision transformer module with a highly efficient time-series transformer module to predict NO2 concentrations.
We train the proposed model to predict surface-level NO2 measurements using a dataset we constructed with Sentinel-5P images of ground-level monitoring stations.
arXiv Detail & Related papers (2024-02-11T11:20:29Z) - 2-D SSM: A General Spatial Layer for Visual Transformers [79.4957965474334]
A central objective in computer vision is to design models with appropriate 2-D inductive bias.
We leverage an expressive variation of the multidimensional State Space Model.
Our approach introduces efficient parameterization, accelerated computation, and a suitable normalization scheme.
arXiv Detail & Related papers (2023-06-11T09:41:37Z) - Multi-modal learning for geospatial vegetation forecasting [1.8180482634934092]
We introduce GreenEarthNet, the first dataset specifically designed for high-resolution vegetation forecasting.
We also present Contextformer, a novel deep learning approach for predicting vegetation greenness from Sentinel 2 satellite images.
To the best of our knowledge, this work presents the first models for continental-scale vegetation modeling at fine resolution able to capture anomalies beyond the seasonal cycle.
arXiv Detail & Related papers (2023-03-28T17:59:05Z) - Real-time high-resolution CO$_2$ geological storage prediction using
nested Fourier neural operators [58.728312684306545]
Carbon capture and storage (CCS) plays an essential role in global decarbonization.
Scaling up CCS deployment requires accurate and high-resolution modeling of the storage reservoir pressure buildup and the gaseous plume migration.
We introduce Nested Fourier Neural Operator (FNO), a machine-learning framework for high-resolution dynamic 3D CO2 storage modeling at a basin scale.
arXiv Detail & Related papers (2022-10-31T04:04:03Z) - Toward Foundation Models for Earth Monitoring: Proposal for a Climate
Change Benchmark [95.19070157520633]
Recent progress in self-supervision shows that pre-training large neural networks on vast amounts of unsupervised data can lead to impressive increases in generalisation for downstream tasks.
Such models, recently coined as foundation models, have been transformational to the field of natural language processing.
We propose to develop a new benchmark comprised of a variety of downstream tasks related to climate change.
arXiv Detail & Related papers (2021-12-01T15:38:19Z) - Estimating Crop Primary Productivity with Sentinel-2 and Landsat 8 using
Machine Learning Methods Trained with Radiative Transfer Simulations [58.17039841385472]
We take advantage of all parallel developments in mechanistic modeling and satellite data availability for advanced monitoring of crop productivity.
Our model successfully estimates gross primary productivity across a variety of C3 crop types and environmental conditions even though it does not use any local information from the corresponding sites.
This highlights its potential to map crop productivity from new satellite sensors at a global scale with the help of current Earth observation cloud computing platforms.
arXiv Detail & Related papers (2020-12-07T16:23:13Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.