Spatial Distribution-Shift Aware Knowledge-Guided Machine Learning
- URL: http://arxiv.org/abs/2502.14840v1
- Date: Thu, 20 Feb 2025 18:52:24 GMT
- Title: Spatial Distribution-Shift Aware Knowledge-Guided Machine Learning
- Authors: Arun Sharma, Majid Farhadloo, Mingzhou Yang, Ruolei Zeng, Subhankar Ghosh, Shashi Shekhar,
- Abstract summary: Given inputs of diverse soil characteristics and climate data, we aimed to build a model to predict accurate land emissions.
SDSA-KGML models achieve higher local accuracy for the specified states in the Midwest Region.
- Score: 4.414885369283509
- License:
- Abstract: Given inputs of diverse soil characteristics and climate data gathered from various regions, we aimed to build a model to predict accurate land emissions. The problem is important since accurate quantification of the carbon cycle in agroecosystems is crucial for mitigating climate change and ensuring sustainable food production. Predicting accurate land emissions is challenging since calibrating the heterogeneous nature of soil properties, moisture, and environmental conditions is hard at decision-relevant scales. Traditional approaches do not adequately estimate land emissions due to location-independent parameters failing to leverage the spatial heterogeneity and also require large datasets. To overcome these limitations, we proposed Spatial Distribution-Shift Aware Knowledge-Guided Machine Learning (SDSA-KGML), which leverages location-dependent parameters that account for significant spatial heterogeneity in soil moisture from multiple sites within the same region. Experimental results demonstrate that SDSA-KGML models achieve higher local accuracy for the specified states in the Midwest Region.
Related papers
- Towards Kriging-informed Conditional Diffusion for Regional Sea-Level Data Downscaling [3.8178633709015446]
Given coarser-resolution projections from global climate models or satellite data, the downscaling problem aims to estimate finer-resolution regional climate data.
This problem is societally crucial for effective adaptation, mitigation, and resilience against significant risks from climate change.
We propose a novel Kriging-informed Conditional Diffusion Probabilistic Model (Ki-CDPM) to capture spatial variability while preserving fine-scale features.
arXiv Detail & Related papers (2024-10-21T04:24:10Z) - Combining Observational Data and Language for Species Range Estimation [63.65684199946094]
We propose a novel approach combining millions of citizen science species observations with textual descriptions from Wikipedia.
Our framework maps locations, species, and text descriptions into a common space, enabling zero-shot range estimation from textual descriptions.
Our approach also acts as a strong prior when combined with observational data, resulting in more accurate range estimation with less data.
arXiv Detail & Related papers (2024-10-14T17:22:55Z) - Efficient Localized Adaptation of Neural Weather Forecasting: A Case Study in the MENA Region [62.09891513612252]
We focus on limited-area modeling and train our model specifically for localized region-level downstream tasks.
We consider the MENA region due to its unique climatic challenges, where accurate localized weather forecasting is crucial for managing water resources, agriculture and mitigating the impacts of extreme weather events.
Our study aims to validate the effectiveness of integrating parameter-efficient fine-tuning (PEFT) methodologies, specifically Low-Rank Adaptation (LoRA) and its variants, to enhance forecast accuracy, as well as training speed, computational resource utilization, and memory efficiency in weather and climate modeling for specific regions.
arXiv Detail & Related papers (2024-09-11T19:31:56Z) - Stratospheric aerosol source inversion: Noise, variability, and uncertainty quantification [0.0]
This article presents a framework for stratospheric aerosol source inversion using a Bayesian approximation error approach.
We leverage specially designed earth system model simulations using the Energy Exascale Earth System Model (E3SM)
A comprehensive framework for data generation, data processing, dimension reduction, operator learning, and Bayesian inversion is presented.
arXiv Detail & Related papers (2024-09-10T20:12:36Z) - MambaDS: Near-Surface Meteorological Field Downscaling with Topography Constrained Selective State Space Modeling [68.69647625472464]
Downscaling, a crucial task in meteorological forecasting, enables the reconstruction of high-resolution meteorological states for target regions.
Previous downscaling methods lacked tailored designs for meteorology and encountered structural limitations.
We propose a novel model called MambaDS, which enhances the utilization of multivariable correlations and topography information.
arXiv Detail & Related papers (2024-08-20T13:45:49Z) - SSL-SoilNet: A Hybrid Transformer-based Framework with Self-Supervised Learning for Large-scale Soil Organic Carbon Prediction [2.554658234030785]
This study introduces a novel approach that aims to learn the geographical link between multimodal features via self-supervised contrastive learning.
The proposed approach has undergone rigorous testing on two distinct large-scale datasets.
arXiv Detail & Related papers (2023-08-07T13:44:44Z) - Improve State-Level Wheat Yield Forecasts in Kazakhstan on GEOGLAM's EO
Data by Leveraging A Simple Spatial-Aware Technique [1.433758865948252]
We propose and investigate a technique called state-wise additive bias to explicitly address the cross-region yield heterogeneity in Kazakhstan.
Our method reduces the overall RMSE by 8.9% and the highest state-wise RMSE by 28.37%.
The effectiveness of state-wise additive bias indicates machine learning's performance can be significantly improved.
arXiv Detail & Related papers (2023-06-01T19:35:13Z) - Multi-scale Digital Twin: Developing a fast and physics-informed
surrogate model for groundwater contamination with uncertain climate models [53.44486283038738]
Climate change exacerbates the long-term soil management problem of groundwater contamination.
We develop a physics-informed machine learning surrogate model using U-Net enhanced Fourier Neural Contaminated (PDENO)
In parallel, we develop a convolutional autoencoder combined with climate data to reduce the dimensionality of climatic region similarities across the United States.
arXiv Detail & Related papers (2022-11-20T06:46:35Z) - Spatiotemporal modeling of European paleoclimate using doubly sparse
Gaussian processes [61.31361524229248]
We build on recent scale sparsetemporal GPs to reduce the computational burden.
We successfully employ such a doubly sparse GP to construct a probabilistic model of paleoclimate.
arXiv Detail & Related papers (2022-11-15T14:15:04Z) - High-resolution landscape-scale biomass mapping using a spatiotemporal
patchwork of LiDAR coverages [0.0]
Estimating forest aboveground biomass at fine scales has become increasingly important for greenhouse gas estimation.
Here we address common obstacles including selection of training data, the investigation of regional or coverage specific bias and error, and map patterns at multiple scales.
Our model was overall accurate (% RMSE 13-33%), had very low bias (MBE $leq$ $pm$5 Mg ha$-1$), explained most field-observed variation.
arXiv Detail & Related papers (2022-05-17T17:53:50Z) - Jalisco's multiclass land cover analysis and classification using a
novel lightweight convnet with real-world multispectral and relief data [51.715517570634994]
We present our novel lightweight (only 89k parameters) Convolution Neural Network (ConvNet) to make LC classification and analysis.
In this work, we combine three real-world open data sources to obtain 13 channels.
Our embedded analysis anticipates the limited performance in some classes and gives us the opportunity to group the most similar.
arXiv Detail & Related papers (2022-01-26T14:58:51Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.