AgroFlux: A Spatial-Temporal Benchmark for Carbon and Nitrogen Flux Prediction in Agricultural Ecosystems
- URL: http://arxiv.org/abs/2602.01614v1
- Date: Mon, 02 Feb 2026 04:04:07 GMT
- Title: AgroFlux: A Spatial-Temporal Benchmark for Carbon and Nitrogen Flux Prediction in Agricultural Ecosystems
- Authors: Qi Cheng, Licheng Liu, Yao Zhang, Mu Hong, Yiqun Xie, Xiaowei Jia,
- Abstract summary: We introduce a first-of-its-kind spatial-temporal agroecosystem GHG benchmark dataset.<n>We evaluate the performance of various sequential deep learning models on carbon and nitrogen flux prediction.<n>Our benchmark dataset and evaluation framework contribute to the development of more accurate and scalable AI-driven agroecosystem models.
- Score: 32.91715282741263
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Agroecosystem, which heavily influenced by human actions and accounts for a quarter of global greenhouse gas emissions (GHGs), plays a crucial role in mitigating global climate change and securing environmental sustainability. However, we can't manage what we can't measure. Accurately quantifying the pools and fluxes in the carbon, nutrient, and water nexus of the agroecosystem is therefore essential for understanding the underlying drivers of GHG and developing effective mitigation strategies. Conventional approaches like soil sampling, process-based models, and black-box machine learning models are facing challenges such as data sparsity, high spatiotemporal heterogeneity, and complex subsurface biogeochemical and physical processes. Developing new trustworthy approaches such as AI-empowered models, will require the AI-ready benchmark dataset and outlined protocols, which unfortunately do not exist. In this work, we introduce a first-of-its-kind spatial-temporal agroecosystem GHG benchmark dataset that integrates physics-based model simulations from Ecosys and DayCent with real-world observations from eddy covariance flux towers and controlled-environment facilities. We evaluate the performance of various sequential deep learning models on carbon and nitrogen flux prediction, including LSTM-based models, temporal CNN-based model, and Transformer-based models. Furthermore, we explored transfer learning to leverage simulated data to improve the generalization of deep learning models on real-world observations. Our benchmark dataset and evaluation framework contribute to the development of more accurate and scalable AI-driven agroecosystem models, advancing our understanding of ecosystem-climate interactions.
Related papers
- Role-Aware Conditional Inference for Spatiotemporal Ecosystem Carbon Flux Prediction [32.547109051574836]
Role-Aware Inference (RACI) is a process-informed learning framework that formulates Conditional ecosystem flux prediction as a conditional problem.<n>By explicitly modeling these distinct functional roles, RACI enables a model to adapt its predictions across diverse environmental regimes.
arXiv Detail & Related papers (2026-03-03T21:29:20Z) - X-MethaneWet: A Cross-scale Global Wetland Methane Emission Benchmark Dataset for Advancing Science Discovery with AI [15.813459313530625]
Methane (CH$_4$) is the second most powerful greenhouse gas after carbon dioxide and plays a crucial role in climate change.<n>We introduce the first-of-its-kind cross-scale global wetland methane benchmark dataset (X-MethaneWet)<n>This dataset can offer opportunities for improving global wetland CH$_4$ modeling and science discovery with new AI algorithms.
arXiv Detail & Related papers (2025-05-23T20:24:09Z) - LLM-based Evaluation Policy Extraction for Ecological Modeling [22.432508855430797]
evaluating ecological time series is critical for benchmarking model performance in many important applications.<n>Traditional numerical metrics fail to capture domain-specific temporal patterns critical to ecological processes.<n>We propose a novel framework that integrates metric learning with large language model (LLM)-based natural language policy extraction.
arXiv Detail & Related papers (2025-05-20T01:02:29Z) - EnviroPiNet: A Physics-Guided AI Model for Predicting Biofilter Performance [0.9895793818721335]
We present the first application of Buckingham Pi theory to modelling biofilter performance.<n>This dimensionality reduction technique identifies meaningful, dimensionless variables that enhance predictive accuracy.<n>We develop the Environmental Buckingham Pi Neural Network (EnviroPiNet), a physics-guided model benchmarked against traditional data-driven methods.
arXiv Detail & Related papers (2025-04-24T13:52:51Z) - Fourier Neural Operator based surrogates for $CO_2$ storage in realistic geologies [57.23978190717341]
We develop a Neural Operator (FNO) based model for real-time, high-resolution simulation of $CO$ plume migration.<n>The model is trained on a comprehensive dataset generated from realistic subsurface parameters.<n>We present various strategies for improving the reliability of predictions from the model, which is crucial while assessing actual geological sites.
arXiv Detail & Related papers (2025-03-14T02:58:24Z) - Machine Learning for Methane Detection and Quantification from Space -- A survey [49.7996292123687]
Methane (CH_4) is a potent anthropogenic greenhouse gas, contributing 86 times more to global warming than Carbon Dioxide (CO_2) over 20 years.
This work expands existing information on operational methane point source detection sensors in the Short-Wave Infrared (SWIR) bands.
It reviews the state-of-the-art for traditional as well as Machine Learning (ML) approaches.
arXiv Detail & Related papers (2024-08-27T15:03:20Z) - Comparing Data-Driven and Mechanistic Models for Predicting Phenology in
Deciduous Broadleaf Forests [47.285748922842444]
We train a deep neural network to predict a phenological index from meteorological time series.
We find that this approach outperforms traditional process-based models.
arXiv Detail & Related papers (2024-01-08T15:29:23Z) - FREE: The Foundational Semantic Recognition for Modeling Environmental Ecosystems [56.0640340392818]
We introduce a framework, FREE, that enables the use of varying features and available information to train a universal model.<n>The core idea is to map available environmental data into a text space and then convert the traditional predictive modeling task in environmental science to a semantic recognition problem.<n>Our evaluation on two societally important real-world applications, stream water temperature prediction and crop yield prediction, demonstrates the superiority of FREE over multiple baselines.
arXiv Detail & Related papers (2023-11-17T00:53:09Z) - SSL-SoilNet: A Hybrid Transformer-based Framework with Self-Supervised Learning for Large-scale Soil Organic Carbon Prediction [2.554658234030785]
This study introduces a novel approach that aims to learn the geographical link between multimodal features via self-supervised contrastive learning.
The proposed approach has undergone rigorous testing on two distinct large-scale datasets.
arXiv Detail & Related papers (2023-08-07T13:44:44Z) - A Comparative Study of Machine Learning Algorithms for Anomaly Detection
in Industrial Environments: Performance and Environmental Impact [62.997667081978825]
This study seeks to address the demands of high-performance machine learning models with environmental sustainability.
Traditional machine learning algorithms, such as Decision Trees and Random Forests, demonstrate robust efficiency and performance.
However, superior outcomes were obtained with optimised configurations, albeit with a commensurate increase in resource consumption.
arXiv Detail & Related papers (2023-07-01T15:18:00Z) - Spatiotemporal modeling of European paleoclimate using doubly sparse
Gaussian processes [61.31361524229248]
We build on recent scale sparsetemporal GPs to reduce the computational burden.
We successfully employ such a doubly sparse GP to construct a probabilistic model of paleoclimate.
arXiv Detail & Related papers (2022-11-15T14:15:04Z) - Real-time high-resolution CO$_2$ geological storage prediction using
nested Fourier neural operators [58.728312684306545]
Carbon capture and storage (CCS) plays an essential role in global decarbonization.
Scaling up CCS deployment requires accurate and high-resolution modeling of the storage reservoir pressure buildup and the gaseous plume migration.
We introduce Nested Fourier Neural Operator (FNO), a machine-learning framework for high-resolution dynamic 3D CO2 storage modeling at a basin scale.
arXiv Detail & Related papers (2022-10-31T04:04:03Z) - Machine Learning for Robust Identification of Complex Nonlinear
Dynamical Systems: Applications to Earth Systems Modeling [8.896888286819635]
Systems exhibiting chaos are ubiquitous across Earth Sciences.
System Identification remains a challenge in climate science.
We consider a chaotic system - two-level Lorenz-96 - used as a benchmark model in the climate science literature.
arXiv Detail & Related papers (2020-08-12T22:37:12Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.