Adaptive Fusion of Multi-view Remote Sensing data for Optimal Sub-field
Crop Yield Prediction
- URL: http://arxiv.org/abs/2401.11844v1
- Date: Mon, 22 Jan 2024 11:01:52 GMT
- Title: Adaptive Fusion of Multi-view Remote Sensing data for Optimal Sub-field
Crop Yield Prediction
- Authors: Francisco Mena, Deepak Pathak, Hiba Najjar, Cristhian Sanchez, Patrick
Helber, Benjamin Bischke, Peter Habelitz, Miro Miranda, Jayanth Siddamsetty,
Marlon Nuske, Marcela Charfuelan, Diego Arenas, Michaela Vollmer, Andreas
Dengel
- Abstract summary: We present a novel multi-view learning approach to predict crop yield for different crops (soybean, wheat, rapeseed) and regions (Argentina, Uruguay, and Germany).
Our input data includes multi-spectral optical images from Sentinel-2 satellites and weather data as dynamic features during the crop growing season, complemented by static features like soil properties and topographic information.
To effectively fuse the data, we introduce a Multi-view Gated Fusion (MVGF) model, comprising dedicated view-encoders and a Gated Unit (GU) module.
The MVGF model is trained at sub-field level with 10 m resolution
- Score: 24.995959334158986
- License: http://creativecommons.org/licenses/by-nc-nd/4.0/
- Abstract: Accurate crop yield prediction is of utmost importance for informed
decision-making in agriculture, aiding farmers, and industry stakeholders.
However, this task is complex and depends on multiple factors, such as
environmental conditions, soil properties, and management practices. Combining
heterogeneous data views poses a fusion challenge, like identifying the
view-specific contribution to the predictive task. We present a novel
multi-view learning approach to predict crop yield for different crops
(soybean, wheat, rapeseed) and regions (Argentina, Uruguay, and Germany). Our
multi-view input data includes multi-spectral optical images from Sentinel-2
satellites and weather data as dynamic features during the crop growing season,
complemented by static features like soil properties and topographic
information. To effectively fuse the data, we introduce a Multi-view Gated
Fusion (MVGF) model, comprising dedicated view-encoders and a Gated Unit (GU)
module. The view-encoders handle the heterogeneity of data sources with varying
temporal resolutions by learning a view-specific representation. These
representations are adaptively fused via a weighted sum. The fusion weights are
computed for each sample by the GU using a concatenation of the
view-representations. The MVGF model is trained at sub-field level with 10 m
resolution pixels. Our evaluations show that the MVGF outperforms conventional
models on the same task, achieving the best results by incorporating all the
data sources, unlike the usual fusion results in the literature. For Argentina,
the MVGF model achieves an R2 value of 0.68 at sub-field yield prediction,
while at field level evaluation (comparing field averages), it reaches around
0.80 across different countries. The GU module learned different weights based
on the country and crop-type, aligning with the variable significance of each
data source to the prediction task.
Related papers
- EarthView: A Large Scale Remote Sensing Dataset for Self-Supervision [72.84868704100595]
This paper presents a dataset specifically designed for self-supervision on remote sensing data, intended to enhance deep learning applications on Earth monitoring tasks.
The dataset spans 15 tera pixels of global remote-sensing data, combining imagery from a diverse range of sources, including NEON, Sentinel, and a novel release of 1m spatial resolution data from Satellogic.
Accompanying the dataset is EarthMAE, a tailored Masked Autoencoder developed to tackle the distinct challenges of remote sensing data.
arXiv Detail & Related papers (2025-01-14T13:42:22Z) - CMAViT: Integrating Climate, Managment, and Remote Sensing Data for Crop Yield Estimation with Multimodel Vision Transformers [0.0]
We introduce a deep learning-based multi-model called Climate-Management Aware Vision Transformer (CMAViT)
CMAViT integrates both spatial and temporal data by leveraging remote sensing imagery and short-term meteorological data.
It outperforms traditional models like UNet-ConvLSTM, excelling in spatial variability capture and yield prediction.
arXiv Detail & Related papers (2024-11-25T23:34:53Z) - Fields of The World: A Machine Learning Benchmark Dataset For Global Agricultural Field Boundary Segmentation [12.039406240082515]
Fields of The World (FTW) is a novel benchmark dataset for agricultural field instance segmentation.
FTW is an order of magnitude larger than previous datasets with 70,462 samples.
We show that models trained on FTW have better zero-shot and fine-tuning performance in held-out countries.
arXiv Detail & Related papers (2024-09-24T17:20:58Z) - Img-Diff: Contrastive Data Synthesis for Multimodal Large Language Models [49.439311430360284]
We introduce a novel data synthesis method inspired by contrastive learning and image difference captioning.
Our key idea involves challenging the model to discern both matching and distinct elements.
We leverage this generated dataset to fine-tune state-of-the-art (SOTA) MLLMs.
arXiv Detail & Related papers (2024-08-08T17:10:16Z) - T1: Scaling Diffusion Probabilistic Fields to High-Resolution on Unified
Visual Modalities [69.16656086708291]
Diffusion Probabilistic Field (DPF) models the distribution of continuous functions defined over metric spaces.
We propose a new model comprising of a view-wise sampling algorithm to focus on local structure learning.
The model can be scaled to generate high-resolution data while unifying multiple modalities.
arXiv Detail & Related papers (2023-05-24T03:32:03Z) - Revisiting the Evaluation of Image Synthesis with GANs [55.72247435112475]
This study presents an empirical investigation into the evaluation of synthesis performance, with generative adversarial networks (GANs) as a representative of generative models.
In particular, we make in-depth analyses of various factors, including how to represent a data point in the representation space, how to calculate a fair distance using selected samples, and how many instances to use from each set.
arXiv Detail & Related papers (2023-04-04T17:54:32Z) - MuRAG: Multimodal Retrieval-Augmented Generator for Open Question
Answering over Images and Text [58.655375327681774]
We propose the first Multimodal Retrieval-Augmented Transformer (MuRAG)
MuRAG accesses an external non-parametric multimodal memory to augment language generation.
Our results show that MuRAG achieves state-of-the-art accuracy, outperforming existing models by 10-20% absolute on both datasets.
arXiv Detail & Related papers (2022-10-06T13:58:03Z) - Aggregated Multi-output Gaussian Processes with Knowledge Transfer
Across Domains [39.25639417233822]
This article offers a multi-output Gaussian process (MoGP) model that infers functions for attributes using multiple aggregate datasets of respective granularities.
Experiments demonstrate that the proposed model outperforms in the task of refining coarse-grained aggregate data on real-world datasets.
arXiv Detail & Related papers (2022-06-24T08:07:20Z) - ViTAEv2: Vision Transformer Advanced by Exploring Inductive Bias for
Image Recognition and Beyond [76.35955924137986]
We propose a Vision Transformer Advanced by Exploring intrinsic IB from convolutions, i.e., ViTAE.
ViTAE has several spatial pyramid reduction modules to downsample and embed the input image into tokens with rich multi-scale context.
We obtain the state-of-the-art classification performance, i.e., 88.5% Top-1 classification accuracy on ImageNet validation set and the best 91.2% Top-1 accuracy on ImageNet real validation set.
arXiv Detail & Related papers (2022-02-21T10:40:05Z) - Meta-Learning for Few-Shot Land Cover Classification [3.8529010979482123]
We evaluate the model-agnostic meta-learning (MAML) algorithm on classification and segmentation tasks.
We find that few-shot model adaptation outperforms pre-training with regular gradient descent.
This indicates that model optimization with meta-learning may benefit tasks in the Earth sciences.
arXiv Detail & Related papers (2020-04-28T09:42:41Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.