Extended Agriculture-Vision: An Extension of a Large Aerial Image
Dataset for Agricultural Pattern Analysis
- URL: http://arxiv.org/abs/2303.02460v1
- Date: Sat, 4 Mar 2023 17:35:24 GMT
- Title: Extended Agriculture-Vision: An Extension of a Large Aerial Image
Dataset for Agricultural Pattern Analysis
- Authors: Jing Wu, David Pichler, Daniel Marley, David Wilson, Naira Hovakimyan,
Jennifer Hobbs
- Abstract summary: We release an improved version of the Agriculture-Vision dataset (Chiu et al., 2020b)
We extend this dataset with the release of 3600 large, high-resolution (10cm/pixel), full-field, red-green-blue and near-infrared images for pre-training.
We demonstrate the usefulness of this data by benchmarking different contrastive learning approaches on both downstream classification and semantic segmentation tasks.
- Score: 11.133807938044804
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: A key challenge for much of the machine learning work on remote sensing and
earth observation data is the difficulty in acquiring large amounts of
accurately labeled data. This is particularly true for semantic segmentation
tasks, which are much less common in the remote sensing domain because of the
incredible difficulty in collecting precise, accurate, pixel-level annotations
at scale. Recent efforts have addressed these challenges both through the
creation of supervised datasets as well as the application of self-supervised
methods. We continue these efforts on both fronts. First, we generate and
release an improved version of the Agriculture-Vision dataset (Chiu et al.,
2020b) to include raw, full-field imagery for greater experimental flexibility.
Second, we extend this dataset with the release of 3600 large, high-resolution
(10cm/pixel), full-field, red-green-blue and near-infrared images for
pre-training. Third, we incorporate the Pixel-to-Propagation Module Xie et al.
(2021b) originally built on the SimCLR framework into the framework of MoCo-V2
Chen et al.(2020b). Finally, we demonstrate the usefulness of this data by
benchmarking different contrastive learning approaches on both downstream
classification and semantic segmentation tasks. We explore both CNN and Swin
Transformer Liu et al. (2021a) architectures within different frameworks based
on MoCo-V2. Together, these approaches enable us to better detect key
agricultural patterns of interest across a field from aerial imagery so that
farmers may be alerted to problematic areas in a timely fashion to inform their
management decisions. Furthermore, the release of these datasets will support
numerous avenues of research for computer vision in remote sensing for
agriculture.
Related papers
- Plant detection from ultra high resolution remote sensing images: A Semantic Segmentation approach based on fuzzy loss [2.6489824612123716]
We tackle the challenge of identifying plant species from ultra high resolution (UHR) remote sensing images.
Our approach involves introducing an RGB remote sensing dataset, characterized by millimeter-level spatial resolution.
First experimental results obtained on both our UHR dataset and a public dataset are presented, showing the relevance of the proposed methodology.
arXiv Detail & Related papers (2024-08-31T17:40:17Z) - Dehazing Remote Sensing and UAV Imagery: A Review of Deep Learning, Prior-based, and Hybrid Approaches [4.516330345599765]
High-quality images are crucial in remote sensing and UAV applications.
atmospheric haze can severely degrade image quality, making image dehazing a critical research area.
arXiv Detail & Related papers (2024-05-13T07:35:24Z) - Semi-Self-Supervised Domain Adaptation: Developing Deep Learning Models with Limited Annotated Data for Wheat Head Segmentation [0.10923877073891444]
We introduce a semi-self-supervised domain adaptation technique based on deep convolutional neural networks with a probabilistic diffusion process.
We develop a two-branch convolutional encoder-decoder model architecture that uses both synthesized image-mask pairs and unannotated images.
The proposed model achieved a Dice score of 80.7% on an internal test dataset and a Dice score of 64.8% on an external test set.
arXiv Detail & Related papers (2024-05-12T04:35:49Z) - SatSynth: Augmenting Image-Mask Pairs through Diffusion Models for Aerial Semantic Segmentation [69.42764583465508]
We explore the potential of generative image diffusion to address the scarcity of annotated data in earth observation tasks.
To the best of our knowledge, we are the first to generate both images and corresponding masks for satellite segmentation.
arXiv Detail & Related papers (2024-03-25T10:30:22Z) - Explainable AI in Grassland Monitoring: Enhancing Model Performance and
Domain Adaptability [0.6131022957085438]
Grasslands are known for their high biodiversity and ability to provide multiple ecosystem services.
Challenges in automating the identification of indicator plants are key obstacles to large-scale grassland monitoring.
This paper delves into the latter two challenges, with a specific focus on transfer learning and XAI approaches to grassland monitoring.
arXiv Detail & Related papers (2023-12-13T10:17:48Z) - Semantic Segmentation of Vegetation in Remote Sensing Imagery Using Deep
Learning [77.34726150561087]
We propose an approach for creating a multi-modal and large-temporal dataset comprised of publicly available Remote Sensing data.
We use Convolutional Neural Networks (CNN) models that are capable of separating different classes of vegetation.
arXiv Detail & Related papers (2022-09-28T18:51:59Z) - MetaGraspNet: A Large-Scale Benchmark Dataset for Scene-Aware
Ambidextrous Bin Picking via Physics-based Metaverse Synthesis [72.85526892440251]
We introduce MetaGraspNet, a large-scale photo-realistic bin picking dataset constructed via physics-based metaverse synthesis.
The proposed dataset contains 217k RGBD images across 82 different article types, with full annotations for object detection, amodal perception, keypoint detection, manipulation order and ambidextrous grasp labels for a parallel-jaw and vacuum gripper.
We also provide a real dataset consisting of over 2.3k fully annotated high-quality RGBD images, divided into 5 levels of difficulties and an unseen object set to evaluate different object and layout properties.
arXiv Detail & Related papers (2022-08-08T08:15:34Z) - Towards Scale Consistent Monocular Visual Odometry by Learning from the
Virtual World [83.36195426897768]
We propose VRVO, a novel framework for retrieving the absolute scale from virtual data.
We first train a scale-aware disparity network using both monocular real images and stereo virtual data.
The resulting scale-consistent disparities are then integrated with a direct VO system.
arXiv Detail & Related papers (2022-03-11T01:51:54Z) - Superpixels and Graph Convolutional Neural Networks for Efficient
Detection of Nutrient Deficiency Stress from Aerial Imagery [3.6843744304889183]
We seek to identify nutrient deficient areas from remotely sensed data to alert farmers to regions that require attention.
We propose a much lighter graph-based method to perform node-based classification.
This model has 4-orders-of-magnitude fewer parameters than a CNN model and trains in a matter of minutes.
arXiv Detail & Related papers (2021-04-20T21:18:16Z) - X-ModalNet: A Semi-Supervised Deep Cross-Modal Network for
Classification of Remote Sensing Data [69.37597254841052]
We propose a novel cross-modal deep-learning framework called X-ModalNet.
X-ModalNet generalizes well, owing to propagating labels on an updatable graph constructed by high-level features on the top of the network.
We evaluate X-ModalNet on two multi-modal remote sensing datasets (HSI-MSI and HSI-SAR) and achieve a significant improvement in comparison with several state-of-the-art methods.
arXiv Detail & Related papers (2020-06-24T15:29:41Z) - Agriculture-Vision: A Large Aerial Image Database for Agricultural
Pattern Analysis [110.30849704592592]
We present Agriculture-Vision: a large-scale aerial farmland image dataset for semantic segmentation of agricultural patterns.
Each image consists of RGB and Near-infrared (NIR) channels with resolution as high as 10 cm per pixel.
We annotate nine types of field anomaly patterns that are most important to farmers.
arXiv Detail & Related papers (2020-01-05T20:19:33Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.