CWD30: A Comprehensive and Holistic Dataset for Crop Weed Recognition in
Precision Agriculture
- URL: http://arxiv.org/abs/2305.10084v1
- Date: Wed, 17 May 2023 09:39:01 GMT
- Title: CWD30: A Comprehensive and Holistic Dataset for Crop Weed Recognition in
Precision Agriculture
- Authors: Talha Ilyas, Dewa Made Sri Arsa, Khubaib Ahmad, Yong Chae Jeong, Okjae
Won, Jong Hoon Lee, Hyongsuk Kim
- Abstract summary: We present the CWD30 dataset, a large-scale, diverse, holistic, and hierarchical dataset tailored for crop-weed recognition tasks in precision agriculture.
CWD30 comprises over 219,770 high-resolution images of 20 weed species and 10 crop species, encompassing various growth stages, multiple viewing angles, and environmental conditions.
The dataset's hierarchical taxonomy enables fine-grained classification and facilitates the development of more accurate, robust, and generalizable deep learning models.
- Score: 1.64709990449384
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: The growing demand for precision agriculture necessitates efficient and
accurate crop-weed recognition and classification systems. Current datasets
often lack the sample size, diversity, and hierarchical structure needed to
develop robust deep learning models for discriminating crops and weeds in
agricultural fields. Moreover, the similar external structure and phenomics of
crops and weeds complicate recognition tasks. To address these issues, we
present the CWD30 dataset, a large-scale, diverse, holistic, and hierarchical
dataset tailored for crop-weed recognition tasks in precision agriculture.
CWD30 comprises over 219,770 high-resolution images of 20 weed species and 10
crop species, encompassing various growth stages, multiple viewing angles, and
environmental conditions. The images were collected from diverse agricultural
fields across different geographic locations and seasons, ensuring a
representative dataset. The dataset's hierarchical taxonomy enables
fine-grained classification and facilitates the development of more accurate,
robust, and generalizable deep learning models. We conduct extensive baseline
experiments to validate the efficacy of the CWD30 dataset. Our experiments
reveal that the dataset poses significant challenges due to intra-class
variations, inter-class similarities, and data imbalance. Additionally, we
demonstrate that minor training modifications like using CWD30 pretrained
backbones can significantly enhance model performance and reduce convergence
time, saving training resources on several downstream tasks. These challenges
provide valuable insights and opportunities for future research in crop-weed
recognition. We believe that the CWD30 dataset will serve as a benchmark for
evaluating crop-weed recognition algorithms, promoting advancements in
precision agriculture, and fostering collaboration among researchers in the
field.
Related papers
- Self-supervised transformer-based pre-training method with General Plant Infection dataset [3.969851116372513]
This study proposes an advanced network architecture that combines Contrastive Learning and Masked Image Modeling (MIM)
The proposed network architecture demonstrates effectiveness in addressing plant pest and disease recognition tasks, achieving notable detection accuracy.
Our code and dataset will be publicly available to advance research in plant pest and disease recognition.
arXiv Detail & Related papers (2024-07-20T15:48:35Z) - Generating Diverse Agricultural Data for Vision-Based Farming Applications [74.79409721178489]
This model is capable of simulating distinct growth stages of plants, diverse soil conditions, and randomized field arrangements under varying lighting conditions.
Our dataset includes 12,000 images with semantic labels, offering a comprehensive resource for computer vision tasks in precision agriculture.
arXiv Detail & Related papers (2024-03-27T08:42:47Z) - HarvestNet: A Dataset for Detecting Smallholder Farming Activity Using
Harvest Piles and Remote Sensing [50.4506590177605]
HarvestNet is a dataset for mapping the presence of farms in the Ethiopian regions of Tigray and Amhara during 2020-2023.
We introduce a new approach based on the detection of harvest piles characteristic of many smallholder systems.
We conclude that remote sensing of harvest piles can contribute to more timely and accurate cropland assessments in food insecure regions.
arXiv Detail & Related papers (2023-08-23T11:03:28Z) - PhenoBench -- A Large Dataset and Benchmarks for Semantic Image Interpretation in the Agricultural Domain [29.395926321984565]
We present an annotated dataset and benchmarks for the semantic interpretation of real agricultural fields.
Our dataset recorded with a UAV provides high-quality, pixel-wise annotations of crops and weeds, but also crop leaf instances at the same time.
We provide benchmarks for various tasks on a hidden test set comprised of different fields.
arXiv Detail & Related papers (2023-06-07T16:04:08Z) - Concept Drift and Long-Tailed Distribution in Fine-Grained Visual Categorization: Benchmark and Method [84.68818879525568]
We present a Concept Drift and Long-Tailed Distribution dataset.
The characteristics of instances tend to vary with time and exhibit a long-tailed distribution.
We propose a feature recombination framework to address the learning challenges associated with CDLT.
arXiv Detail & Related papers (2023-06-04T12:42:45Z) - Agave crop segmentation and maturity classification with deep learning
data-centric strategies using very high-resolution satellite imagery [101.18253437732933]
We present an Agave tequilana Weber azul crop segmentation and maturity classification using very high resolution satellite imagery.
We solve real-world deep learning problems in the very specific context of agave crop segmentation.
With the resulting accurate models, agave production forecasting can be made available for large regions.
arXiv Detail & Related papers (2023-03-21T03:15:29Z) - TRoVE: Transforming Road Scene Datasets into Photorealistic Virtual
Environments [84.6017003787244]
This work proposes a synthetic data generation pipeline to address the difficulties and domain-gaps present in simulated datasets.
We show that using annotations and visual cues from existing datasets, we can facilitate automated multi-modal data generation.
arXiv Detail & Related papers (2022-08-16T20:46:08Z) - Weed Recognition using Deep Learning Techniques on Class-imbalanced
Imagery [4.96981595868944]
We have investigated five state-of-the-art deep neural networks and evaluated their performance for weed recognition.
VGG16 performed better than others on small-scale datasets, while ResNet-50 performed better than other deep networks on the large combined dataset.
arXiv Detail & Related papers (2021-12-15T01:00:05Z) - Enlisting 3D Crop Models and GANs for More Data Efficient and
Generalizable Fruit Detection [0.0]
We propose a method that generates agricultural images from a synthetic 3D crop model domain into real world crop domains.
The method uses a semantically constrained GAN (generative adversarial network) to preserve the fruit position and geometry.
Incremental training experiments in vineyard grape detection tasks show that the images generated from our method can significantly speed the domain process.
arXiv Detail & Related papers (2021-08-30T16:11:59Z) - Agriculture-Vision: A Large Aerial Image Database for Agricultural
Pattern Analysis [110.30849704592592]
We present Agriculture-Vision: a large-scale aerial farmland image dataset for semantic segmentation of agricultural patterns.
Each image consists of RGB and Near-infrared (NIR) channels with resolution as high as 10 cm per pixel.
We annotate nine types of field anomaly patterns that are most important to farmers.
arXiv Detail & Related papers (2020-01-05T20:19:33Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.