Hierarchical Approach for Joint Semantic, Plant Instance, and Leaf
Instance Segmentation in the Agricultural Domain
- URL: http://arxiv.org/abs/2210.07879v2
- Date: Wed, 14 Jun 2023 14:24:57 GMT
- Title: Hierarchical Approach for Joint Semantic, Plant Instance, and Leaf
Instance Segmentation in the Agricultural Domain
- Authors: Gianmarco Roggiolani, Matteo Sodano, Tiziano Guadagnino, Federico
Magistri, Jens Behley, Cyrill Stachniss
- Abstract summary: Plant phenotyping is a central task in agriculture, as it describes plants' growth stage, development, and other relevant quantities.
In this paper, we address the problem of joint semantic, plant instance, and leaf instance segmentation of crop fields from RGB data.
We propose a single convolutional neural network that addresses the three tasks simultaneously, exploiting their underlying hierarchical structure.
- Score: 29.647846446064992
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Plant phenotyping is a central task in agriculture, as it describes plants'
growth stage, development, and other relevant quantities. Robots can help
automate this process by accurately estimating plant traits such as the number
of leaves, leaf area, and the plant size. In this paper, we address the problem
of joint semantic, plant instance, and leaf instance segmentation of crop
fields from RGB data. We propose a single convolutional neural network that
addresses the three tasks simultaneously, exploiting their underlying
hierarchical structure. We introduce task-specific skip connections, which our
experimental evaluation proves to be more beneficial than the usual schemes. We
also propose a novel automatic post-processing, which explicitly addresses the
problem of spatially close instances, common in the agricultural domain because
of overlapping leaves. Our architecture simultaneously tackles these problems
jointly in the agricultural context. Previous works either focus on plant or
leaf segmentation, or do not optimise for semantic segmentation. Results show
that our system has superior performance compared to state-of-the-art
approaches, while having a reduced number of parameters and is operating at
camera frame rate.
Related papers
- GMT: Guided Mask Transformer for Leaf Instance Segmentation [14.458970589296554]
Leaf instance segmentation is a challenging task, aiming to separate and delineate each leaf in an image of a plant.
We propose the Guided Mask Transformer (GMT), which leverages and integrates leaf spatial distribution priors into a Transformer-based segmentor.
Our GMT consistently outperforms the state-of-the-art on three public plant datasets.
arXiv Detail & Related papers (2024-06-24T19:52:27Z) - Generating Diverse Agricultural Data for Vision-Based Farming Applications [74.79409721178489]
This model is capable of simulating distinct growth stages of plants, diverse soil conditions, and randomized field arrangements under varying lighting conditions.
Our dataset includes 12,000 images with semantic labels, offering a comprehensive resource for computer vision tasks in precision agriculture.
arXiv Detail & Related papers (2024-03-27T08:42:47Z) - Unsupervised Pre-Training for 3D Leaf Instance Segmentation [34.122575664767915]
This paper addresses the problem of reducing the labeling effort required to perform leaf instance segmentation on 3D point clouds.
We propose a novel self-supervised task-specific pre-training approach to initialize the backbone of a network for leaf instance segmentation.
We also introduce a novel automatic postprocessing that considers the difficulty of correctly segmenting the points close to the stem.
arXiv Detail & Related papers (2024-01-16T08:11:08Z) - Zoom in on the Plant: Fine-grained Analysis of Leaf, Stem and Vein
Instances [3.399289369740637]
We develop a model to extract fine-grained phenotypic information, such as leaf-, stem-, and vein instances.
The underlying dataset RumexLeaves is made publicly available and is the first of its kind.
We introduce an adapted metric POKS complying with the concept of keypoint-guided polylines.
arXiv Detail & Related papers (2023-12-14T10:45:54Z) - Domain Generalization for Crop Segmentation with Standardized Ensemble Knowledge Distillation [42.39035033967183]
Service robots need a real-time perception system that understands their surroundings and identifies their targets in the wild.
Existing methods, however, often fall short in generalizing to new crops and environmental conditions.
We propose a novel approach to enhance domain generalization using knowledge distillation.
arXiv Detail & Related papers (2023-04-03T14:28:29Z) - Semantic Image Segmentation with Deep Learning for Vine Leaf Phenotyping [59.0626764544669]
In this study, we use Deep Learning methods to semantically segment grapevine leaves images in order to develop an automated object detection system for leaf phenotyping.
Our work contributes to plant lifecycle monitoring through which dynamic traits such as growth and development can be captured and quantified.
arXiv Detail & Related papers (2022-10-24T14:37:09Z) - Graph Reasoning Transformer for Image Parsing [67.76633142645284]
We propose a novel Graph Reasoning Transformer (GReaT) for image parsing to enable image patches to interact following a relation reasoning pattern.
Compared to the conventional transformer, GReaT has higher interaction efficiency and a more purposeful interaction pattern.
Results show that GReaT achieves consistent performance gains with slight computational overheads on the state-of-the-art transformer baselines.
arXiv Detail & Related papers (2022-09-20T08:21:37Z) - Iterative Scene Graph Generation [55.893695946885174]
Scene graph generation involves identifying object entities and their corresponding interaction predicates in a given image (or video)
Existing approaches to scene graph generation assume certain factorization of the joint distribution to make the estimation iteration feasible.
We propose a novel framework that addresses this limitation, as well as introduces dynamic conditioning on the image.
arXiv Detail & Related papers (2022-07-27T10:37:29Z) - A Trainable Optimal Transport Embedding for Feature Aggregation and its
Relationship to Attention [96.77554122595578]
We introduce a parametrized representation of fixed size, which embeds and then aggregates elements from a given input set according to the optimal transport plan between the set and a trainable reference.
Our approach scales to large datasets and allows end-to-end training of the reference, while also providing a simple unsupervised learning mechanism with small computational cost.
arXiv Detail & Related papers (2020-06-22T08:35:58Z) - Two-View Fine-grained Classification of Plant Species [66.75915278733197]
We propose a novel method based on a two-view leaf image representation and a hierarchical classification strategy for fine-grained recognition of plant species.
A deep metric based on Siamese convolutional neural networks is used to reduce the dependence on a large number of training samples and make the method scalable to new plant species.
arXiv Detail & Related papers (2020-05-18T21:57:47Z) - Deep Transfer Learning For Plant Center Localization [19.322420819302263]
This paper investigates methods that estimate plant locations for a field-based crop using RGB aerial images captured using Unmanned Aerial Vehicles (UAVs)
Deep learning approaches provide promising capability for locating plants observed in RGB images, but they require large quantities of labeled data (ground truth) for training.
We propose a method for estimating plant centers by transferring an existing model to a new scenario using limited ground truth data.
arXiv Detail & Related papers (2020-04-29T06:29:49Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.