Deep learning-based approach for tomato classification in complex scenes
- URL: http://arxiv.org/abs/2401.15055v1
- Date: Fri, 26 Jan 2024 18:33:57 GMT
- Title: Deep learning-based approach for tomato classification in complex scenes
- Authors: Mikael A. Mousse, Bethel C. A. R. K. Atohoun, Cina Motamed
- Abstract summary: We have proposed a tomato ripening monitoring approach based on deep learning in complex scenes.
The objective is to detect mature tomatoes and harvest them in a timely manner.
Experiments are based on images collected from the internet gathered through searches using tomato state across diverse languages.
- Score: 0.8287206589886881
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Tracking ripening tomatoes is time consuming and labor intensive. Artificial
intelligence technologies combined with those of computer vision can help users
optimize the process of monitoring the ripening status of plants. To this end,
we have proposed a tomato ripening monitoring approach based on deep learning
in complex scenes. The objective is to detect mature tomatoes and harvest them
in a timely manner. The proposed approach is declined in two parts. Firstly,
the images of the scene are transmitted to the pre-processing layer. This
process allows the detection of areas of interest (area of the image containing
tomatoes). Then, these images are used as input to the maturity detection
layer. This layer, based on a deep neural network learning algorithm,
classifies the tomato thumbnails provided to it in one of the following five
categories: green, brittle, pink, pale red, mature red. The experiments are
based on images collected from the internet gathered through searches using
tomato state across diverse languages including English, German, French, and
Spanish. The experimental results of the maturity detection layer on a dataset
composed of images of tomatoes taken under the extreme conditions, gave a good
classification rate.
Related papers
- Tomato Maturity Recognition with Convolutional Transformers [5.220581005698766]
Authors propose a novel method for tomato maturity classification using a convolutional transformer.
New tomato dataset named KUTomaData is designed to train deep-learning models for tomato segmentation and classification.
Authors show that the convolutional transformer outperforms state-of-the-art methods for tomato maturity classification.
arXiv Detail & Related papers (2023-07-04T07:33:53Z) - Detection of Tomato Ripening Stages using Yolov3-tiny [0.0]
We use a neural network-based model for tomato classification and detection.
Our experiments showed an f1-score of 90.0% in the localization and classification of ripening stages in a custom dataset.
arXiv Detail & Related papers (2023-02-01T00:57:58Z) - Deep Image Deblurring: A Survey [165.32391279761006]
Deblurring is a classic problem in low-level computer vision, which aims to recover a sharp image from a blurred input image.
Recent advances in deep learning have led to significant progress in solving this problem.
arXiv Detail & Related papers (2022-01-26T01:31:30Z) - Is Deep Image Prior in Need of a Good Education? [57.3399060347311]
Deep image prior was introduced as an effective prior for image reconstruction.
Despite its impressive reconstructive properties, the approach is slow when compared to learned or traditional reconstruction techniques.
We develop a two-stage learning paradigm to address the computational challenge.
arXiv Detail & Related papers (2021-11-23T15:08:26Z) - Evaluating the Single-Shot MultiBox Detector and YOLO Deep Learning
Models for the Detection of Tomatoes in a Greenhouse [2.949270275392492]
This paper contributes with an annotated visual dataset of green and reddish tomatoes.
Considering our robotic platform specifications, only the Single-Shot MultiBox Detector (SSD) and YOLO architectures were considered.
The results proved that the system can detect green and reddish tomatoes, even those occluded by leaves.
arXiv Detail & Related papers (2021-09-02T09:39:12Z) - Geometry-Based Grasping of Vine Tomatoes [6.547498821163685]
We propose a geometry-based grasping method for vine tomatoes.
It relies on a computer-vision pipeline to identify the required geometric features of the tomatoes and of the truss stem.
The grasping method then uses a geometric model of the robotic hand and the truss to determine a suitable grasping location on the stem.
arXiv Detail & Related papers (2021-03-01T19:33:51Z) - Split then Refine: Stacked Attention-guided ResUNets for Blind Single
Image Visible Watermark Removal [69.92767260794628]
Previous watermark removal methods require to gain the watermark location from users or train a multi-task network to recover the background indiscriminately.
We propose a novel two-stage framework with a stacked attention-guided ResUNets to simulate the process of detection, removal and refinement.
We extensively evaluate our algorithm over four different datasets under various settings and the experiments show that our approach outperforms other state-of-the-art methods by a large margin.
arXiv Detail & Related papers (2020-12-13T09:05:37Z) - Remote Sensing Image Scene Classification Meets Deep Learning:
Challenges, Methods, Benchmarks, and Opportunities [81.29441139530844]
This paper provides a systematic survey of deep learning methods for remote sensing image scene classification by covering more than 160 papers.
We discuss the main challenges of remote sensing image scene classification and survey.
We introduce the benchmarks used for remote sensing image scene classification and summarize the performance of more than two dozen representative algorithms.
arXiv Detail & Related papers (2020-05-03T14:18:00Z) - Counting of Grapevine Berries in Images via Semantic Segmentation using
Convolutional Neural Networks [5.826324731529213]
We present an objective framework based on automatic image analysis which works on two different training systems.
The framework uses a convolutional neural network to detect single berries in images by performing a semantic segmentation.
We achieve an accuracy for the berry detection of 94.0% in the VSP and 85.6% in the SMPH.
arXiv Detail & Related papers (2020-04-29T08:10:19Z) - Self-Supervised Viewpoint Learning From Image Collections [116.56304441362994]
We propose a novel learning framework which incorporates an analysis-by-synthesis paradigm to reconstruct images in a viewpoint aware manner.
We show that our approach performs competitively to fully-supervised approaches for several object categories like human faces, cars, buses, and trains.
arXiv Detail & Related papers (2020-04-03T22:01:41Z) - Learning Depth With Very Sparse Supervision [57.911425589947314]
This paper explores the idea that perception gets coupled to 3D properties of the world via interaction with the environment.
We train a specialized global-local network architecture with what would be available to a robot interacting with the environment.
Experiments on several datasets show that, when ground truth is available even for just one of the image pixels, the proposed network can learn monocular dense depth estimation up to 22.5% more accurately than state-of-the-art approaches.
arXiv Detail & Related papers (2020-03-02T10:44:13Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.