Rice Leaf Disease Detection: A Comparative Study Between CNN, Transformer and Non-neural Network Architectures
- URL: http://arxiv.org/abs/2501.06740v1
- Date: Sun, 12 Jan 2025 07:29:52 GMT
- Title: Rice Leaf Disease Detection: A Comparative Study Between CNN, Transformer and Non-neural Network Architectures
- Authors: Samia Mehnaz, Md. Touhidul Islam,
- Abstract summary: We study the various computer vision techniques for Bangladeshi rice leaf disease detection.
Among the models tested, ResNet50 exhibited the best performance over other CNN and transformer-based models.
- Score: 1.534667887016089
- License:
- Abstract: In nations such as Bangladesh, agriculture plays a vital role in providing livelihoods for a significant portion of the population. Identifying and classifying plant diseases early is critical to prevent their spread and minimize their impact on crop yield and quality. Various computer vision techniques can be used for such detection and classification. While CNNs have been dominant on such image classification tasks, vision transformers has become equally good in recent time also. In this paper we study the various computer vision techniques for Bangladeshi rice leaf disease detection. We use the Dhan-Shomadhan -- a Bangladeshi rice leaf disease dataset, to experiment with various CNN and ViT models. We also compared the performance of such deep neural network architecture with traditional machine learning architecture like Support Vector Machine(SVM). We leveraged transfer learning for better generalization with lower amount of training data. Among the models tested, ResNet50 exhibited the best performance over other CNN and transformer-based models making it the optimal choice for this task.
Related papers
- PotatoGANs: Utilizing Generative Adversarial Networks, Instance Segmentation, and Explainable AI for Enhanced Potato Disease Identification and Classification [0.0]
Our research employs a novel approach termed as PotatoGANs.
Two types of Generative Adversarial Networks (GANs) are utilized to generate synthetic potato disease images.
Using the Inception score as a measure, our experiments show the better quality and realisticness of the images created by PotatoGANs.
arXiv Detail & Related papers (2024-05-12T17:00:52Z) - Machine Learning-Based Jamun Leaf Disease Detection: A Comprehensive
Review [0.0]
Jamun leaf diseases pose a significant threat to agricultural productivity.
The advent of machine learning has opened up new avenues for tackling these diseases effectively.
Various automated systems have been implemented for similar types of disease detection using image processing techniques.
arXiv Detail & Related papers (2023-11-27T11:46:30Z) - Machine Learning-Based Tea Leaf Disease Detection: A Comprehensive
Review [3.3916160303055563]
Tea leaf diseases are a major challenge to agricultural productivity, with far-reaching implications for yield and quality in the tea industry.
The rise of machine learning has enabled the development of innovative approaches to combat these diseases.
For predicting tea leaf disease, several automated systems have already been developed using different image processing techniques.
arXiv Detail & Related papers (2023-11-06T16:30:40Z) - Defect Classification in Additive Manufacturing Using CNN-Based Vision
Processing [76.72662577101988]
This paper examines two scenarios: first, using convolutional neural networks (CNNs) to accurately classify defects in an image dataset from AM and second, applying active learning techniques to the developed classification model.
This allows the construction of a human-in-the-loop mechanism to reduce the size of the data required to train and generate training data.
arXiv Detail & Related papers (2023-07-14T14:36:58Z) - Explainable vision transformer enabled convolutional neural network for
plant disease identification: PlantXViT [11.623005206620498]
Plant diseases are the primary cause of crop losses globally, with an impact on the world economy.
In this study, a Vision Transformer enabled Convolutional Neural Network model called "PlantXViT" is proposed for plant disease identification.
The proposed model has a lightweight structure with only 0.8 million trainable parameters, which makes it suitable for IoT-based smart agriculture services.
arXiv Detail & Related papers (2022-07-16T12:05:06Z) - Facilitated machine learning for image-based fruit quality assessment in
developing countries [68.8204255655161]
Automated image classification is a common task for supervised machine learning in food science.
We propose an alternative method based on pre-trained vision transformers (ViTs)
It can be easily implemented with limited resources on a standard device.
arXiv Detail & Related papers (2022-07-10T19:52:20Z) - Vision Transformers For Weeds and Crops Classification Of High
Resolution UAV Images [3.1083892213758104]
Vision Transformer (ViT) models can achieve competitive or better results without applying any convolution operations.
Our experiments show that with small set of labelled training data, ViT models perform better compared to state-of-the-art CNN-based models.
arXiv Detail & Related papers (2021-09-06T19:58:54Z) - Vision Transformers are Robust Learners [65.91359312429147]
We study the robustness of the Vision Transformer (ViT) against common corruptions and perturbations, distribution shifts, and natural adversarial examples.
We present analyses that provide both quantitative and qualitative indications to explain why ViTs are indeed more robust learners.
arXiv Detail & Related papers (2021-05-17T02:39:22Z) - BreakingBED -- Breaking Binary and Efficient Deep Neural Networks by
Adversarial Attacks [65.2021953284622]
We study robustness of CNNs against white-box and black-box adversarial attacks.
Results are shown for distilled CNNs, agent-based state-of-the-art pruned models, and binarized neural networks.
arXiv Detail & Related papers (2021-03-14T20:43:19Z) - Classification of COVID-19 in CT Scans using Multi-Source Transfer
Learning [91.3755431537592]
We propose the use of Multi-Source Transfer Learning to improve upon traditional Transfer Learning for the classification of COVID-19 from CT scans.
With our multi-source fine-tuning approach, our models outperformed baseline models fine-tuned with ImageNet.
Our best performing model was able to achieve an accuracy of 0.893 and a Recall score of 0.897, outperforming its baseline Recall score by 9.3%.
arXiv Detail & Related papers (2020-09-22T11:53:06Z) - Curriculum By Smoothing [52.08553521577014]
Convolutional Neural Networks (CNNs) have shown impressive performance in computer vision tasks such as image classification, detection, and segmentation.
We propose an elegant curriculum based scheme that smoothes the feature embedding of a CNN using anti-aliasing or low-pass filters.
As the amount of information in the feature maps increases during training, the network is able to progressively learn better representations of the data.
arXiv Detail & Related papers (2020-03-03T07:27:44Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.