PotatoPestNet: A CTInceptionV3-RS-Based Neural Network for Accurate
Identification of Potato Pests
- URL: http://arxiv.org/abs/2306.06206v2
- Date: Sat, 15 Jul 2023 10:40:26 GMT
- Title: PotatoPestNet: A CTInceptionV3-RS-Based Neural Network for Accurate
Identification of Potato Pests
- Authors: Md. Simul Hasan Talukder, Rejwan Bin Sulaiman, Mohammad Raziuddin
Chowdhury, Musarrat Saberin Nipun, Taminul Islam
- Abstract summary: We propose an efficient PotatoPestNet AI-based automatic potato pest identification system.
We leveraged the power of transfer learning by employing five customized, pre-trained transfer learning models.
Among the models, the Customized Tuned Inception V3 model, optimized through random search, demonstrated outstanding performance.
- Score: 0.0
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Potatoes are the third-largest food crop globally, but their production
frequently encounters difficulties because of aggressive pest infestations. The
aim of this study is to investigate the various types and characteristics of
these pests and propose an efficient PotatoPestNet AI-based automatic potato
pest identification system. To accomplish this, we curated a reliable dataset
consisting of eight types of potato pests. We leveraged the power of transfer
learning by employing five customized, pre-trained transfer learning models:
CMobileNetV2, CNASLargeNet, CXception, CDenseNet201, and CInceptionV3, in
proposing a robust PotatoPestNet model to accurately classify potato pests. To
improve the models' performance, we applied various augmentation techniques,
incorporated a global average pooling layer, and implemented proper
regularization methods. To further enhance the performance of the models, we
utilized random search (RS) optimization for hyperparameter tuning. This
optimization technique played a significant role in fine-tuning the models and
achieving improved performance. We evaluated the models both visually and
quantitatively, utilizing different evaluation metrics. The robustness of the
models in handling imbalanced datasets was assessed using the Receiver
Operating Characteristic (ROC) curve. Among the models, the Customized Tuned
Inception V3 (CTInceptionV3) model, optimized through random search,
demonstrated outstanding performance. It achieved the highest accuracy (91%),
precision (91%), recall (91%), and F1-score (91%), showcasing its superior
ability to accurately identify and classify potato pests.
Related papers
- Enhanced Infield Agriculture with Interpretable Machine Learning Approaches for Crop Classification [0.49110747024865004]
This research evaluates four different approaches for crop classification, namely traditional ML with handcrafted feature extraction methods like SIFT, ORB, and Color Histogram; Custom Designed CNN and established DL architecture like AlexNet; transfer learning on five models pre-trained using ImageNet.
Xception outperformed all of them in terms of generalization, achieving 98% accuracy on the test data, with a model size of 80.03 MB and a prediction time of 0.0633 seconds.
arXiv Detail & Related papers (2024-08-22T14:20:34Z) - Advancing Green AI: Efficient and Accurate Lightweight CNNs for Rice Leaf Disease Identification [0.0]
Rice plays a vital role as a primary food source for over half of the world's population.
In this study, we explore three mobile-compatible CNN architectures for rice leaf disease classification.
The best performance was achieved by the EfficientNet-B0 model with an accuracy of 99.8%.
arXiv Detail & Related papers (2024-08-03T11:16:00Z) - SugarcaneNet: An Optimized Ensemble of LASSO-Regularized Pre-trained Models for Accurate Disease Classification [0.46180371154032906]
sugarcaneNet2024 is a unique model that outperforms previous methods for automatically and quickly detecting sugarcane disease.
Our proposed model consolidates an optimized weighted average ensemble of seven customized and LASSO-regularized pre-trained models.
This optimized sugarcaneNet2024 model performed the best for detecting sugarcane diseases, having achieved accuracy, precision, recall, and F1 score of 99.67%, 100%, 100%, and 100%.
arXiv Detail & Related papers (2024-03-26T11:23:08Z) - Self-supervised Feature Adaptation for 3D Industrial Anomaly Detection [59.41026558455904]
We focus on multi-modal anomaly detection. Specifically, we investigate early multi-modal approaches that attempted to utilize models pre-trained on large-scale visual datasets.
We propose a Local-to-global Self-supervised Feature Adaptation (LSFA) method to finetune the adaptors and learn task-oriented representation toward anomaly detection.
arXiv Detail & Related papers (2024-01-06T07:30:41Z) - Efficient Apple Maturity and Damage Assessment: A Lightweight Detection
Model with GAN and Attention Mechanism [7.742643088073472]
This study proposes a method based on lightweight convolutional neural networks (CNN) and generative adversarial networks (GAN)
In apple ripeness grading detection, the proposed model achieves 95.6%, 93.8%, 95.0%, and 56.5 in precision, recall, accuracy, and FPS, respectively.
In apple damage level detection, the proposed model reaches 95.3%, 93.7%, and 94.5% in precision, recall, and mAP, respectively.
arXiv Detail & Related papers (2023-10-13T18:22:30Z) - E^2VPT: An Effective and Efficient Approach for Visual Prompt Tuning [55.50908600818483]
Fine-tuning large-scale pretrained vision models for new tasks has become increasingly parameter-intensive.
We propose an Effective and Efficient Visual Prompt Tuning (E2VPT) approach for large-scale transformer-based model adaptation.
Our approach outperforms several state-of-the-art baselines on two benchmarks.
arXiv Detail & Related papers (2023-07-25T19:03:21Z) - Pre-processing training data improves accuracy and generalisability of
convolutional neural network based landscape semantic segmentation [2.8747398859585376]
We trialled different methods of data preparation for CNN training and semantic segmentation of land use land cover (LULC) features within aerial photography over the Wet Tropics and Atherton Tablelands, Queensland, Australia.
This was conducted through trialling and ranking various training patch selection sampling strategies, patch and batch sizes and data augmentations and scaling.
We fully trained five models on the 2018 training image and applied the model to the 2015 test image with the output LULC classifications achieving an average of 0.84 user accuracy of 0.81 and producer accuracy of 0.87.
arXiv Detail & Related papers (2023-04-28T04:38:45Z) - Model soups: averaging weights of multiple fine-tuned models improves
accuracy without increasing inference time [69.7693300927423]
We show that averaging the weights of multiple models fine-tuned with different hyper parameter configurations improves accuracy and robustness.
We show that the model soup approach extends to multiple image classification and natural language processing tasks.
arXiv Detail & Related papers (2022-03-10T17:03:49Z) - From Sound Representation to Model Robustness [82.21746840893658]
We investigate the impact of different standard environmental sound representations (spectrograms) on the recognition performance and adversarial attack robustness of a victim residual convolutional neural network.
Averaged over various experiments on three environmental sound datasets, we found the ResNet-18 model outperforms other deep learning architectures.
arXiv Detail & Related papers (2020-07-27T17:30:49Z) - SADet: Learning An Efficient and Accurate Pedestrian Detector [68.66857832440897]
This paper proposes a series of systematic optimization strategies for the detection pipeline of one-stage detector.
It forms a single shot anchor-based detector (SADet) for efficient and accurate pedestrian detection.
Though structurally simple, it presents state-of-the-art result and real-time speed of $20$ FPS for VGA-resolution images.
arXiv Detail & Related papers (2020-07-26T12:32:38Z) - Improving 3D Object Detection through Progressive Population Based
Augmentation [91.56261177665762]
We present the first attempt to automate the design of data augmentation policies for 3D object detection.
We introduce the Progressive Population Based Augmentation (PPBA) algorithm, which learns to optimize augmentation strategies by narrowing down the search space and adopting the best parameters discovered in previous iterations.
We find that PPBA may be up to 10x more data efficient than baseline 3D detection models without augmentation, highlighting that 3D detection models may achieve competitive accuracy with far fewer labeled examples.
arXiv Detail & Related papers (2020-04-02T05:57:02Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.