Towards Robust Plant Disease Diagnosis with Hard-sample Re-mining
Strategy
- URL: http://arxiv.org/abs/2309.01903v2
- Date: Mon, 30 Oct 2023 13:59:57 GMT
- Title: Towards Robust Plant Disease Diagnosis with Hard-sample Re-mining
Strategy
- Authors: Quan Huu Cap, Atsushi Fukuda, Satoshi Kagiwada, Hiroyuki Uga, Nobusuke
Iwasaki, Hitoshi Iyatomi
- Abstract summary: We propose a simple but effective training strategy called hard-sample re-mining (HSReM)
HSReM is designed to enhance the diagnostic performance of healthy data and simultaneously improve the performance of disease data.
Experiments show that our HSReM training strategy leads to a substantial improvement in the overall diagnostic performance on large-scale unseen data.
- Score: 6.844857856353672
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: With rich annotation information, object detection-based automated plant
disease diagnosis systems (e.g., YOLO-based systems) often provide advantages
over classification-based systems (e.g., EfficientNet-based), such as the
ability to detect disease locations and superior classification performance.
One drawback of these detection systems is dealing with unannotated healthy
data with no real symptoms present. In practice, healthy plant data appear to
be very similar to many disease data. Thus, those models often produce
mis-detected boxes on healthy images. In addition, labeling new data for
detection models is typically time-consuming. Hard-sample mining (HSM) is a
common technique for re-training a model by using the mis-detected boxes as new
training samples. However, blindly selecting an arbitrary amount of hard-sample
for re-training will result in the degradation of diagnostic performance for
other diseases due to the high similarity between disease and healthy data. In
this paper, we propose a simple but effective training strategy called
hard-sample re-mining (HSReM), which is designed to enhance the diagnostic
performance of healthy data and simultaneously improve the performance of
disease data by strategically selecting hard-sample training images at an
appropriate level. Experiments based on two practical in-field eight-class
cucumber and ten-class tomato datasets (42.7K and 35.6K images) show that our
HSReM training strategy leads to a substantial improvement in the overall
diagnostic performance on large-scale unseen data. Specifically, the object
detection model trained using the HSReM strategy not only achieved superior
results as compared to the classification-based state-of-the-art
EfficientNetV2-Large model and the original object detection model, but also
outperformed the model using the HSM strategy in multiple evaluation metrics.
Related papers
- Weakly supervised deep learning model with size constraint for prostate cancer detection in multiparametric MRI and generalization to unseen domains [0.90668179713299]
We show that the model achieves on-par performance with strong fully supervised baseline models.
We also observe a performance decrease for both fully supervised and weakly supervised models when tested on unseen data domains.
arXiv Detail & Related papers (2024-11-04T12:24:33Z) - DDPM-MoCo: Advancing Industrial Surface Defect Generation and Detection with Generative and Contrastive Learning [3.789219860006095]
We introduce a novel defect-generation method, named DDPM-MoCo, to address these issues.
Firstly, we utilize the Denoising Diffusion Probabilistic Model (DDPM) to generate high-quality defect data samples.
Secondly, we utilize the unsupervised learning Momentum Contrast model (MoCo) with an enhanced batch contrastive loss function for training the model on unlabeled data.
arXiv Detail & Related papers (2024-05-09T17:17:53Z) - Which Augmentation Should I Use? An Empirical Investigation of Augmentations for Self-Supervised Phonocardiogram Representation Learning [5.438725298163702]
Contrastive Self-Supervised Learning (SSL) offers a potential solution to labeled data scarcity.
We propose uncovering the optimal augmentations for applying contrastive learning in 1D phonocardiogram (PCG) classification.
We demonstrate that depending on its training distribution, the effectiveness of a fully-supervised model can degrade up to 32%, while SSL models only lose up to 10% or even improve in some cases.
arXiv Detail & Related papers (2023-12-01T11:06:00Z) - Vision-Language Modelling For Radiological Imaging and Reports In The
Low Data Regime [70.04389979779195]
This paper explores training medical vision-language models (VLMs) where the visual and language inputs are embedded into a common space.
We explore several candidate methods to improve low-data performance, including adapting generic pre-trained models to novel image and text domains.
Using text-to-image retrieval as a benchmark, we evaluate the performance of these methods with variable sized training datasets of paired chest X-rays and radiological reports.
arXiv Detail & Related papers (2023-03-30T18:20:00Z) - Temporal Output Discrepancy for Loss Estimation-based Active Learning [65.93767110342502]
We present a novel deep active learning approach that queries the oracle for data annotation when the unlabeled sample is believed to incorporate high loss.
Our approach achieves superior performances than the state-of-the-art active learning methods on image classification and semantic segmentation tasks.
arXiv Detail & Related papers (2022-12-20T19:29:37Z) - Feature robustness and sex differences in medical imaging: a case study
in MRI-based Alzheimer's disease detection [1.7616042687330637]
We compare two classification schemes on the ADNI MRI dataset.
We do not find a strong dependence of model performance for male and female test subjects on the sex composition of the training dataset.
arXiv Detail & Related papers (2022-04-04T17:37:54Z) - Many-to-One Distribution Learning and K-Nearest Neighbor Smoothing for
Thoracic Disease Identification [83.6017225363714]
deep learning has become the most powerful computer-aided diagnosis technology for improving disease identification performance.
For chest X-ray imaging, annotating large-scale data requires professional domain knowledge and is time-consuming.
In this paper, we propose many-to-one distribution learning (MODL) and K-nearest neighbor smoothing (KNNS) methods to improve a single model's disease identification performance.
arXiv Detail & Related papers (2021-02-26T02:29:30Z) - Select-ProtoNet: Learning to Select for Few-Shot Disease Subtype
Prediction [55.94378672172967]
We focus on few-shot disease subtype prediction problem, identifying subgroups of similar patients.
We introduce meta learning techniques to develop a new model, which can extract the common experience or knowledge from interrelated clinical tasks.
Our new model is built upon a carefully designed meta-learner, called Prototypical Network, that is a simple yet effective meta learning machine for few-shot image classification.
arXiv Detail & Related papers (2020-09-02T02:50:30Z) - Semi-supervised Medical Image Classification with Relation-driven
Self-ensembling Model [71.80319052891817]
We present a relation-driven semi-supervised framework for medical image classification.
It exploits the unlabeled data by encouraging the prediction consistency of given input under perturbations.
Our method outperforms many state-of-the-art semi-supervised learning methods on both single-label and multi-label image classification scenarios.
arXiv Detail & Related papers (2020-05-15T06:57:54Z) - Hemogram Data as a Tool for Decision-making in COVID-19 Management:
Applications to Resource Scarcity Scenarios [62.997667081978825]
COVID-19 pandemics has challenged emergency response systems worldwide, with widespread reports of essential services breakdown and collapse of health care structure.
This work describes a machine learning model derived from hemogram exam data performed in symptomatic patients.
Proposed models can predict COVID-19 qRT-PCR results in symptomatic individuals with high accuracy, sensitivity and specificity.
arXiv Detail & Related papers (2020-05-10T01:45:03Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.