Trap-Based Pest Counting: Multiscale and Deformable Attention CenterNet
Integrating Internal LR and HR Joint Feature Learning
- URL: http://arxiv.org/abs/2304.02291v1
- Date: Wed, 5 Apr 2023 08:23:17 GMT
- Authors: Jae-Hyeon Lee, Chang-Hwan Son
- Abstract summary: This study proposes a new pest counting model referred to as multiscale and deformable attention CenterNet.
The proposed model is verified to generate the HR heatmap more accurately and improve pest counting accuracy.
The experimental results show that the proposed model outperforms state-of-the-art crowd counting and object detection models.
- Score: 4.721069729610892
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Pest counting, which predicts the number of pests in the early stage, is very
important because it enables rapid pest control, reduces damage to crops, and
improves productivity. In recent years, light traps have been increasingly used
to lure and photograph pests for pest counting. However, pest images have a
wide range of variability in pest appearance owing to severe occlusion, wide
pose variation, and even scale variation. This makes pest counting more
challenging. To address these issues, this study proposes a new pest counting
model referred to as multiscale and deformable attention CenterNet
(Mada-CenterNet) for internal low-resolution (LR) and high-resolution (HR)
joint feature learning. Compared with the conventional CenterNet, the proposed
Mada-CenterNet adopts a two-step multiscale heatmap generation approach that
predicts LR and HR heatmaps adapted to scale variations,
that is, changes in the number of pests. In addition, to overcome the pose and
occlusion problems, a new between-hourglass skip connection based on deformable
and multiscale attention is designed to ensure internal LR and HR joint feature
learning and incorporate geometric deformation, thereby improving
pest counting accuracy. Through experiments, the proposed
Mada-CenterNet is verified to generate the HR heatmap more accurately and
improve pest counting accuracy owing to multiscale heatmap generation, joint
internal feature learning, and deformable and multiscale attention. In
addition, the proposed model is confirmed to be effective in overcoming severe
occlusions and variations in pose and scale. The experimental results show that
the proposed model outperforms state-of-the-art crowd counting and object
detection models.
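The abstract does not spell out how a pest count is read off a predicted heatmap. In CenterNet-style detectors, object centers are commonly taken as local peaks of the heatmap, so a count can be sketched as the number of peaks above a confidence threshold. The following is a minimal illustration under that assumption, not the authors' implementation; the function name `count_peaks`, the 3x3 window, the threshold of 0.3, and the toy heatmap values are all hypothetical choices.

```python
from typing import List


def count_peaks(heatmap: List[List[float]], threshold: float = 0.3) -> int:
    """Count cells that reach the threshold and equal the maximum of
    their 3x3 neighbourhood (clipped at the borders)."""
    rows = len(heatmap)
    cols = len(heatmap[0]) if rows else 0
    count = 0
    for r in range(rows):
        for c in range(cols):
            value = heatmap[r][c]
            if value < threshold:
                continue  # too weak to be treated as an object center
            # Collect the 3x3 window around (r, c), including the cell itself.
            window = [
                heatmap[rr][cc]
                for rr in range(max(0, r - 1), min(rows, r + 2))
                for cc in range(max(0, c - 1), min(cols, c + 2))
            ]
            # Keep the cell only if nothing in its window is larger.
            # Tied cells on a flat plateau are each counted separately.
            if value == max(window):
                count += 1
    return count


# Toy heatmap (assumed values) with two clear peaks: 0.9 and 0.6.
heatmap = [
    [0.0, 0.1, 0.0, 0.0],
    [0.1, 0.9, 0.2, 0.0],
    [0.0, 0.2, 0.1, 0.6],
    [0.0, 0.0, 0.2, 0.1],
]

print(count_peaks(heatmap))  # prints 2
```

Raising the threshold trades recall for precision: with `threshold=0.7` only the 0.9 peak survives. A higher-resolution heatmap gives closely packed objects more room to form separate peaks, which is one plausible reason an accurate HR heatmap matters for counting.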
Related papers
- Locally Grouped and Scale-Guided Attention for Dense Pest Counting [1.9580473532948401]
This study introduces a new dense pest counting problem to predict densely distributed pests captured by digital traps.
To address these problems, it is essential to incorporate the local attention mechanism.
This study presents a novel design that integrates locally grouped and scale-guided attention into a multiscale CenterNet framework.
arXiv Detail & Related papers (2024-08-29T13:02:01Z)
- Open-Set Deepfake Detection: A Parameter-Efficient Adaptation Method with Forgery Style Mixture [58.60915132222421]
We introduce an approach that is both general and parameter-efficient for face forgery detection.
We design a forgery-style mixture formulation that augments the diversity of forgery source domains.
We show that the designed model achieves state-of-the-art generalizability with significantly reduced trainable parameters.
arXiv Detail & Related papers (2024-08-23T01:53:36Z)
- MAP: Low-compute Model Merging with Amortized Pareto Fronts via Quadratic Approximation [80.47072100963017]
We introduce a novel, low-compute algorithm, Model Merging with Amortized Pareto Front (MAP).
MAP efficiently identifies a set of scaling coefficients for merging multiple models, reflecting the trade-offs involved.
We also introduce Bayesian MAP for scenarios with a relatively low number of tasks and Nested MAP for situations with a high number of tasks, further reducing the computational cost of evaluation.
arXiv Detail & Related papers (2024-06-11T17:55:25Z)
- InsectMamba: Insect Pest Classification with State Space Model [8.470757741028661]
InsectMamba is a novel approach that integrates State Space Models (SSMs), Convolutional Neural Networks (CNNs), Multi-Head Self-Attention mechanism (MSA) and Multilayer Perceptrons (MLPs) within Mix-SSM blocks.
It was evaluated against strong competitors across five insect pest classification datasets.
arXiv Detail & Related papers (2024-04-04T17:34:21Z)
- ECC-PolypDet: Enhanced CenterNet with Contrastive Learning for Automatic Polyp Detection [88.4359020192429]
Existing methods either involve computationally expensive context aggregation or lack prior modeling of polyps, resulting in poor performance in challenging cases.
In this paper, we propose the Enhanced CenterNet with Contrastive Learning (ECC-PolypDet), a two-stage training & end-to-end inference framework.
Box-assisted Contrastive Learning (BCL) is used during training to minimize the intra-class difference and maximize the inter-class difference between foreground polyps and backgrounds, enabling the model to capture concealed polyps.
In the fine-tuning stage, we introduce the IoU-guided Sample Re-weighting
arXiv Detail & Related papers (2024-01-10T07:03:41Z)
- SugarViT -- Multi-objective Regression of UAV Images with Vision Transformers and Deep Label Distribution Learning Demonstrated on Disease Severity Prediction in Sugar Beet [3.2925222641796554]
This work will introduce a machine learning framework for automatized large-scale plant-specific trait annotation.
We develop an efficient Vision Transformer based model for disease severity scoring called SugarViT.
Although the model is evaluated on this specific use case, it is kept as generic as possible so that it is also applicable to various image-based classification and regression tasks.
arXiv Detail & Related papers (2023-11-06T13:01:17Z)
- Improving FHB Screening in Wheat Breeding Using an Efficient Transformer Model [0.0]
Fusarium head blight is a devastating disease that causes significant economic losses annually on small grains.
Image processing techniques have been developed using supervised machine learning algorithms for the early detection of FHB.
A new Context Bridge is proposed to integrate the local representation capability of the U-Net network in the transformer model.
arXiv Detail & Related papers (2023-08-07T15:44:58Z)
- Evaluation of the potential of Near Infrared Hyperspectral Imaging for monitoring the invasive brown marmorated stink bug [53.682955739083056]
The brown marmorated stink bug (BMSB), Halyomorpha halys, is an invasive insect pest of global importance that damages several crops.
The present study is a preliminary laboratory-level evaluation of Near Infrared Hyperspectral Imaging (NIR-HSI) as a possible technology to detect BMSB specimens.
arXiv Detail & Related papers (2023-01-19T11:37:20Z)
- Deep Learning-Based Defect Classification and Detection in SEM Images [1.9206693386750882]
In particular, we train RetinaNet models using different ResNet and VGGNet architectures as backbones.
We propose a preference-based ensemble strategy to combine the output predictions from different models in order to achieve better performance on classification and detection of defects.
arXiv Detail & Related papers (2022-06-20T16:34:11Z)
- A Model for Multi-View Residual Covariances based on Perspective Deformation [88.21738020902411]
We derive a model for the covariance of the visual residuals in multi-view SfM, odometry and SLAM setups.
We validate our model with synthetic and real data and integrate it into photometric and feature-based Bundle Adjustment.
arXiv Detail & Related papers (2022-02-01T21:21:56Z)
- A Hamiltonian Monte Carlo Method for Probabilistic Adversarial Attack and Learning [122.49765136434353]
We present an effective method, called Hamiltonian Monte Carlo with Accumulated Momentum (HMCAM), aiming to generate a sequence of adversarial examples.
We also propose a new generative method called Contrastive Adversarial Training (CAT), which approaches the equilibrium distribution of adversarial examples.
Both quantitative and qualitative analysis on several natural image datasets and practical systems have confirmed the superiority of the proposed algorithm.
arXiv Detail & Related papers (2020-10-15T16:07:26Z)