DANet: Enhancing Small Object Detection through an Efficient Deformable
Attention Network
- URL: http://arxiv.org/abs/2310.05768v2
- Date: Fri, 13 Oct 2023 15:00:11 GMT
- Title: DANet: Enhancing Small Object Detection through an Efficient Deformable
Attention Network
- Authors: Md Sohag Mia, Abdullah Al Bary Voban, Abu Bakor Hayat Arnob, Abdu
Naim, Md Kawsar Ahmed, Md Shariful Islam
- Abstract summary: We propose a comprehensive strategy by synergizing Faster R-CNN with cutting-edge methods.
By combining Faster R-CNN with Feature Pyramid Network, we enable the model to handle multi-scale features intrinsic to manufacturing environments.
Deformable Net is used that contorts and conforms to the geometric variations of defects, bringing precision in detecting even the minuscule and complex features.
- Score: 0.0
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Efficient and accurate detection of small objects in manufacturing settings,
such as defects and cracks, is crucial for ensuring product quality and safety.
To address this issue, we proposed a comprehensive strategy by synergizing
Faster R-CNN with cutting-edge methods. By combining Faster R-CNN with Feature
Pyramid Network, we enable the model to efficiently handle multi-scale features
intrinsic to manufacturing environments. Additionally, Deformable Net is used
that contorts and conforms to the geometric variations of defects, bringing
precision in detecting even the minuscule and complex features. Then, we
incorporated an attention mechanism called Convolutional Block Attention Module
in each block of our base ResNet50 network to selectively emphasize informative
features and suppress less useful ones. After that we incorporated RoI Align,
replacing RoI Pooling for finer region-of-interest alignment and finally the
integration of Focal Loss effectively handles class imbalance, crucial for rare
defect occurrences. The rigorous evaluation of our model on both the NEU-DET
and Pascal VOC datasets underscores its robust performance and generalization
capabilities. On the NEU-DET dataset, our model exhibited a profound
understanding of steel defects, achieving state-of-the-art accuracy in
identifying various defects. Simultaneously, when evaluated on the Pascal VOC
dataset, our model showcases its ability to detect objects across a wide
spectrum of categories within complex and small scenes.
Related papers
- Task-Oriented Real-time Visual Inference for IoVT Systems: A Co-design Framework of Neural Networks and Edge Deployment [61.20689382879937]
Task-oriented edge computing addresses this by shifting data analysis to the edge.
Existing methods struggle to balance high model performance with low resource consumption.
We propose a novel co-design framework to optimize neural network architecture.
arXiv Detail & Related papers (2024-10-29T19:02:54Z) - Scale-Invariant Object Detection by Adaptive Convolution with Unified Global-Local Context [3.061662434597098]
We propose an object detection model using a Switchable (adaptive) Atrous Convolutional Network (SAC-Net) based on the efficientDet model.
The proposed SAC-Net encapsulates the benefits of both low-level and high-level features to achieve improved performance on multi-scale object detection tasks.
Our experiments on benchmark datasets demonstrate that the proposed SAC-Net outperforms the state-of-the-art models by a significant margin in terms of accuracy.
arXiv Detail & Related papers (2024-09-17T10:08:37Z) - DDNet: Deformable Convolution and Dense FPN for Surface Defect Detection in Recycled Books [13.223022246455077]
DDNet is an innovative detection model designed to enhance defect localization and classification.
We present a dataset specifically curated for surface defect detection in recycled and recirculated books.
DDNet achieves precise localization and classification of surface defects, recording a mAP value of 46.7%.
arXiv Detail & Related papers (2024-09-08T03:18:19Z) - Machine learning-based network intrusion detection for big and
imbalanced data using oversampling, stacking feature embedding and feature
extraction [6.374540518226326]
Intrusion Detection Systems (IDS) play a critical role in protecting interconnected networks by detecting malicious actors and activities.
This paper introduces a novel ML-based network intrusion detection model that uses Random Oversampling (RO) to address data imbalance and Stacking Feature Embedding (PCA) for dimension reduction.
Using the CIC-IDS 2017 dataset, DT, RF, and ET models reach 99.99% accuracy, while DT and RF models obtain 99.94% accuracy on CIC-IDS 2018 dataset.
arXiv Detail & Related papers (2024-01-22T05:49:41Z) - Innovative Horizons in Aerial Imagery: LSKNet Meets DiffusionDet for
Advanced Object Detection [55.2480439325792]
We present an in-depth evaluation of an object detection model that integrates the LSKNet backbone with the DiffusionDet head.
The proposed model achieves a mean average precision (MAP) of approximately 45.7%, which is a significant improvement.
This advancement underscores the effectiveness of the proposed modifications and sets a new benchmark in aerial image analysis.
arXiv Detail & Related papers (2023-11-21T19:49:13Z) - Global Context Aggregation Network for Lightweight Saliency Detection of
Surface Defects [70.48554424894728]
We develop a Global Context Aggregation Network (GCANet) for lightweight saliency detection of surface defects on the encoder-decoder structure.
First, we introduce a novel transformer encoder on the top layer of the lightweight backbone, which captures global context information through a novel Depth-wise Self-Attention (DSA) module.
The experimental results on three public defect datasets demonstrate that the proposed network achieves a better trade-off between accuracy and running efficiency compared with other 17 state-of-the-art methods.
arXiv Detail & Related papers (2023-09-22T06:19:11Z) - CINFormer: Transformer network with multi-stage CNN feature injection
for surface defect segmentation [73.02218479926469]
We propose a transformer network with multi-stage CNN feature injection for surface defect segmentation.
CINFormer presents a simple yet effective feature integration mechanism that injects the multi-level CNN features of the input image into different stages of the transformer network in the encoder.
In addition, CINFormer presents a Top-K self-attention module to focus on tokens with more important information about the defects.
arXiv Detail & Related papers (2023-09-22T06:12:02Z) - Small Object Detection via Coarse-to-fine Proposal Generation and
Imitation Learning [52.06176253457522]
We propose a two-stage framework tailored for small object detection based on the Coarse-to-fine pipeline and Feature Imitation learning.
CFINet achieves state-of-the-art performance on the large-scale small object detection benchmarks, SODA-D and SODA-A.
arXiv Detail & Related papers (2023-08-18T13:13:09Z) - Lightweight wood panel defect detection method incorporating attention
mechanism and feature fusion network [9.775181958901326]
We propose a lightweight wood panel defect detection method called YOLOv5-LW, which incorporates attention mechanisms and a feature fusion network.
The proposed method achieves a 92.8% accuracy rate, reduces the number of parameters by 27.78%, compresses computational volume by 41.25%, improves detection inference speed by 10.16%.
arXiv Detail & Related papers (2023-06-21T08:55:45Z) - TinyDefectNet: Highly Compact Deep Neural Network Architecture for
High-Throughput Manufacturing Visual Quality Inspection [72.88856890443851]
TinyDefectNet is a highly compact deep convolutional network architecture tailored for high- throughput manufacturing visual quality inspection.
TinyDefectNet was deployed on an AMD EPYC 7R32, and achieved 7.6x faster throughput using the nativeflow environment and 9x faster throughput using AMD ZenDNN accelerator library.
arXiv Detail & Related papers (2021-11-29T04:19:28Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.