An Improved Lightweight YOLOv5 Model Based on Attention Mechanism for
Face Mask Detection
- URL: http://arxiv.org/abs/2203.16506v1
- Date: Wed, 30 Mar 2022 17:41:21 GMT
- Title: An Improved Lightweight YOLOv5 Model Based on Attention Mechanism for
Face Mask Detection
- Authors: Sheng Xu
- Abstract summary: We propose an improved lightweight face mask detector based on YOLOv5.
It achieves a mean average precision of 95.2%, which is 4.4% higher than the baseline and is also more accurate compared with other existing models.
- Score: 3.3398969693904723
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Coronavirus 2019 has brought severe challenges to social stability and public
health worldwide. One effective way of curbing the epidemic is to require
people to wear masks in public places and monitor mask-wearing states by
utilizing suitable automatic detectors. However, existing deep learning based
models struggle to simultaneously achieve the requirements of both high
precision and real-time performance. To solve this problem, we propose an
improved lightweight face mask detector based on YOLOv5, which can achieve an
excellent balance of precision and speed. Firstly, a novel backbone
ShuffleCANet that combines ShuffleNetV2 network with Coordinate Attention
mechanism is proposed as the backbone. Then we use BiFPN as the feature fusion
neck. Furthermore, we replace the loss function of localization with -CIoU to
obtain higher-quality anchors. Some valuable strategies such as data
augmentation, adaptive image scaling, and anchor cluster operation are also
utilized. Experimental results show the performance and effectiveness of the
proposed model. On the basis of the original YOLOv5 model, our work increases
the inference speed by 28.3% while still improving the precision by 0.58% on
the AIZOO face mask dataset. It achieves a mean average precision of 95.2%,
which is 4.4% higher than the baseline and is also more accurate compared with
other existing models.
Related papers
- Model Inversion Attacks Through Target-Specific Conditional Diffusion Models [54.69008212790426]
Model attacks (MIAs) aim to reconstruct private images from a target classifier's training set, thereby raising privacy concerns in AI applications.
Previous GAN-based MIAs tend to suffer from inferior generative fidelity due to GAN's inherent flaws and biased optimization within latent space.
We propose Diffusion-based Model Inversion (Diff-MI) attacks to alleviate these issues.
arXiv Detail & Related papers (2024-07-16T06:38:49Z) - A Review and Implementation of Object Detection Models and Optimizations for Real-time Medical Mask Detection during the COVID-19 Pandemic [0.0]
This work assesses the most fundamental object detection models on the Common Objects in Context (COCO) dataset.
We select a highly efficient model called YOLOv5 to train on the topical and unexplored dataset of human faces with medical masks.
We propose an optimized model based on YOLOv5 using transfer learning for the detection of correctly and incorrectly worn medical masks.
arXiv Detail & Related papers (2024-05-28T17:27:24Z) - Mask wearing object detection algorithm based on improved YOLOv5 [6.129833920546161]
This paper proposes a mask-wearing face detection model based on YOLOv5l.
Our proposed method significantly enhances the detection capability of mask-wearing.
arXiv Detail & Related papers (2023-10-16T10:06:42Z) - EdgeYOLO: An Edge-Real-Time Object Detector [69.41688769991482]
This paper proposes an efficient, low-complexity and anchor-free object detector based on the state-of-the-art YOLO framework.
We develop an enhanced data augmentation method to effectively suppress overfitting during training, and design a hybrid random loss function to improve the detection accuracy of small objects.
Our baseline model can reach the accuracy of 50.6% AP50:95 and 69.8% AP50 in MS 2017 dataset, 26.4% AP50:95 and 44.8% AP50 in VisDrone 2019-DET dataset, and it meets real-time requirements (FPS>=30) on edge-computing device Nvidia
arXiv Detail & Related papers (2023-02-15T06:05:14Z) - An Adversarial Active Sampling-based Data Augmentation Framework for
Manufacturable Chip Design [55.62660894625669]
Lithography modeling is a crucial problem in chip design to ensure a chip design mask is manufacturable.
Recent developments in machine learning have provided alternative solutions in replacing the time-consuming lithography simulations with deep neural networks.
We propose a litho-aware data augmentation framework to resolve the dilemma of limited data and improve the machine learning model performance.
arXiv Detail & Related papers (2022-10-27T20:53:39Z) - Real-Time Mask Detection Based on SSD-MobileNetV2 [2.538209532048867]
An excellent automatic real-time mask detection system can reduce a lot of work pressure for relevant staff.
Existing mask detection approaches are resource-intensive and do not achieve a good balance between speed and accuracy.
In this paper, we propose a new architecture for mask detection.
arXiv Detail & Related papers (2022-08-29T01:59:22Z) - Research on Mask Wearing Detection of Natural Population Based on
Improved YOLOv4 [0.0]
This paper proposes a new mask wearing detection method based on the improved YOLOv4.
We add the Coordinate Attention Module to the backbone to coordinate feature fusion and representation.
Thirdly, we deploy the K-means clustering algorithm to make the nine anchor boxes more suitable for our NPMD dataset.
arXiv Detail & Related papers (2022-08-24T08:04:11Z) - From Environmental Sound Representation to Robustness of 2D CNN Models
Against Adversarial Attacks [82.21746840893658]
This paper investigates the impact of different standard environmental sound representations (spectrograms) on the recognition performance and adversarial attack robustness of a victim residual convolutional neural network.
We show that while the ResNet-18 model trained on DWT spectrograms achieves a high recognition accuracy, attacking this model is relatively more costly for the adversary.
arXiv Detail & Related papers (2022-04-14T15:14:08Z) - PP-PicoDet: A Better Real-Time Object Detector on Mobile Devices [13.62426382827205]
PP-PicoDet family of real-time object detectors achieves superior performance on object detection for mobile devices.
Models achieve better trade-offs between accuracy and latency compared to other popular models.
arXiv Detail & Related papers (2021-11-01T12:53:17Z) - MEMO: Test Time Robustness via Adaptation and Augmentation [131.28104376280197]
We study the problem of test time robustification, i.e., using the test input to improve model robustness.
Recent prior works have proposed methods for test time adaptation, however, they each introduce additional assumptions.
We propose a simple approach that can be used in any test setting where the model is probabilistic and adaptable.
arXiv Detail & Related papers (2021-10-18T17:55:11Z) - BinaryCoP: Binary Neural Network-based COVID-19 Face-Mask Wear and
Positioning Predictor on Edge Devices [63.56630165340053]
Face masks offer an effective solution in healthcare for bi-directional protection against air-borne diseases.
CNNs offer an excellent solution for face recognition and classification of correct mask wearing and positioning.
CNNs can be used at entrances to corporate buildings, airports, shopping areas, and other indoor locations, to mitigate the spread of the virus.
arXiv Detail & Related papers (2021-02-06T00:14:06Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.