Perception Reinforcement Using Auxiliary Learning Feature Fusion: A
Modified Yolov8 for Head Detection
- URL: http://arxiv.org/abs/2310.09492v1
- Date: Sat, 14 Oct 2023 04:52:35 GMT
- Title: Perception Reinforcement Using Auxiliary Learning Feature Fusion: A
Modified Yolov8 for Head Detection
- Authors: Jiezhou Chen, Guankun Wang, Weixiang Liu, Xiaopin Zhong, Yibin Tian,
ZongZe Wu
- Abstract summary: We present a modified Yolov8 which improves head detection performance through target perception.
An Auxiliary Learning Feature Fusion (ALFF) module comprised of LSTM and convolutional blocks is used as the auxiliary task.
In addition, we introduce Noise into Distribution Focal Loss to facilitate model fitting and improve the accuracy of detection.
- Score: 8.065947209864646
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Head detection provides distribution information of pedestrian, which is
crucial for scene statistical analysis, traffic management, and risk assessment
and early warning. However, scene complexity and large-scale variation in the
real world make accurate detection more difficult. Therefore, we present a
modified Yolov8 which improves head detection performance through reinforcing
target perception. An Auxiliary Learning Feature Fusion (ALFF) module comprised
of LSTM and convolutional blocks is used as the auxiliary task to help the
model perceive targets. In addition, we introduce Noise Calibration into
Distribution Focal Loss to facilitate model fitting and improve the accuracy of
detection. Considering the requirements of high accuracy and speed for the head
detection task, our method is adapted with two kinds of backbone, namely
Yolov8n and Yolov8m. The results demonstrate the superior performance of our
approach in improving detection accuracy and robustness.
Related papers
- Localized Gaussians as Self-Attention Weights for Point Clouds Correspondence [92.07601770031236]
We investigate semantically meaningful patterns in the attention heads of an encoder-only Transformer architecture.
We find that fixing the attention weights not only accelerates the training process but also enhances the stability of the optimization.
arXiv Detail & Related papers (2024-09-20T07:41:47Z) - Open-Set Deepfake Detection: A Parameter-Efficient Adaptation Method with Forgery Style Mixture [58.60915132222421]
We introduce an approach that is both general and parameter-efficient for face forgery detection.
We design a forgery-style mixture formulation that augments the diversity of forgery source domains.
We show that the designed model achieves state-of-the-art generalizability with significantly reduced trainable parameters.
arXiv Detail & Related papers (2024-08-23T01:53:36Z) - Research on target detection method of distracted driving behavior based on improved YOLOv8 [6.405098280736171]
This study proposes an improved YOLOv8 detection method based on the original YOLOv8 model by integrating the BoTNet module, GAM attention mechanism and EIoU loss function.
Experimental results show that the improved model performs well in both detection speed and accuracy, with an accuracy rate of 99.4%.
arXiv Detail & Related papers (2024-07-02T00:43:41Z) - Efficient Meta-Learning Enabled Lightweight Multiscale Few-Shot Object Detection in Remote Sensing Images [15.12889076965307]
YOLOv7 one-stage detector is subjected to a novel meta-learning training framework.
This transformation allows the detector to adeptly address FSOD tasks while capitalizing on its inherent advantage of lightweight.
To validate the effectiveness of our proposed detector, we conducted performance comparisons with current state-of-the-art detectors.
arXiv Detail & Related papers (2024-04-29T04:56:52Z) - HANet: A Hierarchical Attention Network for Change Detection With Bitemporal Very-High-Resolution Remote Sensing Images [6.890268321645873]
We propose a progressive foreground-balanced sampling strategy on the basis of not adding change information.
This strategy helps the model accurately learn the features of the changed pixels during the early training process.
We also design a discriminative Siamese network, hierarchical attention network (HANet), which can integrate multiscale features and refine detailed features.
arXiv Detail & Related papers (2024-04-14T08:01:27Z) - Unsupervised Domain Adaptation for Self-Driving from Past Traversal
Features [69.47588461101925]
We propose a method to adapt 3D object detectors to new driving environments.
Our approach enhances LiDAR-based detection models using spatial quantized historical features.
Experiments on real-world datasets demonstrate significant improvements.
arXiv Detail & Related papers (2023-09-21T15:00:31Z) - Enhancing Infrared Small Target Detection Robustness with Bi-Level
Adversarial Framework [61.34862133870934]
We propose a bi-level adversarial framework to promote the robustness of detection in the presence of distinct corruptions.
Our scheme remarkably improves 21.96% IOU across a wide array of corruptions and notably promotes 4.97% IOU on the general benchmark.
arXiv Detail & Related papers (2023-09-03T06:35:07Z) - Domain Adaptive Synapse Detection with Weak Point Annotations [63.97144211520869]
We present AdaSyn, a framework for domain adaptive synapse detection with weak point annotations.
In the WASPSYN challenge at I SBI 2023, our method ranks the 1st place.
arXiv Detail & Related papers (2023-08-31T05:05:53Z) - Rule-Based Error Detection and Correction to Operationalize Movement Trajectory Classification [1.192247515575942]
We provide a neuro-symbolic rule-based framework to conduct error correction and detection of models to integrate into our movement trajectory platform.
We show an F1 scores for predicting errors of up to 0.984, significant performance increase for out-of distribution accuracy (8.51% improvement over SOTA for zero-shot accuracy) and accuracy improvement over the SOTA model.
arXiv Detail & Related papers (2023-08-28T01:57:38Z) - Diffusion Denoising Process for Perceptron Bias in Out-of-distribution
Detection [67.49587673594276]
We introduce a new perceptron bias assumption that suggests discriminator models are more sensitive to certain features of the input, leading to the overconfidence problem.
We demonstrate that the diffusion denoising process (DDP) of DMs serves as a novel form of asymmetric, which is well-suited to enhance the input and mitigate the overconfidence problem.
Our experiments on CIFAR10, CIFAR100, and ImageNet show that our method outperforms SOTA approaches.
arXiv Detail & Related papers (2022-11-21T08:45:08Z) - Improved YOLOv5 network for real-time multi-scale traffic sign detection [4.5598087061051755]
We propose an improved feature pyramid model, named AF-FPN, which utilize the adaptive attention module (AAM) and feature enhancement module (FEM) to reduce the information loss in the process of feature map generation.
We replace the original feature pyramid network in YOLOv5 with AF-FPN, which improves the detection performance for multi-scale targets of the YOLOv5 network.
arXiv Detail & Related papers (2021-12-16T11:02:12Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.