AGSFCOS: Based on attention mechanism and Scale-Equalizing pyramid
network of object detection
- URL: http://arxiv.org/abs/2105.09596v1
- Date: Thu, 20 May 2021 08:41:02 GMT
- Title: AGSFCOS: Based on attention mechanism and Scale-Equalizing pyramid
network of object detection
- Authors: Li Wang, Wei Xiang, Ruhui Xue, Kaida Zou, Laili Zhu
- Abstract summary: Our model has a certain improvement in accuracy compared with the current popular detection models on the COCO dataset.
Our optimal model can get 39.5% COCO AP under the background of ResNet50.
- Score: 10.824032219531095
- License: http://creativecommons.org/licenses/by-nc-nd/4.0/
- Abstract: Recently, the anchor-free object detection model has shown great potential
for accuracy and speed to exceed anchor-based object detection. Therefore, two
issues are mainly studied in this article: (1) How to let the backbone network
in the anchor-free object detection model learn feature extraction? (2) How to
make better use of the feature pyramid network? In order to solve the above
problems, Experiments show that our model has a certain improvement in accuracy
compared with the current popular detection models on the COCO dataset, the
designed attention mechanism module can capture contextual information well,
improve detection accuracy, and use sepc network to help balance abstract and
detailed information, and reduce the problem of semantic gap in the feature
pyramid network. Whether it is anchor-based network model YOLOv3, Faster RCNN,
or anchor-free network model Foveabox, FSAF, FCOS. Our optimal model can get
39.5% COCO AP under the background of ResNet50.
Related papers
- Scale-Invariant Object Detection by Adaptive Convolution with Unified Global-Local Context [3.061662434597098]
We propose an object detection model using a Switchable (adaptive) Atrous Convolutional Network (SAC-Net) based on the efficientDet model.
The proposed SAC-Net encapsulates the benefits of both low-level and high-level features to achieve improved performance on multi-scale object detection tasks.
Our experiments on benchmark datasets demonstrate that the proposed SAC-Net outperforms the state-of-the-art models by a significant margin in terms of accuracy.
arXiv Detail & Related papers (2024-09-17T10:08:37Z) - PVAFN: Point-Voxel Attention Fusion Network with Multi-Pooling Enhancing for 3D Object Detection [59.355022416218624]
integration of point and voxel representations is becoming more common in LiDAR-based 3D object detection.
We propose a novel two-stage 3D object detector, called Point-Voxel Attention Fusion Network (PVAFN)
PVAFN uses a multi-pooling strategy to integrate both multi-scale and region-specific information effectively.
arXiv Detail & Related papers (2024-08-26T19:43:01Z) - Innovative Horizons in Aerial Imagery: LSKNet Meets DiffusionDet for
Advanced Object Detection [55.2480439325792]
We present an in-depth evaluation of an object detection model that integrates the LSKNet backbone with the DiffusionDet head.
The proposed model achieves a mean average precision (MAP) of approximately 45.7%, which is a significant improvement.
This advancement underscores the effectiveness of the proposed modifications and sets a new benchmark in aerial image analysis.
arXiv Detail & Related papers (2023-11-21T19:49:13Z) - Rethinking Voxelization and Classification for 3D Object Detection [68.8204255655161]
The main challenge in 3D object detection from LiDAR point clouds is achieving real-time performance without affecting the reliability of the network.
We present a solution to improve network inference speed and precision at the same time by implementing a fast dynamic voxelizer.
In addition, we propose a lightweight detection sub-head model for classifying predicted objects and filter out false detected objects.
arXiv Detail & Related papers (2023-01-10T16:22:04Z) - Part-Based Models Improve Adversarial Robustness [57.699029966800644]
We show that combining human prior knowledge with end-to-end learning can improve the robustness of deep neural networks.
Our model combines a part segmentation model with a tiny classifier and is trained end-to-end to simultaneously segment objects into parts.
Our experiments indicate that these models also reduce texture bias and yield better robustness against common corruptions and spurious correlations.
arXiv Detail & Related papers (2022-09-15T15:41:47Z) - a novel attention-based network for fast salient object detection [14.246237737452105]
In the current salient object detection network, the most popular method is using U-shape structure.
We propose a new deep convolution network architecture with three contributions.
Results demonstrate that the proposed method can compress the model to 1/3 of the original size nearly without losing the accuracy.
arXiv Detail & Related papers (2021-12-20T12:30:20Z) - Efficient Person Search: An Anchor-Free Approach [86.45858994806471]
Person search aims to simultaneously localize and identify a query person from realistic, uncropped images.
To achieve this goal, state-of-the-art models typically add a re-id branch upon two-stage detectors like Faster R-CNN.
In this work, we present an anchor-free approach to efficiently tackling this challenging task, by introducing the following dedicated designs.
arXiv Detail & Related papers (2021-09-01T07:01:33Z) - An Attention Module for Convolutional Neural Networks [5.333582981327498]
We propose an attention module for convolutional neural networks by developing an AW-convolution.
Experiments on several datasets for image classification and object detection tasks show the effectiveness of our proposed attention module.
arXiv Detail & Related papers (2021-08-18T15:36:18Z) - Recalibration of Neural Networks for Point Cloud Analysis [3.7814216736076434]
We introduce re-calibration modules on deep neural networks for 3D point clouds.
We demonstrate the benefit and versatility of our proposed modules by incorporating them into three state-of-the-art networks for 3D point cloud analysis.
In the second set of experiments, we investigate the benefits of re-calibration blocks on Alzheimer's Disease diagnosis.
arXiv Detail & Related papers (2020-11-25T17:14:34Z) - Zero Cost Improvements for General Object Detection Network [6.072666305426913]
Two modules are proposed to improve detection precision with zero cost.
We employ the scale attention mechanism to efficiently fuse multi-level feature maps with less parameters.
Experiment results show that the networks with the two modules can surpass original networks by 1.1 AP and 0.8 AP with zero cost.
arXiv Detail & Related papers (2020-11-16T07:21:57Z) - BiDet: An Efficient Binarized Object Detector [96.19708396510894]
We propose a binarized neural network learning method called BiDet for efficient object detection.
Our BiDet fully utilizes the representational capacity of the binary neural networks for object detection by redundancy removal.
Our method outperforms the state-of-the-art binary neural networks by a sizable margin.
arXiv Detail & Related papers (2020-03-09T08:16:16Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.