FSSD: Feature Fusion Single Shot Multibox Detector
- URL: http://arxiv.org/abs/1712.00960v4
- Date: Fri, 23 Feb 2024 03:16:50 GMT
- Title: FSSD: Feature Fusion Single Shot Multibox Detector
- Authors: Zuoxin Li, Lu Yang and Fuqiang Zhou
- Abstract summary: FSSD (Feature Fusion Single Shot Multibox Detector) is an enhanced SSD with a novel and lightweight feature fusion module.
Our network can achieve 82.7 mAP (mean average precision) at the speed of 65.8 FPS (frame per second) with the input size 300$times$300 using a single Nvidia 1080Ti GPU.
- Score: 8.016875965887815
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: SSD (Single Shot Multibox Detector) is one of the best object detection
algorithms with both high accuracy and fast speed. However, SSD's feature
pyramid detection method makes it hard to fuse the features from different
scales. In this paper, we proposed FSSD (Feature Fusion Single Shot Multibox
Detector), an enhanced SSD with a novel and lightweight feature fusion module
which can improve the performance significantly over SSD with just a little
speed drop. In the feature fusion module, features from different layers with
different scales are concatenated together, followed by some down-sampling
blocks to generate new feature pyramid, which will be fed to multibox detectors
to predict the final detection results. On the Pascal VOC 2007 test, our
network can achieve 82.7 mAP (mean average precision) at the speed of 65.8 FPS
(frame per second) with the input size 300$\times$300 using a single Nvidia
1080Ti GPU. In addition, our result on COCO is also better than the
conventional SSD with a large margin. Our FSSD outperforms a lot of
state-of-the-art object detection algorithms in both aspects of accuracy and
speed. Code is available at https://github.com/lzx1413/CAFFE_SSD/tree/fssd.
Related papers
- ASAG: Building Strong One-Decoder-Layer Sparse Detectors via Adaptive
Sparse Anchor Generation [50.01244854344167]
We bridge the performance gap between sparse and dense detectors by proposing Adaptive Sparse Anchor Generator (ASAG)
ASAG predicts dynamic anchors on patches rather than grids in a sparse way so that it alleviates the feature conflict problem.
Our method outperforms dense-d ones and achieves a better speed-accuracy trade-off.
arXiv Detail & Related papers (2023-08-18T02:06:49Z) - Follow the Soldiers with Optimized Single-Shot Multibox Detection and
Reinforcement Learning [0.0]
We build an autonomous system using DeepRacer which will follow a specific person (for our project, a soldier) when they will be moving in any direction.
Two main components to accomplish this project is an optimized Single-Shot Multibox Detection (SSD) object detection model and a Reinforcement Learning (RL) model.
Experimental results show that SSD Lite gives better performance among these three techniques and exhibits a considerable boost in inference speed (2-3 times) without compromising accuracy.
arXiv Detail & Related papers (2023-08-02T19:08:57Z) - Faster OreFSDet : A Lightweight and Effective Few-shot Object Detector
for Ore Images [36.26089993900946]
General object detection methods often suffer from severe over-fitting with scarce labeled data.
We propose a lightweight and effective few-shot detector to achieve competitive performance with general object detection.
Our method achieves the smallest model size of 19MB as well as being competitive at 50 FPS detection speed.
arXiv Detail & Related papers (2023-05-02T03:30:03Z) - Precise Single-stage Detector [2.2719729705587155]
We propose a modified version of Single Shot Multibox Detector (SSD) named Precise Single Stage Detector (PSSD)
In order to address these aforementioned issues, we propose a new architecture, named Precise Single Stage Detector (PSSD)
arXiv Detail & Related papers (2022-10-09T12:58:37Z) - Fully Sparse 3D Object Detection [57.05834683261658]
We build a fully sparse 3D object detector (FSD) for long-range LiDAR-based object detection.
FSD is built upon the general sparse voxel encoder and a novel sparse instance recognition (SIR) module.
SIR avoids the time-consuming neighbor queries in previous point-based methods by grouping points into instances.
arXiv Detail & Related papers (2022-07-20T17:01:33Z) - Embracing Single Stride 3D Object Detector with Sparse Transformer [63.179720817019096]
In LiDAR-based 3D object detection for autonomous driving, the ratio of the object size to input scene size is significantly smaller compared to 2D detection cases.
Many 3D detectors directly follow the common practice of 2D detectors, which downsample the feature maps even after quantizing the point clouds.
We propose Single-stride Sparse Transformer (SST) to maintain the original resolution from the beginning to the end of the network.
arXiv Detail & Related papers (2021-12-13T02:12:02Z) - Detecting Small Objects in Thermal Images Using Single-Shot Detector [12.72157936831052]
SSD (Single Shot Multibox Detector) is one of the most successful object detectors for its high accuracy and fast speed.
In this paper, we proposed an enhanced SSD with a novel feature fusion module which can improve the performance over SSD for small object detection.
arXiv Detail & Related papers (2021-08-25T07:54:36Z) - FastFlowNet: A Lightweight Network for Fast Optical Flow Estimation [81.76975488010213]
Dense optical flow estimation plays a key role in many robotic vision tasks.
Current networks often occupy large number of parameters and require heavy computation costs.
Our proposed FastFlowNet works in the well-known coarse-to-fine manner with following innovations.
arXiv Detail & Related papers (2021-03-08T03:09:37Z) - CIA-SSD: Confident IoU-Aware Single-Stage Object Detector From Point
Cloud [40.871850077528485]
We present a new single-stage detector named the Confident IoU-Aware Single-Stage object Detector (CIA-SSD)
CIA-SSD adaptively fuses high-level abstract semantic features and low-level spatial features for accurate predictions of bounding boxes and classification confidence.
It attains top performance in terms of the official ranking metric and above 32 FPS inference speed.
arXiv Detail & Related papers (2020-12-05T12:00:08Z) - Single Shot Video Object Detector [215.06904478667337]
Single Shot Video Object Detector (SSVD) is a new architecture that novelly integrates feature aggregation into a one-stage detector for object detection in videos.
For $448 times 448$ input, SSVD achieves 79.2% mAP on ImageNet VID dataset.
arXiv Detail & Related papers (2020-07-07T15:36:26Z) - ASFD: Automatic and Scalable Face Detector [129.82350993748258]
We propose a novel Automatic and Scalable Face Detector (ASFD)
ASFD is based on a combination of neural architecture search techniques as well as a new loss design.
Our ASFD-D6 outperforms the prior strong competitors, and our lightweight ASFD-D0 runs at more than 120 FPS with Mobilenet for VGA-resolution images.
arXiv Detail & Related papers (2020-03-25T06:00:47Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.