MobileDenseNet: A new approach to object detection on mobile devices
- URL: http://arxiv.org/abs/2207.11031v1
- Date: Fri, 22 Jul 2022 12:13:59 GMT
- Title: MobileDenseNet: A new approach to object detection on mobile devices
- Authors: Mohammad Hajizadeh, Mohammad Sabokrou, Adel Rahmani
- Abstract summary: The main goal of this article is to increase accuracy in state-of-the-art algorithms while maintaining speed and real-time efficiency.
We created a new network by the name of MobileDenseNet suitable for embedded systems.
We also developed a light neck FCPNLite for mobile devices that will aid with the detection of small objects.
- Score: 9.05607520128194
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Object detection problem solving has developed greatly within the past few
years. There is a need for lighter models in instances where hardware
limitations exist, as well as a demand for models to be tailored to mobile
devices. In this article, we will assess the methods used when creating
algorithms that address these issues. The main goal of this article is to
increase accuracy in state-of-the-art algorithms while maintaining speed and
real-time efficiency. The most significant issues in one-stage object detection
pertains to small objects and inaccurate localization. As a solution, we
created a new network by the name of MobileDenseNet suitable for embedded
systems. We also developed a light neck FCPNLite for mobile devices that will
aid with the detection of small objects. Our research revealed that very few
papers cited necks in embedded systems. What differentiates our network from
others is our use of concatenation features. A small yet significant change to
the head of the network amplified accuracy without increasing speed or limiting
parameters. In short, our focus on the challenging CoCo and Pascal VOC datasets
were 24.8 and 76.8 in percentage terms respectively - a rate higher than that
recorded by other state-of-the-art systems thus far. Our network is able to
increase accuracy while maintaining real-time efficiency on mobile devices. We
calculated operational speed on Pixel 3 (Snapdragon 845) to 22.8 fps. The
source code of this research is available on
https://github.com/hajizadeh/MobileDenseNet.
Related papers
- Learning to Make Keypoints Sub-Pixel Accurate [80.55676599677824]
This work addresses the challenge of sub-pixel accuracy in detecting 2D local features.
We propose a novel network that enhances any detector with sub-pixel precision by learning an offset vector for detected features.
arXiv Detail & Related papers (2024-07-16T12:39:56Z) - Patch-based Selection and Refinement for Early Object Detection [20.838511460733038]
We propose a novel set of algorithms that divide the image into patches, select patches with objects at various scales, elaborate the details of a small object, and detect it as early as possible.
Our approach is built upon a transformer-based network and integrates the diffusion model to improve the detection accuracy.
Our algorithms enhance the mAP for small objects from 1.03 to 8.93, and reduce the data volume in computation by more than 77%.
arXiv Detail & Related papers (2023-11-03T23:41:13Z) - SqueezerFaceNet: Reducing a Small Face Recognition CNN Even More Via
Filter Pruning [55.84746218227712]
We develop SqueezerFaceNet, a light face recognition network which less than 1M parameters.
We show that it can be further reduced (up to 40%) without an appreciable loss in performance.
arXiv Detail & Related papers (2023-07-20T08:38:50Z) - Improved detection of small objects in road network sequences [0.0]
We propose a new procedure for detecting small-scale objects by applying super-resolution processes based on detections performed by convolutional neural networks.
This work has been tested for a set of traffic images containing elements of different scales to test the efficiency according to the detections obtained by the model.
arXiv Detail & Related papers (2021-05-18T10:13:23Z) - AmphibianDetector: adaptive computation for moving objects detection [0.913755431537592]
We propose an approach to object detection which makes it possible to reduce the number of false-positive detections.
The proposed approach is a modification of CNN already trained for object detection task.
The efficiency of the proposed approach was demonstrated on the open dataset "CDNet2014 pedestrian"
arXiv Detail & Related papers (2020-11-15T12:37:44Z) - Detecting soccer balls with reduced neural networks: a comparison of
multiple architectures under constrained hardware scenarios [0.8808021343665321]
This work provides a comparative study of recent proposals of neural networks targeted towards constrained hardware environments.
We train multiple open implementations of MobileNetV2 and MobileNetV3 models with different underlying architectures.
Results show that MobileNetV3 models have a good trade-off between mAP and inference time in constrained scenarios only, while MobileNetV2 with high width multipliers are appropriate for server-side inference.
arXiv Detail & Related papers (2020-09-28T23:26:25Z) - RT3D: Achieving Real-Time Execution of 3D Convolutional Neural Networks
on Mobile Devices [57.877112704841366]
This paper proposes RT3D, a model compression and mobile acceleration framework for 3D CNNs.
For the first time, real-time execution of 3D CNNs is achieved on off-the-shelf mobiles.
arXiv Detail & Related papers (2020-07-20T02:05:32Z) - Making DensePose fast and light [78.49552144907513]
Existing neural network models capable of solving this task are heavily parameterized.
To enable Dense Pose inference on the end device with current models, one needs to support an expensive server-side infrastructure and have a stable internet connection.
In this work, we target the problem of redesigning the DensePose R-CNN model's architecture so that the final network retains most of its accuracy but becomes more light-weight and fast.
arXiv Detail & Related papers (2020-06-26T19:42:20Z) - MobileDets: Searching for Object Detection Architectures for Mobile
Accelerators [61.30355783955777]
Inverted bottleneck layers have been the predominant building blocks in state-of-the-art object detection models on mobile devices.
Regular convolutions are a potent component to boost the latency-accuracy trade-off for object detection on accelerators.
We obtain a family of object detection models, MobileDets, that achieve state-of-the-art results across mobile accelerators.
arXiv Detail & Related papers (2020-04-30T00:21:30Z) - NETNet: Neighbor Erasing and Transferring Network for Better Single Shot
Object Detection [170.30694322460045]
We propose a new Neighbor Erasing and Transferring (NET) mechanism to reconfigure the pyramid features and explore scale-aware features.
A single-shot network called NETNet is constructed for scale-aware object detection.
arXiv Detail & Related papers (2020-01-18T15:21:29Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.