You Better Look Twice: a new perspective for designing accurate
detectors with reduced computations
- URL: http://arxiv.org/abs/2107.10050v1
- Date: Wed, 21 Jul 2021 12:39:51 GMT
- Title: You Better Look Twice: a new perspective for designing accurate
detectors with reduced computations
- Authors: Alexandra Dana, Maor Shutman, Yotam Perlitz, Ran Vitek, Tomer Peleg,
Roy Jevnisek
- Abstract summary: BLT-net is a new low-computation two-stage object detection architecture.
It reduces computations by separating objects from background using a very lite first-stage.
Resulting image proposals are then processed in the second-stage by a highly accurate model.
- Score: 56.34005280792013
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: General object detectors use powerful backbones that uniformly extract
features from images for enabling detection of a vast amount of object types.
However, utilization of such backbones in object detection applications
developed for specific object types can unnecessarily over-process an extensive
amount of background. In addition, they are agnostic to object scales, thus
redundantly process all image regions at the same resolution. In this work we
introduce BLT-net, a new low-computation two-stage object detection
architecture designed to process images with a significant amount of background
and objects of variate scales. BLT-net reduces computations by separating
objects from background using a very lite first-stage. BLT-net then efficiently
merges obtained proposals to further decrease processed background and then
dynamically reduces their resolution to minimize computations. Resulting image
proposals are then processed in the second-stage by a highly accurate model. We
demonstrate our architecture on the pedestrian detection problem, where objects
are of different sizes, images are of high resolution and object detection is
required to run in real-time. We show that our design reduces computations by a
factor of x4-x7 on the Citypersons and Caltech datasets with respect to leading
pedestrian detectors, on account of a small accuracy degradation. This method
can be applied on other object detection applications in scenes with a
considerable amount of background and variate object sizes to reduce
computations.
Related papers
- ESOD: Efficient Small Object Detection on High-Resolution Images [36.80623357577051]
Small objects are usually sparsely distributed and locally clustered.
Massive feature extraction computations are wasted on the non-target background area of images.
We propose to reuse the detector's backbone to conduct feature-level object-seeking and patch-slicing.
arXiv Detail & Related papers (2024-07-23T12:21:23Z) - YOLC: You Only Look Clusters for Tiny Object Detection in Aerial Images [33.80392696735718]
YOLC (You Only Look Clusters) is an efficient and effective framework that builds on an anchor-free object detector, CenterNet.
To overcome the challenges posed by large-scale images and non-uniform object distribution, we introduce a Local Scale Module (LSM) that adaptively searches cluster regions for zooming in for accurate detection.
We perform extensive experiments on two aerial image datasets, including Visdrone 2019 and UAVDT, to demonstrate the effectiveness and superiority of our proposed approach.
arXiv Detail & Related papers (2024-04-09T10:03:44Z) - 3D Small Object Detection with Dynamic Spatial Pruning [62.72638845817799]
We propose an efficient feature pruning strategy for 3D small object detection.
We present a multi-level 3D detector named DSPDet3D which benefits from high spatial resolution.
It takes less than 2s to directly process a whole building consisting of more than 4500k points while detecting out almost all objects.
arXiv Detail & Related papers (2023-05-05T17:57:04Z) - Rethinking the backbone architecture for tiny object detection [0.0]
Existing tiny object detection methods use standard deep neural networks as their backbone architecture.
We argue that such backbones are inappropriate for detecting tiny objects as they are designed for the classification of larger objects, and do not have the spatial resolution to identify small targets.
We design 'bottom-heavy' versions of backbones that allocate more resources to processing higher-resolution features without introducing any additional computational burden overall.
arXiv Detail & Related papers (2023-03-20T16:50:29Z) - A Coarse to Fine Framework for Object Detection in High Resolution Image [8.316322664637537]
Current approaches of object detection seldom consider detecting tiny object or the large scale variance problem in high resolution images.
We introduce a simple yet efficient approach that improves accuracy of object detection especially for small objects and large scale variance scene.
Our approach can make good use of the sparsity of the objects and the information in high-resolution image, thereby making the detection more efficient.
arXiv Detail & Related papers (2023-03-02T13:04:33Z) - SALISA: Saliency-based Input Sampling for Efficient Video Object
Detection [58.22508131162269]
We propose SALISA, a novel non-uniform SALiency-based Input SAmpling technique for video object detection.
We show that SALISA significantly improves the detection of small objects.
arXiv Detail & Related papers (2022-04-05T17:59:51Z) - High-resolution Iterative Feedback Network for Camouflaged Object
Detection [128.893782016078]
Spotting camouflaged objects that are visually assimilated into the background is tricky for object detection algorithms.
We aim to extract the high-resolution texture details to avoid the detail degradation that causes blurred vision in edges and boundaries.
We introduce a novel HitNet to refine the low-resolution representations by high-resolution features in an iterative feedback manner.
arXiv Detail & Related papers (2022-03-22T11:20:21Z) - Analysis of voxel-based 3D object detection methods efficiency for
real-time embedded systems [93.73198973454944]
Two popular voxel-based 3D object detection methods are studied in this paper.
Our experiments show that these methods mostly fail to detect distant small objects due to the sparsity of the input point clouds at large distances.
Our findings suggest that a considerable part of the computations of existing methods is focused on locations of the scene that do not contribute with successful detection.
arXiv Detail & Related papers (2021-05-21T12:40:59Z) - Anchor-free Small-scale Multispectral Pedestrian Detection [88.7497134369344]
We propose a method for effective and efficient multispectral fusion of the two modalities in an adapted single-stage anchor-free base architecture.
We aim at learning pedestrian representations based on object center and scale rather than direct bounding box predictions.
Results show our method's effectiveness in detecting small-scaled pedestrians.
arXiv Detail & Related papers (2020-08-19T13:13:01Z) - Underwater object detection using Invert Multi-Class Adaboost with deep
learning [37.14538666012363]
We propose a novel neural network architecture, namely Sample-WeIghted hyPEr Network (SWIPENet), for small object detection.
We show that the proposed SWIPENet+IMA framework achieves better performance in detection accuracy against several state-of-the-art object detection approaches.
arXiv Detail & Related papers (2020-05-23T15:30:38Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.