Understanding the Impact of Image Quality and Distance of Objects to
Object Detection Performance
- URL: http://arxiv.org/abs/2209.08237v1
- Date: Sat, 17 Sep 2022 04:05:01 GMT
- Title: Understanding the Impact of Image Quality and Distance of Objects to
Object Detection Performance
- Authors: Yu Hao, Haoyang Pei, Yixuan Lyu, Zhongzheng Yuan, John-Ross Rizzo, Yao
Wang, Yi Fang
- Abstract summary: This paper examines the impact of spatial and amplitude resolution, as well as object distance, on object detection accuracy and computational cost.
We develop a resolution-adaptive variant of YOLOv5 (RA-YOLO), which varies the number of scales in the feature pyramid and detection head based on the spatial resolution of the input image.
- Score: 11.856281907276145
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Deep learning has made great strides for object detection in images. The
detection accuracy and computational cost of object detection depend on the
spatial resolution of an image, which may be constrained by both the camera and
storage considerations. Compression is often achieved by reducing either
spatial or amplitude resolution or, at times, both, both of which have
well-known effects on performance. Detection accuracy also depends on the
distance of the object of interest from the camera. Our work examines the
impact of spatial and amplitude resolution, as well as object distance, on
object detection accuracy and computational cost. We develop a
resolution-adaptive variant of YOLOv5 (RA-YOLO), which varies the number of
scales in the feature pyramid and detection head based on the spatial
resolution of the input image. To train and evaluate this new method, we
created a dataset of images with diverse spatial and amplitude resolutions by
combining images from the TJU and Eurocity datasets and generating different
resolutions by applying spatial resizing and compression. We first show that
RA-YOLO achieves a good trade-off between detection accuracy and inference time
over a large range of spatial resolutions. We then evaluate the impact of
spatial and amplitude resolutions on object detection accuracy using the
proposed RA-YOLO model. We demonstrate that the optimal spatial resolution that
leads to the highest detection accuracy depends on the 'tolerated' image size.
We further assess the impact of the distance of an object to the camera on the
detection accuracy and show that higher spatial resolution enables a greater
detection range. These results provide important guidelines for choosing the
image spatial resolution and compression settings predicated on available
bandwidth, storage, desired inference time, and/or desired detection range, in
practical applications.
Related papers
- SaccadeDet: A Novel Dual-Stage Architecture for Rapid and Accurate Detection in Gigapixel Images [50.742420049839474]
'SaccadeDet' is an innovative architecture for gigapixel-level object detection, inspired by the human eye saccadic movement.
Our approach, evaluated on the PANDA dataset, achieves an 8x speed increase over the state-of-the-art methods.
It also demonstrates significant potential in gigapixel-level pathology analysis through its application to Whole Slide Imaging.
arXiv Detail & Related papers (2024-07-25T11:22:54Z) - DyRA: Portable Dynamic Resolution Adjustment Network for Existing Detectors [0.669087470775851]
This paper introduces DyRA, a dynamic resolution adjustment network providing an image-specific scale factor for existing detectors.
Loss function is devised to minimize the accuracy drop across contrasting objectives of different-sized objects for scaling.
arXiv Detail & Related papers (2023-11-28T07:52:41Z) - OBSUM: An object-based spatial unmixing model for spatiotemporal fusion
of remote sensing images [12.94382743563284]
This study proposes Object-Based Spatial Unmixing Model (OBSUM), which incorporates object-based image analysis and spatial unmixing.
OBSUM can be applied using only one fine image at the base resolution date and one coarse image date, without the need of a coarse image at the base date date.
It has great potential to generate accurate and high-resolution time-series for supporting various remote sensing applications.
arXiv Detail & Related papers (2023-10-14T07:07:27Z) - A Coarse to Fine Framework for Object Detection in High Resolution Image [8.316322664637537]
Current approaches of object detection seldom consider detecting tiny object or the large scale variance problem in high resolution images.
We introduce a simple yet efficient approach that improves accuracy of object detection especially for small objects and large scale variance scene.
Our approach can make good use of the sparsity of the objects and the information in high-resolution image, thereby making the detection more efficient.
arXiv Detail & Related papers (2023-03-02T13:04:33Z) - Towards Spatial Equilibrium Object Detection [88.9747319572368]
In this paper, we study the spatial disequilibrium problem of modern object detectors.
We propose to quantify this problem by measuring the detection performance over zones.
This motivates us to design a more generalized measurement, termed Spatial equilibrium Precision.
arXiv Detail & Related papers (2023-01-14T17:33:26Z) - Progressive Domain Adaptation with Contrastive Learning for Object
Detection in the Satellite Imagery [0.0]
State-of-the-art object detection methods largely fail to identify small and dense objects.
We propose a small object detection pipeline that improves the feature extraction process.
We show we can alleviate the degradation of object identification in previously unseen datasets.
arXiv Detail & Related papers (2022-09-06T15:16:35Z) - Fast Fourier Convolution Based Remote Sensor Image Object Detection for
Earth Observation [0.0]
We propose a Frequency-aware Feature Pyramid Framework (FFPF) for remote sensing object detection.
F-ResNet is proposed to perceive the spectral context information by plugging the frequency domain convolution into each stage of the backbone.
The BSFPN is designed to use a bilateral sampling strategy and skipping connection to better model the association of object features at different scales.
arXiv Detail & Related papers (2022-09-01T15:50:58Z) - Towards Model Generalization for Monocular 3D Object Detection [57.25828870799331]
We present an effective unified camera-generalized paradigm (CGP) for Mono3D object detection.
We also propose the 2D-3D geometry-consistent object scaling strategy (GCOS) to bridge the gap via an instance-level augment.
Our method called DGMono3D achieves remarkable performance on all evaluated datasets and surpasses the SoTA unsupervised domain adaptation scheme.
arXiv Detail & Related papers (2022-05-23T23:05:07Z) - SALISA: Saliency-based Input Sampling for Efficient Video Object
Detection [58.22508131162269]
We propose SALISA, a novel non-uniform SALiency-based Input SAmpling technique for video object detection.
We show that SALISA significantly improves the detection of small objects.
arXiv Detail & Related papers (2022-04-05T17:59:51Z) - You Better Look Twice: a new perspective for designing accurate
detectors with reduced computations [56.34005280792013]
BLT-net is a new low-computation two-stage object detection architecture.
It reduces computations by separating objects from background using a very lite first-stage.
Resulting image proposals are then processed in the second-stage by a highly accurate model.
arXiv Detail & Related papers (2021-07-21T12:39:51Z) - AdaZoom: Adaptive Zoom Network for Multi-Scale Object Detection in Large
Scenes [57.969186815591186]
Detection in large-scale scenes is a challenging problem due to small objects and extreme scale variation.
We propose a novel Adaptive Zoom (AdaZoom) network as a selective magnifier with flexible shape and focal length to adaptively zoom the focus regions for object detection.
arXiv Detail & Related papers (2021-06-19T03:30:22Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.