Density Map Guided Object Detection in Aerial Images
- URL: http://arxiv.org/abs/2004.05520v1
- Date: Sun, 12 Apr 2020 01:32:00 GMT
- Title: Density Map Guided Object Detection in Aerial Images
- Authors: Changlin Li and Taojiannan Yang and Sijie Zhu and Chen Chen and
Shanyue Guan
- Abstract summary: Object detection in high-resolution aerial images is a challenging task because of 1) the large variation in object size, and 2) non-uniform distribution of objects.
A common solution is to divide the large aerial image into small (uniform) crops and then apply object detection on each small crop.
We propose a Density-Map guided object detection Network (DMNet), which is inspired from the observation that the object density map of an image presents how objects distribute in terms of the pixel intensity of the map.
- Score: 18.874882959220887
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Object detection in high-resolution aerial images is a challenging task
because of 1) the large variation in object size, and 2) non-uniform
distribution of objects. A common solution is to divide the large aerial image
into small (uniform) crops and then apply object detection on each small crop.
In this paper, we investigate the image cropping strategy to address these
challenges. Specifically, we propose a Density-Map guided object detection
Network (DMNet), which is inspired from the observation that the object density
map of an image presents how objects distribute in terms of the pixel intensity
of the map. As pixel intensity varies, it is able to tell whether a region has
objects or not, which in turn provides guidance for cropping images
statistically. DMNet has three key components: a density map generation module,
an image cropping module and an object detector. DMNet generates a density map
and learns scale information based on density intensities to form cropping
regions. Extensive experiments show that DMNet achieves state-of-the-art
performance on two popular aerial image datasets, i.e. VisionDrone and UAVDT.
Related papers
- Toward General Object-level Mapping from Sparse Views with 3D Diffusion Priors [8.701106353658346]
General Object-level mapping builds a 3D map of objects in a scene with detailed shapes and poses from multi-view sensor observations.
Recent work introduces generative shape priors for object-level mapping from sparse views, but is limited to single-category objects.
In this work, we propose a General Object-level Mapping system, GOM, which leverages a 3D diffusion model as shape prior with multi-category support and outputs Neural Radiance Fields (NeRFs) for both texture and geometry for all objects in a scene.
arXiv Detail & Related papers (2024-10-07T21:33:30Z) - VFMM3D: Releasing the Potential of Image by Vision Foundation Model for Monocular 3D Object Detection [80.62052650370416]
monocular 3D object detection holds significant importance across various applications, including autonomous driving and robotics.
In this paper, we present VFMM3D, an innovative framework that leverages the capabilities of Vision Foundation Models (VFMs) to accurately transform single-view images into LiDAR point cloud representations.
arXiv Detail & Related papers (2024-04-15T03:12:12Z) - 3D Small Object Detection with Dynamic Spatial Pruning [62.72638845817799]
We propose an efficient feature pruning strategy for 3D small object detection.
We present a multi-level 3D detector named DSPDet3D which benefits from high spatial resolution.
It takes less than 2s to directly process a whole building consisting of more than 4500k points while detecting out almost all objects.
arXiv Detail & Related papers (2023-05-05T17:57:04Z) - Improved Counting and Localization from Density Maps for Object
Detection in 2D and 3D Microscopy Imaging [4.746727774540763]
We propose an alternative method to count and localize objects from the density map.
Our results show improved performance in counting and localization of objects in 2D and 3D microscopy data.
arXiv Detail & Related papers (2022-03-29T15:54:19Z) - You Better Look Twice: a new perspective for designing accurate
detectors with reduced computations [56.34005280792013]
BLT-net is a new low-computation two-stage object detection architecture.
It reduces computations by separating objects from background using a very lite first-stage.
Resulting image proposals are then processed in the second-stage by a highly accurate model.
arXiv Detail & Related papers (2021-07-21T12:39:51Z) - Scale Normalized Image Pyramids with AutoFocus for Object Detection [75.71320993452372]
A scale normalized image pyramid (SNIP) is generated that, like human vision, only attends to objects within a fixed size range at different scales.
We propose an efficient spatial sub-sampling scheme which only operates on fixed-size sub-regions likely to contain objects.
The resulting algorithm is referred to as AutoFocus and results in a 2.5-5 times speed-up during inference when used with SNIP.
arXiv Detail & Related papers (2021-02-10T18:57:53Z) - AMRNet: Chips Augmentation in Aerial Images Object Detection [7.817259518365044]
Three augmentation methods are introduced to relieve problems such as scale variation, object sparsity, and class imbalance.
Our model achieves state-of-the-art perfomance on two popular aerial image datasets of VisDrone and UAVDT.
arXiv Detail & Related papers (2020-09-15T15:16:06Z) - Counting from Sky: A Large-scale Dataset for Remote Sensing Object
Counting and A Benchmark Method [52.182698295053264]
We are interested in counting dense objects from remote sensing images. Compared with object counting in a natural scene, this task is challenging in the following factors: large scale variation, complex cluttered background, and orientation arbitrariness.
To address these issues, we first construct a large-scale object counting dataset with remote sensing images, which contains four important geographic objects.
We then benchmark the dataset by designing a novel neural network that can generate a density map of an input image.
arXiv Detail & Related papers (2020-08-28T03:47:49Z) - Counting dense objects in remote sensing images [52.182698295053264]
Estimating number of interested objects from a given image is a challenging yet important task.
In this paper, we are interested in counting dense objects from remote sensing images.
To address these issues, we first construct a large-scale object counting dataset based on remote sensing images.
We then benchmark the dataset by designing a novel neural network which can generate density map of an input image.
arXiv Detail & Related papers (2020-02-14T09:13:54Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.