PDANet: Pyramid Density-aware Attention Net for Accurate Crowd Counting
- URL: http://arxiv.org/abs/2001.05643v10
- Date: Wed, 15 Apr 2020 02:21:27 GMT
- Title: PDANet: Pyramid Density-aware Attention Net for Accurate Crowd Counting
- Authors: Saeed Amirgholipour, Xiangjian He, Wenjing Jia, Dadong Wang, and Lei
Liu
- Abstract summary: Crowd counting remains an open real-world problem due to the vast scale variations in crowd density within the interested area.
We propose a novel Pyramid Density-Aware Attention-based network, abbreviated as PDANet, that leverages the attention, pyramid scale feature and two branch decoder modules for density-aware crowd counting.
- Score: 7.02081613648832
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Crowd counting, i.e., estimating the number of people in a crowded area, has
attracted much interest in the research community. Although many attempts have
been reported, crowd counting remains an open real-world problem due to the
vast scale variations in crowd density within the interested area, and severe
occlusion among the crowd. In this paper, we propose a novel Pyramid
Density-Aware Attention-based network, abbreviated as PDANet, that leverages
the attention, pyramid scale feature and two branch decoder modules for
density-aware crowd counting. The PDANet utilizes these modules to extract
different scale features, focus on the relevant information, and suppress the
misleading ones. We also address the variation of crowdedness levels among
different images with an exclusive Density-Aware Decoder (DAD). For this
purpose, a classifier evaluates the density level of the input features and
then passes them to the corresponding high and low crowded DAD modules.
Finally, we generate an overall density map by considering the summation of low
and high crowded density maps as spatial attention. Meanwhile, we employ two
losses to create a precise density map for the input scene. Extensive
evaluations conducted on the challenging benchmark datasets well demonstrate
the superior performance of the proposed PDANet in terms of the accuracy of
counting and generated density maps over the well-known state of the arts.
Related papers
- CrowdMAC: Masked Crowd Density Completion for Robust Crowd Density Forecasting [10.332817296500533]
A crowd density forecasting task aims to predict how the crowd density map will change in the future from observed past crowd density maps.
Past crowd density maps are often incomplete due to the miss-detection of pedestrians.
This paper presents a MAsked crowd density Completion framework for crowd density forecasting (CrowdMAC)
CrowdMAC is simultaneously trained to forecast future crowd density maps from partially masked past crowd density maps.
arXiv Detail & Related papers (2024-07-20T02:18:43Z) - Diffusion-based Data Augmentation for Object Counting Problems [62.63346162144445]
We develop a pipeline that utilizes a diffusion model to generate extensive training data.
We are the first to generate images conditioned on a location dot map with a diffusion model.
Our proposed counting loss for the diffusion model effectively minimizes the discrepancies between the location dot map and the crowd images generated.
arXiv Detail & Related papers (2024-01-25T07:28:22Z) - Focus for Free in Density-Based Counting [56.961229110268036]
We introduce two methods that repurpose the available point annotations to enhance counting performance.
The first is a counting-specific augmentation that leverages point annotations to simulate occluded objects in both input and density images.
The second method, foreground distillation, generates foreground masks from the point annotations, from which we train an auxiliary network on images with blacked-out backgrounds.
arXiv Detail & Related papers (2023-06-08T11:54:37Z) - Redesigning Multi-Scale Neural Network for Crowd Counting [68.674652984003]
We introduce a hierarchical mixture of density experts, which hierarchically merges multi-scale density maps for crowd counting.
Within the hierarchical structure, an expert competition and collaboration scheme is presented to encourage contributions from all scales.
Experiments show that our method achieves the state-of-the-art performance on five public datasets.
arXiv Detail & Related papers (2022-08-04T21:49:29Z) - Crowd counting with crowd attention convolutional neural network [23.96936386014949]
We propose a novel end-to-end model called Crowd Attention Convolutional Neural Network (CAT-CNN)
Our CAT-CNN can adaptively assess the importance of a human head at each pixel location by automatically encoding a confidence map.
With the guidance of the confidence map, the position of human head in estimated density map gets more attention to encode the final density map, which can avoid enormous misjudgements effectively.
arXiv Detail & Related papers (2022-04-15T06:51:58Z) - LDC-Net: A Unified Framework for Localization, Detection and Counting in
Dense Crowds [103.8635206945196]
The rapid development in visual crowd analysis shows a trend to count people by positioning or even detecting, rather than simply summing a density map.
Some recent work on crowd localization and detection has two limitations: 1) The typical detection methods can not handle the dense crowds and a large variation in scale; 2) The density map methods suffer from performance deficiency in position and box prediction, especially in high density or large-size crowds.
arXiv Detail & Related papers (2021-10-10T07:55:44Z) - Cascaded Residual Density Network for Crowd Counting [63.714719914701014]
We propose a novel Cascaded Residual Density Network (CRDNet) in a coarse-to-fine approach to generate the high-quality density map for crowd counting more accurately.
A novel additional local count loss is presented to refine the accuracy of crowd counting.
arXiv Detail & Related papers (2021-07-29T03:07:11Z) - Multi-Scale Context Aggregation Network with Attention-Guided for Crowd
Counting [23.336181341124746]
Crowd counting aims to predict the number of people and generate the density map in the image.
There are many challenges, including varying head scales, the diversity of crowd distribution across images and cluttered backgrounds.
We propose a multi-scale context aggregation network (MSCANet) based on single-column encoder-decoder architecture for crowd counting.
arXiv Detail & Related papers (2021-04-06T02:24:06Z) - Coarse- and Fine-grained Attention Network with Background-aware Loss
for Crowd Density Map Estimation [2.690502103971799]
CFANet is a novel method for generating high-quality crowd density maps and people count estimation.
We devise a from-coarse-to-fine progressive attention mechanism by integrating Crowd Region Recognizer (CRR) and Density Level Estimator (DLE) branch.
Our method can not only outperform previous state-of-the-art methods in terms of count accuracy but also improve the image quality of density maps as well as reduce the false recognition ratio.
arXiv Detail & Related papers (2020-11-07T08:05:54Z) - CNN-based Density Estimation and Crowd Counting: A Survey [65.06491415951193]
This paper comprehensively studies the crowd counting models, mainly CNN-based density map estimation methods.
According to the evaluation metrics, we select the top three performers on their crowd counting datasets.
We expect to make reasonable inference and prediction for the future development of crowd counting.
arXiv Detail & Related papers (2020-03-28T13:17:30Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.