Recurrent Distillation based Crowd Counting
- URL: http://arxiv.org/abs/2006.07755v1
- Date: Sun, 14 Jun 2020 01:04:52 GMT
- Title: Recurrent Distillation based Crowd Counting
- Authors: Yue Gu, Wenxi Liu
- Abstract summary: We propose a simple yet effective crowd counting framework that is able to achieve the state-of-the-art performance on various crowded scenes.
In experiments, we demonstrate that, with our simple convolutional neural network architecture strengthened by our proposed training algorithm, our model is able to outperform or be comparable with the state-of-the-art methods.
- Score: 23.4315417286694
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: In recent years, with the progress of deep learning technologies, crowd
counting has been rapidly developed. In this work, we propose a simple yet
effective crowd counting framework that is able to achieve the state-of-the-art
performance on various crowded scenes. In particular, we first introduce a
perspective-aware density map generation method that is able to produce
ground-truth density maps from point annotations to train crowd counting model
to accomplish superior performance than prior density map generation
techniques. Besides, leveraging our density map generation method, we propose
an iterative distillation algorithm to progressively enhance our model with
identical network structures, without significantly sacrificing the dimension
of the output density maps. In experiments, we demonstrate that, with our
simple convolutional neural network architecture strengthened by our proposed
training algorithm, our model is able to outperform or be comparable with the
state-of-the-art methods. Furthermore, we also evaluate our density map
generation approach and distillation algorithm in ablation studies.
Related papers
- A Paradigm Shift in Mouza Map Vectorization: A Human-Machine Collaboration Approach [2.315458677488431]
Current manual digitization methods are time-consuming and labor-intensive.
Our study proposes a semi-automated approach to streamline the digitization process, saving both time and human resources.
arXiv Detail & Related papers (2024-10-21T12:47:36Z) - Diffusion-based Data Augmentation for Object Counting Problems [62.63346162144445]
We develop a pipeline that utilizes a diffusion model to generate extensive training data.
We are the first to generate images conditioned on a location dot map with a diffusion model.
Our proposed counting loss for the diffusion model effectively minimizes the discrepancies between the location dot map and the crowd images generated.
arXiv Detail & Related papers (2024-01-25T07:28:22Z) - Hi-Map: Hierarchical Factorized Radiance Field for High-Fidelity
Monocular Dense Mapping [51.739466714312805]
We introduce Hi-Map, a novel monocular dense mapping approach based on Neural Radiance Field (NeRF)
Hi-Map is exceptional in its capacity to achieve efficient and high-fidelity mapping using only posed RGB inputs.
arXiv Detail & Related papers (2024-01-06T12:32:25Z) - $CrowdDiff$: Multi-hypothesis Crowd Density Estimation using Diffusion Models [26.55769846846542]
Crowd counting is a fundamental problem in crowd analysis which is typically accomplished by estimating a crowd density map and summing over the density values.
We present $CrowdDiff$ that generates the crowd density map as a reverse diffusion process.
In addition, owing to the nature of the diffusion model, we introduce producing multiple density maps to improve the counting performance.
arXiv Detail & Related papers (2023-03-22T17:58:01Z) - Redesigning Multi-Scale Neural Network for Crowd Counting [68.674652984003]
We introduce a hierarchical mixture of density experts, which hierarchically merges multi-scale density maps for crowd counting.
Within the hierarchical structure, an expert competition and collaboration scheme is presented to encourage contributions from all scales.
Experiments show that our method achieves the state-of-the-art performance on five public datasets.
arXiv Detail & Related papers (2022-08-04T21:49:29Z) - PANet: Perspective-Aware Network with Dynamic Receptive Fields and
Self-Distilling Supervision for Crowd Counting [63.84828478688975]
We propose a novel perspective-aware approach called PANet to address the perspective problem.
Based on the observation that the size of the objects varies greatly in one image due to the perspective effect, we propose the dynamic receptive fields (DRF) framework.
The framework is able to adjust the receptive field by the dilated convolution parameters according to the input image, which helps the model to extract more discriminative features for each local region.
arXiv Detail & Related papers (2021-10-31T04:43:05Z) - Cascaded Residual Density Network for Crowd Counting [63.714719914701014]
We propose a novel Cascaded Residual Density Network (CRDNet) in a coarse-to-fine approach to generate the high-quality density map for crowd counting more accurately.
A novel additional local count loss is presented to refine the accuracy of crowd counting.
arXiv Detail & Related papers (2021-07-29T03:07:11Z) - CAMERAS: Enhanced Resolution And Sanity preserving Class Activation
Mapping for image saliency [61.40511574314069]
Backpropagation image saliency aims at explaining model predictions by estimating model-centric importance of individual pixels in the input.
We propose CAMERAS, a technique to compute high-fidelity backpropagation saliency maps without requiring any external priors.
arXiv Detail & Related papers (2021-06-20T08:20:56Z) - Enhancing Deep Neural Network Saliency Visualizations with Gradual
Extrapolation [0.0]
We propose an enhancement technique of the Class Activation Mapping methods like Grad-CAM or Excitation Backpropagation.
Our idea, called Gradual Extrapolation, can supplement any method that generates a heatmap picture by sharpening the output.
arXiv Detail & Related papers (2021-04-11T07:39:35Z) - A Strong Baseline for Crowd Counting and Unsupervised People
Localization [2.690502103971799]
We explore a strong baseline for crowd counting and an unsupervised people localization algorithm based on estimated density maps.
We collect different backbones and training tricks and evaluate the impact of changing them.
We propose a clustering algorithm named isolated KMeans to locate the heads in density maps.
arXiv Detail & Related papers (2020-11-07T08:29:03Z) - Coarse- and Fine-grained Attention Network with Background-aware Loss
for Crowd Density Map Estimation [2.690502103971799]
CFANet is a novel method for generating high-quality crowd density maps and people count estimation.
We devise a from-coarse-to-fine progressive attention mechanism by integrating Crowd Region Recognizer (CRR) and Density Level Estimator (DLE) branch.
Our method can not only outperform previous state-of-the-art methods in terms of count accuracy but also improve the image quality of density maps as well as reduce the false recognition ratio.
arXiv Detail & Related papers (2020-11-07T08:05:54Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.