LDC-Net: A Unified Framework for Localization, Detection and Counting in
Dense Crowds
- URL: http://arxiv.org/abs/2110.04727v1
- Date: Sun, 10 Oct 2021 07:55:44 GMT
- Title: LDC-Net: A Unified Framework for Localization, Detection and Counting in
Dense Crowds
- Authors: Qi wang, Tao Han, Junyu Gao, Yuan Yuan, Xuelong Li
- Abstract summary: The rapid development in visual crowd analysis shows a trend to count people by positioning or even detecting, rather than simply summing a density map.
Some recent work on crowd localization and detection has two limitations: 1) The typical detection methods can not handle the dense crowds and a large variation in scale; 2) The density map methods suffer from performance deficiency in position and box prediction, especially in high density or large-size crowds.
- Score: 103.8635206945196
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: The rapid development in visual crowd analysis shows a trend to count people
by positioning or even detecting, rather than simply summing a density map. It
also enlightens us back to the essence of the field, detection to count, which
can give more abundant crowd information and has more practical applications.
However, some recent work on crowd localization and detection has two
limitations: 1) The typical detection methods can not handle the dense crowds
and a large variation in scale; 2) The density map heuristic methods suffer
from performance deficiency in position and box prediction, especially in high
density or large-size crowds. In this paper, we devise a tailored baseline for
dense crowds location, detection, and counting from a new perspective, named as
LDC-Net for convenience, which has the following features: 1) A strong but
minimalist paradigm to detect objects by only predicting a location map and a
size map, which endows an ability to detect in a scene with any capacity ($0
\sim 10,000+$ persons); 2) Excellent cross-scale ability in facing a large
variation, such as the head ranging in $0 \sim 100,000+$ pixels; 3) Achieve
superior performance in location and box prediction tasks, as well as a
competitive counting performance compared with the density-based methods.
Finally, the source code and pre-trained models will be released.
Related papers
- Crowd counting with crowd attention convolutional neural network [23.96936386014949]
We propose a novel end-to-end model called Crowd Attention Convolutional Neural Network (CAT-CNN)
Our CAT-CNN can adaptively assess the importance of a human head at each pixel location by automatically encoding a confidence map.
With the guidance of the confidence map, the position of human head in estimated density map gets more attention to encode the final density map, which can avoid enormous misjudgements effectively.
arXiv Detail & Related papers (2022-04-15T06:51:58Z) - Cascaded Residual Density Network for Crowd Counting [63.714719914701014]
We propose a novel Cascaded Residual Density Network (CRDNet) in a coarse-to-fine approach to generate the high-quality density map for crowd counting more accurately.
A novel additional local count loss is presented to refine the accuracy of crowd counting.
arXiv Detail & Related papers (2021-07-29T03:07:11Z) - Rethinking Counting and Localization in Crowds:A Purely Point-Based
Framework [59.578339075658995]
We propose a purely point-based framework for joint crowd counting and individual localization.
We design an intuitive solution under this framework, which is called Point to Point Network (P2PNet)
arXiv Detail & Related papers (2021-07-27T11:41:50Z) - Exploring Visual Context for Weakly Supervised Person Search [155.46727990750227]
Person search has recently emerged as a challenging task that jointly addresses pedestrian detection and person re-identification.
Existing approaches follow a fully supervised setting where both bounding box and identity annotations are available.
This paper inventively considers weakly supervised person search with only bounding box annotations.
arXiv Detail & Related papers (2021-06-19T14:47:13Z) - Dense Point Prediction: A Simple Baseline for Crowd Counting and
Localization [17.92958745980573]
We propose a simple yet effective crowd counting and localization network named SCALNet.
We consider those tasks as a pixel-wise dense prediction problem and integrate them into an end-to-end framework.
Experiments on the recent and large-scale benchmark, NWPU-Crowd, show that our approach outperforms the state-of-the-art methods by more than 5% and 10% improvement in crowd localization and counting tasks, respectively.
arXiv Detail & Related papers (2021-04-26T12:08:08Z) - Reciprocal Distance Transform Maps for Crowd Counting and People
Localization in Dense Crowd [16.224760698133462]
We propose a novel Reciprocal Distance Transform (R-DT) map for crowd counting.
Compared with the density maps, the R-DT maps accurately describe the people location, without overlap between nearby heads in dense regions.
We simultaneously implement crowd counting and people localization with a simple network by replacing density maps with R-DT maps.
arXiv Detail & Related papers (2021-02-16T02:25:55Z) - Learning Independent Instance Maps for Crowd Localization [44.6430092887941]
We propose an end-to-end and straightforward framework for crowd localization, named Independent Instance Map segmentation (IIM)
IIM segment crowds into independent connected components, the positions and the crowd counts are obtained.
To improve the segmentation quality for different density regions, we present a differentiable Binarization Module (BM)
BM brings two advantages into localization models: 1) adaptively learn a threshold map for different images to detect each instance more accurately; 2) directly train the model using loss on binary predictions and labels.
arXiv Detail & Related papers (2020-12-08T02:17:19Z) - A Self-Training Approach for Point-Supervised Object Detection and
Counting in Crowds [54.73161039445703]
We propose a novel self-training approach that enables a typical object detector trained only with point-level annotations.
During training, we utilize the available point annotations to supervise the estimation of the center points of objects.
Experimental results show that our approach significantly outperforms state-of-the-art point-supervised methods under both detection and counting tasks.
arXiv Detail & Related papers (2020-07-25T02:14:42Z) - Towards Using Count-level Weak Supervision for Crowd Counting [55.58468947486247]
This paper studies the problem of weakly-supervised crowd counting which learns a model from only a small amount of location-level annotations (fully-supervised) but a large amount of count-level annotations (weakly-supervised)
We devise a simple-yet-effective training strategy, namely Multiple Auxiliary Tasks Training (MATT), to construct regularizes for restricting the freedom of the generated density maps.
arXiv Detail & Related papers (2020-02-29T02:58:36Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.