Dense Point Prediction: A Simple Baseline for Crowd Counting and
Localization
- URL: http://arxiv.org/abs/2104.12505v1
- Date: Mon, 26 Apr 2021 12:08:08 GMT
- Title: Dense Point Prediction: A Simple Baseline for Crowd Counting and
Localization
- Authors: Yi Wang, Xinyu Hou, and Lap-Pui Chau
- Abstract summary: We propose a simple yet effective crowd counting and localization network named SCALNet.
We consider those tasks as a pixel-wise dense prediction problem and integrate them into an end-to-end framework.
Experiments on the recent and large-scale benchmark, NWPU-Crowd, show that our approach outperforms the state-of-the-art methods by more than 5% and 10% improvement in crowd localization and counting tasks, respectively.
- Score: 17.92958745980573
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: In this paper, we propose a simple yet effective crowd counting and
localization network named SCALNet. Unlike most existing works that separate
the counting and localization tasks, we consider those tasks as a pixel-wise
dense prediction problem and integrate them into an end-to-end framework.
Specifically, for crowd counting, we adopt a counting head supervised by the
Mean Square Error (MSE) loss. For crowd localization, the key insight is to
recognize the keypoint of people, i.e., the center point of heads. We propose a
localization head to distinguish dense crowds trained by two loss functions,
i.e., Negative-Suppressed Focal (NSF) loss and False-Positive (FP) loss, which
balances the positive/negative examples and handles the false-positive
predictions. Experiments on the recent and large-scale benchmark, NWPU-Crowd,
show that our approach outperforms the state-of-the-art methods by more than 5%
and 10% improvement in crowd localization and counting tasks, respectively. The
code is publicly available at https://github.com/WangyiNTU/SCALNet.
Related papers
- Robust Zero-Shot Crowd Counting and Localization With Adaptive Resolution SAM [55.93697196726016]
We propose a simple yet effective crowd counting method by utilizing the Segment-Everything-Everywhere Model (SEEM)
We show that SEEM's performance in dense crowd scenes is limited, primarily due to the omission of many persons in high-density areas.
Our proposed method achieves the best unsupervised performance in crowd counting, while also being comparable to some supervised methods.
arXiv Detail & Related papers (2024-02-27T13:55:17Z) - Scale-Aware Crowd Counting Using a Joint Likelihood Density Map and
Synthetic Fusion Pyramid Network [15.882525477601183]
We develop a Synthetic Fusion Pyramid Network (SPF-Net) with a scale-aware loss function design for accurate crowd counting.
Existing crowd-counting methods assume that the training annotation points were accurate and thus ignore the fact that noisy annotations can lead to large model-learning bias and counting error.
This work is the first to properly handle such noise at multiple scales in end-to-end loss design and thus push the crowd counting state-of-the-art.
arXiv Detail & Related papers (2022-11-13T06:52:47Z) - Reinforcing Local Feature Representation for Weakly-Supervised Dense
Crowd Counting [21.26385035473938]
We propose a self-adaptive feature similarity learning network and a global-local consistency loss to reinforce local representation.
Our proposed method based on different backbones narrows the gap between weakly-supervised and fully-supervised dense crowd counting.
arXiv Detail & Related papers (2022-02-22T05:53:51Z) - LDC-Net: A Unified Framework for Localization, Detection and Counting in
Dense Crowds [103.8635206945196]
The rapid development in visual crowd analysis shows a trend to count people by positioning or even detecting, rather than simply summing a density map.
Some recent work on crowd localization and detection has two limitations: 1) The typical detection methods can not handle the dense crowds and a large variation in scale; 2) The density map methods suffer from performance deficiency in position and box prediction, especially in high density or large-size crowds.
arXiv Detail & Related papers (2021-10-10T07:55:44Z) - Rethinking Counting and Localization in Crowds:A Purely Point-Based
Framework [59.578339075658995]
We propose a purely point-based framework for joint crowd counting and individual localization.
We design an intuitive solution under this framework, which is called Point to Point Network (P2PNet)
arXiv Detail & Related papers (2021-07-27T11:41:50Z) - Direct Measure Matching for Crowd Counting [59.66286603624411]
We propose a new measure-based counting approach to regress the predicted density maps to the scattered point-annotated ground truth directly.
In this paper, we derive a semi-balanced form of Sinkhorn divergence, based on which a Sinkhorn counting loss is designed for measure matching.
arXiv Detail & Related papers (2021-07-04T06:37:33Z) - Learning Independent Instance Maps for Crowd Localization [44.6430092887941]
We propose an end-to-end and straightforward framework for crowd localization, named Independent Instance Map segmentation (IIM)
IIM segment crowds into independent connected components, the positions and the crowd counts are obtained.
To improve the segmentation quality for different density regions, we present a differentiable Binarization Module (BM)
BM brings two advantages into localization models: 1) adaptively learn a threshold map for different images to detect each instance more accurately; 2) directly train the model using loss on binary predictions and labels.
arXiv Detail & Related papers (2020-12-08T02:17:19Z) - Completely Self-Supervised Crowd Counting via Distribution Matching [92.09218454377395]
We propose a complete self-supervision approach to training models for dense crowd counting.
The only input required to train, apart from a large set of unlabeled crowd images, is the approximate upper limit of the crowd count.
Our method dwells on the idea that natural crowds follow a power law distribution, which could be leveraged to yield error signals for backpropagation.
arXiv Detail & Related papers (2020-09-14T13:20:12Z) - A Self-Training Approach for Point-Supervised Object Detection and
Counting in Crowds [54.73161039445703]
We propose a novel self-training approach that enables a typical object detector trained only with point-level annotations.
During training, we utilize the available point annotations to supervise the estimation of the center points of objects.
Experimental results show that our approach significantly outperforms state-of-the-art point-supervised methods under both detection and counting tasks.
arXiv Detail & Related papers (2020-07-25T02:14:42Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.