Reinforcing Local Feature Representation for Weakly-Supervised Dense
Crowd Counting
- URL: http://arxiv.org/abs/2202.10681v1
- Date: Tue, 22 Feb 2022 05:53:51 GMT
- Title: Reinforcing Local Feature Representation for Weakly-Supervised Dense
Crowd Counting
- Authors: Xiaoshuang Chen, Hongtao Lu
- Abstract summary: We propose a self-adaptive feature similarity learning network and a global-local consistency loss to reinforce local representation.
Our proposed method based on different backbones narrows the gap between weakly-supervised and fully-supervised dense crowd counting.
- Score: 21.26385035473938
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Fully-supervised crowd counting is a laborious task due to the large amounts
of annotations. Few works focus on weekly-supervised crowd counting, where only
the global crowd numbers are available for training. The main challenge of
weekly-supervised crowd counting is the lack of local supervision information.
To address this problem, we propose a self-adaptive feature similarity learning
(SFSL) network and a global-local consistency (GLC) loss to reinforce local
feature representation. We introduce a feature vector which represents the
unbiased feature estimation of persons. The network updates the feature vector
self-adaptively and utilizes the feature similarity for the regression of crowd
numbers. Besides, the proposed GLC loss leverages the consistency between the
network estimations from global and local areas. The experimental results
demonstrate that our proposed method based on different backbones narrows the
gap between weakly-supervised and fully-supervised dense crowd counting.
Related papers
- Robust Zero-Shot Crowd Counting and Localization With Adaptive Resolution SAM [55.93697196726016]
We propose a simple yet effective crowd counting method by utilizing the Segment-Everything-Everywhere Model (SEEM)
We show that SEEM's performance in dense crowd scenes is limited, primarily due to the omission of many persons in high-density areas.
Our proposed method achieves the best unsupervised performance in crowd counting, while also being comparable to some supervised methods.
arXiv Detail & Related papers (2024-02-27T13:55:17Z) - Integrating Local Real Data with Global Gradient Prototypes for
Classifier Re-Balancing in Federated Long-Tailed Learning [60.41501515192088]
Federated Learning (FL) has become a popular distributed learning paradigm that involves multiple clients training a global model collaboratively.
The data samples usually follow a long-tailed distribution in the real world, and FL on the decentralized and long-tailed data yields a poorly-behaved global model.
In this work, we integrate the local real data with the global gradient prototypes to form the local balanced datasets.
arXiv Detail & Related papers (2023-01-25T03:18:10Z) - CrowdMLP: Weakly-Supervised Crowd Counting via Multi-Granularity MLP [19.42355176075503]
Existing state-of-the-art crowd counting algorithms rely excessively on location-level annotations.
CrowdMLP is presented, which probes into modelling global dependencies of embeddings and regressing total counts.
Experiments demonstrate that CrowdMLP significantly outperforms existing weakly-supervised counting algorithms.
arXiv Detail & Related papers (2022-03-15T19:57:55Z) - CrowdFormer: Weakly-supervised Crowd counting with Improved
Generalizability [2.8174125805742416]
We propose a weakly-supervised method for crowd counting using a pyramid vision transformer.
Our method is comparable to the state-of-the-art on the benchmark crowd datasets.
arXiv Detail & Related papers (2022-03-07T23:10:40Z) - Boosting Crowd Counting via Multifaceted Attention [109.89185492364386]
Large-scale variations often exist within crowd images.
Neither fixed-size convolution kernel of CNN nor fixed-size attention of recent vision transformers can handle this kind of variation.
We propose a Multifaceted Attention Network (MAN) to improve transformer models in local spatial relation encoding.
arXiv Detail & Related papers (2022-03-05T01:36:43Z) - Wisdom of (Binned) Crowds: A Bayesian Stratification Paradigm for Crowd
Counting [16.09823718637455]
We analyze the performance of crowd counting approaches across standard datasets at per strata level and in aggregate.
Our contributions represent a nuanced, statistically balanced and fine-grained characterization of performance for crowd counting approaches.
arXiv Detail & Related papers (2021-08-19T16:50:31Z) - Decentralized Local Stochastic Extra-Gradient for Variational
Inequalities [125.62877849447729]
We consider distributed variational inequalities (VIs) on domains with the problem data that is heterogeneous (non-IID) and distributed across many devices.
We make a very general assumption on the computational network that covers the settings of fully decentralized calculations.
We theoretically analyze its convergence rate in the strongly-monotone, monotone, and non-monotone settings.
arXiv Detail & Related papers (2021-06-15T17:45:51Z) - Dense Point Prediction: A Simple Baseline for Crowd Counting and
Localization [17.92958745980573]
We propose a simple yet effective crowd counting and localization network named SCALNet.
We consider those tasks as a pixel-wise dense prediction problem and integrate them into an end-to-end framework.
Experiments on the recent and large-scale benchmark, NWPU-Crowd, show that our approach outperforms the state-of-the-art methods by more than 5% and 10% improvement in crowd localization and counting tasks, respectively.
arXiv Detail & Related papers (2021-04-26T12:08:08Z) - PSCNet: Pyramidal Scale and Global Context Guided Network for Crowd
Counting [44.306790250158954]
This paper proposes a novel crowd counting approach based on pyramidal scale module (PSM) and global context module (GCM)
PSM is used to adaptively capture multi-scale information, which can identify a fine boundary of crowds with different image scales.
GCM is devised with low-complexity and lightweight manner, to make the interactive information across the channels of the feature maps more efficient.
arXiv Detail & Related papers (2020-12-07T11:35:56Z) - Semi-Supervised Crowd Counting via Self-Training on Surrogate Tasks [50.78037828213118]
This paper tackles the semi-supervised crowd counting problem from the perspective of feature learning.
We propose a novel semi-supervised crowd counting method which is built upon two innovative components.
arXiv Detail & Related papers (2020-07-07T05:30:53Z) - Global Distance-distributions Separation for Unsupervised Person
Re-identification [93.39253443415392]
Existing unsupervised ReID approaches often fail in correctly identifying the positive samples and negative samples through the distance-based matching/ranking.
We introduce a global distance-distributions separation constraint over the two distributions to encourage the clear separation of positive and negative samples from a global view.
We show that our method leads to significant improvement over the baselines and achieves the state-of-the-art performance.
arXiv Detail & Related papers (2020-06-01T07:05:39Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.