HDNet: A Hierarchically Decoupled Network for Crowd Counting
- URL: http://arxiv.org/abs/2212.05722v1
- Date: Mon, 12 Dec 2022 06:01:26 GMT
- Title: HDNet: A Hierarchically Decoupled Network for Crowd Counting
- Authors: Chenliang Gu, Changan Wang, Bin-Bin Gao, Jun Liu, Tianliang Zhang
- Abstract summary: We propose a Hierarchically Decoupled Network (HDNet) to solve the above two problems within a unified framework.
HDNet achieves state-of-the-art performance on several popular counting benchmarks.
- Score: 11.530565995318696
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Recently, density map regression-based methods have dominated in crowd
counting owing to their excellent fitting ability on density distribution.
However, further improvement tends to saturate mainly because of the confusing
background noise and the large density variation. In this paper, we propose a
Hierarchically Decoupled Network (HDNet) to solve the above two problems within
a unified framework. Specifically, a background classification sub-task is
decomposed from the density map prediction task, which is then assigned to a
Density Decoupling Module (DDM) to exploit its highly discriminative ability.
For the remaining foreground prediction sub-task, it is further hierarchically
decomposed to several density-specific sub-tasks by the DDM, which are then
solved by the regression-based experts in a Foreground Density Estimation
Module (FDEM). Although the proposed strategy effectively reduces the
hypothesis space so as to relieve the optimization for those task-specific
experts, the high correlation of these sub-tasks are ignored. Therefore, we
introduce three types of interaction strategies to unify the whole framework,
which are Feature Interaction, Gradient Interaction, and Scale Interaction.
Integrated with the above spirits, HDNet achieves state-of-the-art performance
on several popular counting benchmarks.
Related papers
- Towards the Uncharted: Density-Descending Feature Perturbation for Semi-supervised Semantic Segmentation [51.66997548477913]
We propose a novel feature-level consistency learning framework named Density-Descending Feature Perturbation (DDFP)
Inspired by the low-density separation assumption in semi-supervised learning, our key insight is that feature density can shed a light on the most promising direction for the segmentation classifier to explore.
The proposed DDFP outperforms other designs on feature-level perturbations and shows state of the art performances on both Pascal VOC and Cityscapes dataset.
arXiv Detail & Related papers (2024-03-11T06:59:05Z) - Object Segmentation by Mining Cross-Modal Semantics [68.88086621181628]
We propose a novel approach by mining the Cross-Modal Semantics to guide the fusion and decoding of multimodal features.
Specifically, we propose a novel network, termed XMSNet, consisting of (1) all-round attentive fusion (AF), (2) coarse-to-fine decoder (CFD), and (3) cross-layer self-supervision.
arXiv Detail & Related papers (2023-05-17T14:30:11Z) - Semi-supervised Crowd Counting via Density Agency [57.3635501421658]
We build a learnable auxiliary structure, namely the density agency to bring the recognized foreground regional features close to corresponding density sub-classes.
Second, we propose a density-guided contrastive learning loss to consolidate the backbone feature extractor.
Third, we build a regression head by using a transformer structure to refine the foreground features further.
arXiv Detail & Related papers (2022-09-07T06:34:00Z) - Redesigning Multi-Scale Neural Network for Crowd Counting [68.674652984003]
We introduce a hierarchical mixture of density experts, which hierarchically merges multi-scale density maps for crowd counting.
Within the hierarchical structure, an expert competition and collaboration scheme is presented to encourage contributions from all scales.
Experiments show that our method achieves the state-of-the-art performance on five public datasets.
arXiv Detail & Related papers (2022-08-04T21:49:29Z) - Cascaded Residual Density Network for Crowd Counting [63.714719914701014]
We propose a novel Cascaded Residual Density Network (CRDNet) in a coarse-to-fine approach to generate the high-quality density map for crowd counting more accurately.
A novel additional local count loss is presented to refine the accuracy of crowd counting.
arXiv Detail & Related papers (2021-07-29T03:07:11Z) - PSCNet: Pyramidal Scale and Global Context Guided Network for Crowd
Counting [44.306790250158954]
This paper proposes a novel crowd counting approach based on pyramidal scale module (PSM) and global context module (GCM)
PSM is used to adaptively capture multi-scale information, which can identify a fine boundary of crowds with different image scales.
GCM is devised with low-complexity and lightweight manner, to make the interactive information across the channels of the feature maps more efficient.
arXiv Detail & Related papers (2020-12-07T11:35:56Z) - Deep Semantic Matching with Foreground Detection and Cycle-Consistency [103.22976097225457]
We address weakly supervised semantic matching based on a deep network.
We explicitly estimate the foreground regions to suppress the effect of background clutter.
We develop cycle-consistent losses to enforce the predicted transformations across multiple images to be geometrically plausible and consistent.
arXiv Detail & Related papers (2020-03-31T22:38:09Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.