Hybrid attention network based on progressive embedding scale-context
for crowd counting
- URL: http://arxiv.org/abs/2106.02324v1
- Date: Fri, 4 Jun 2021 08:10:21 GMT
- Title: Hybrid attention network based on progressive embedding scale-context
for crowd counting
- Authors: Fusen Wang and Jun Sang and Zhongyuan Wu and Qi Liu and Nong Sang
- Abstract summary: We propose a Hybrid Attention Network (HAN) by employing Progressive Embedding Scale-context (PES) information.
We build the hybrid attention mechanism through paralleling spatial attention and channel attention module.
PES information enables the network to simultaneously suppress noise and adapt head scale variation.
- Score: 25.866856497266884
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: The existing crowd counting methods usually adopted attention mechanism to
tackle background noise, or applied multi-level features or multi-scales
context fusion to tackle scale variation. However, these approaches deal with
these two problems separately. In this paper, we propose a Hybrid Attention
Network (HAN) by employing Progressive Embedding Scale-context (PES)
information, which enables the network to simultaneously suppress noise and
adapt head scale variation. We build the hybrid attention mechanism through
paralleling spatial attention and channel attention module, which makes the
network to focus more on the human head area and reduce the interference of
background objects. Besides, we embed certain scale-context to the hybrid
attention along the spatial and channel dimensions for alleviating these
counting errors caused by the variation of perspective and head scale. Finally,
we propose a progressive learning strategy through cascading multiple hybrid
attention modules with embedding different scale-context, which can gradually
integrate different scale-context information into the current feature map from
global to local. Ablation experiments provides that the network architecture
can gradually learn multi-scale features and suppress background noise.
Extensive experiments demonstrate that HANet obtain state-of-the-art counting
performance on four mainstream datasets.
Related papers
- Hybrid Convolutional and Attention Network for Hyperspectral Image Denoising [54.110544509099526]
Hyperspectral image (HSI) denoising is critical for the effective analysis and interpretation of hyperspectral data.
We propose a hybrid convolution and attention network (HCANet) to enhance HSI denoising.
Experimental results on mainstream HSI datasets demonstrate the rationality and effectiveness of the proposed HCANet.
arXiv Detail & Related papers (2024-03-15T07:18:43Z) - Multi-scale Adaptive Fusion Network for Hyperspectral Image Denoising [35.491878332394265]
We propose a novel solution to investigate the HSI denoising using a Multi-scale Adaptive Fusion Network (MAFNet)
The proposed MAFNet has achieved better denoising performance than other state-of-the-art techniques.
arXiv Detail & Related papers (2023-04-19T02:00:21Z) - Boosting Crowd Counting via Multifaceted Attention [109.89185492364386]
Large-scale variations often exist within crowd images.
Neither fixed-size convolution kernel of CNN nor fixed-size attention of recent vision transformers can handle this kind of variation.
We propose a Multifaceted Attention Network (MAN) to improve transformer models in local spatial relation encoding.
arXiv Detail & Related papers (2022-03-05T01:36:43Z) - Augmenting Convolutional networks with attention-based aggregation [55.97184767391253]
We show how to augment any convolutional network with an attention-based global map to achieve non-local reasoning.
We plug this learned aggregation layer with a simplistic patch-based convolutional network parametrized by 2 parameters (width and depth)
It yields surprisingly competitive trade-offs between accuracy and complexity, in particular in terms of memory consumption.
arXiv Detail & Related papers (2021-12-27T14:05:41Z) - Multi-View Stereo Network with attention thin volume [0.0]
We propose an efficient multi-view stereo (MVS) network for infering depth value from multiple RGB images.
We introduce the self-attention mechanism to fully aggregate the dominant information from input images.
We also introduce the group-wise correlation to feature aggregation, which greatly reduces the memory and calculation burden.
arXiv Detail & Related papers (2021-10-16T11:51:23Z) - Crowd Counting via Hierarchical Scale Recalibration Network [61.09833400167511]
We propose a novel Hierarchical Scale Recalibration Network (HSRNet) to tackle the task of crowd counting.
HSRNet models rich contextual dependencies and recalibrating multiple scale-associated information.
Our approach can ignore various noises selectively and focus on appropriate crowd scales automatically.
arXiv Detail & Related papers (2020-03-07T10:06:47Z) - ADRN: Attention-based Deep Residual Network for Hyperspectral Image
Denoising [52.01041506447195]
We propose an attention-based deep residual network to learn a mapping from noisy HSI to the clean one.
Experimental results demonstrate that our proposed ADRN scheme outperforms the state-of-the-art methods both in quantitative and visual evaluations.
arXiv Detail & Related papers (2020-03-04T08:36:27Z) - Global Context-Aware Progressive Aggregation Network for Salient Object
Detection [117.943116761278]
We propose a novel network named GCPANet to integrate low-level appearance features, high-level semantic features, and global context features.
We show that the proposed approach outperforms the state-of-the-art methods both quantitatively and qualitatively.
arXiv Detail & Related papers (2020-03-02T04:26:10Z) - Hybrid Multiple Attention Network for Semantic Segmentation in Aerial
Images [24.35779077001839]
We propose a novel attention-based framework named Hybrid Multiple Attention Network (HMANet) to adaptively capture global correlations.
We introduce a simple yet effective region shuffle attention (RSA) module to reduce feature redundant and improve the efficiency of self-attention mechanism.
arXiv Detail & Related papers (2020-01-09T07:47:51Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.