Plug-and-Play Rescaling Based Crowd Counting in Static Images
- URL: http://arxiv.org/abs/2001.01786v1
- Date: Mon, 6 Jan 2020 21:43:25 GMT
- Title: Plug-and-Play Rescaling Based Crowd Counting in Static Images
- Authors: Usman Sajid and Guanghui Wang
- Abstract summary: We propose a new image patch rescaling module (PRM) and three independent PRM employed crowd counting methods.
The proposed frameworks use the PRM module to rescale the image regions (patches) that require special treatment, whereas the classification process helps in recognizing and discarding any cluttered crowd-like background regions which may result in overestimation.
- Score: 24.150701096083242
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Crowd counting is a challenging problem especially in the presence of huge
crowd diversity across images and complex cluttered crowd-like background
regions, where most previous approaches do not generalize well and consequently
produce either huge crowd underestimation or overestimation. To address these
challenges, we propose a new image patch rescaling module (PRM) and three
independent PRM employed crowd counting methods. The proposed frameworks use
the PRM module to rescale the image regions (patches) that require special
treatment, whereas the classification process helps in recognizing and
discarding any cluttered crowd-like background regions which may result in
overestimation. Experiments on three standard benchmarks and cross-dataset
evaluation show that our approach outperforms the state-of-the-art models in
the RMSE evaluation metric with an improvement up to 10.4%, and possesses
superior generalization ability to new datasets.
Related papers
- Robust Zero-Shot Crowd Counting and Localization With Adaptive Resolution SAM [55.93697196726016]
We propose a simple yet effective crowd counting method by utilizing the Segment-Everything-Everywhere Model (SEEM)
We show that SEEM's performance in dense crowd scenes is limited, primarily due to the omission of many persons in high-density areas.
Our proposed method achieves the best unsupervised performance in crowd counting, while also being comparable to some supervised methods.
arXiv Detail & Related papers (2024-02-27T13:55:17Z) - Progressive Feature Self-reinforcement for Weakly Supervised Semantic
Segmentation [55.69128107473125]
We propose a single-stage approach for Weakly Supervised Semantic (WSSS) with image-level labels.
We adaptively partition the image content into deterministic regions (e.g., confident foreground and background) and uncertain regions (e.g., object boundaries and misclassified categories) for separate processing.
Building upon this, we introduce a complementary self-enhancement method that constrains the semantic consistency between these confident regions and an augmented image with the same class labels.
arXiv Detail & Related papers (2023-12-14T13:21:52Z) - Learning from Multi-Perception Features for Real-Word Image
Super-resolution [87.71135803794519]
We propose a novel SR method called MPF-Net that leverages multiple perceptual features of input images.
Our method incorporates a Multi-Perception Feature Extraction (MPFE) module to extract diverse perceptual information.
We also introduce a contrastive regularization term (CR) that improves the model's learning capability.
arXiv Detail & Related papers (2023-05-26T07:35:49Z) - Progressive Multi-view Human Mesh Recovery with Self-Supervision [68.60019434498703]
Existing solutions typically suffer from poor generalization performance to new settings.
We propose a novel simulation-based training pipeline for multi-view human mesh recovery.
arXiv Detail & Related papers (2022-12-10T06:28:29Z) - Towards More Effective PRM-based Crowd Counting via A Multi-resolution
Fusion and Attention Network [22.235440703471518]
We propose a new PRM based multi-resolution and multi-task crowd counting network.
The proposed model consists of three deep-layered branches with each branch generating feature maps of different resolutions.
The integration of these deep branches with the PRM module and the early-attended blocks proves to be more effective than the original PRM based schemes.
arXiv Detail & Related papers (2021-12-17T18:17:02Z) - Self-supervised Human Detection and Segmentation via Multi-view
Consensus [116.92405645348185]
We propose a multi-camera framework in which geometric constraints are embedded in the form of multi-view consistency during training.
We show that our approach outperforms state-of-the-art self-supervised person detection and segmentation techniques on images that visually depart from those of standard benchmarks.
arXiv Detail & Related papers (2020-12-09T15:47:21Z) - Multi-Resolution Fusion and Multi-scale Input Priors Based Crowd
Counting [20.467558675556173]
The paper proposes a new multi-resolution fusion based end-to-end crowd counting network.
Three input priors are introduced to serve as an efficient and effective alternative to the PRM module.
The proposed approach also has better generalization capability with the best results during the cross-dataset experiments.
arXiv Detail & Related papers (2020-10-04T19:30:13Z) - Shallow Feature Based Dense Attention Network for Crowd Counting [103.67446852449551]
We propose a Shallow feature based Dense Attention Network (SDANet) for crowd counting from still images.
Our method outperforms other existing methods by a large margin, as is evident from a remarkable 11.9% Mean Absolute Error (MAE) drop of our SDANet.
arXiv Detail & Related papers (2020-06-17T13:34:42Z) - ZoomCount: A Zooming Mechanism for Crowd Counting in Static Images [22.387393675233124]
Current approaches cannot handle huge crowd diversity well and perform poorly in extreme cases.
The proposed solution is based on the observation that detecting and handling such extreme cases leads to better crowd estimation.
arXiv Detail & Related papers (2020-02-27T16:57:04Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.