Revisiting Crowd Counting: State-of-the-art, Trends, and Future
Perspectives
- URL: http://arxiv.org/abs/2209.07271v1
- Date: Wed, 14 Sep 2022 08:51:02 GMT
- Title: Revisiting Crowd Counting: State-of-the-art, Trends, and Future
Perspectives
- Authors: Muhammad Asif Khan, Hamid Menouar, and Ridha Hamila
- Abstract summary: Crowd counting is an effective tool for situational awareness in public places.
Deep learning methods have been developed to achieve state-of-the-art performance.
- Score: 3.2575001434344286
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Crowd counting is an effective tool for situational awareness in public
places. Automated crowd counting using images and videos is an interesting yet
challenging problem that has gained significant attention in computer vision.
Over the past few years, various deep learning methods have been developed to
achieve state-of-the-art performance. The methods evolved over time vary in
many aspects such as model architecture, input pipeline, learning paradigm,
computational complexity, and accuracy gains etc. In this paper, we present a
systematic and comprehensive review of the most significant contributions in
the area of crowd counting. Although few surveys exist on the topic, our survey
is most up-to date and different in several aspects. First, it provides a more
meaningful categorization of the most significant contributions by model
architectures, learning methods (i.e., loss functions), and evaluation methods
(i.e., evaluation metrics). We chose prominent and distinct works and excluded
similar works. We also sort the well-known crowd counting models by their
performance over benchmark datasets. We believe that this survey can be a good
resource for novice researchers to understand the progressive developments and
contributions over time and the current state-of-the-art.
Related papers
- A Comprehensive Survey on Underwater Image Enhancement Based on Deep Learning [51.7818820745221]
Underwater image enhancement (UIE) presents a significant challenge within computer vision research.
Despite the development of numerous UIE algorithms, a thorough and systematic review is still absent.
arXiv Detail & Related papers (2024-05-30T04:46:40Z) - Universal Time-Series Representation Learning: A Survey [14.340399848964662]
Time-series data exists in every corner of real-world systems and services.
Deep learning has demonstrated remarkable performance in extracting hidden patterns and features from time-series data.
arXiv Detail & Related papers (2024-01-08T08:00:04Z) - Fairness meets Cross-Domain Learning: a new perspective on Models and
Metrics [80.07271410743806]
We study the relationship between cross-domain learning (CD) and model fairness.
We introduce a benchmark on face and medical images spanning several demographic groups as well as classification and localization tasks.
Our study covers 14 CD approaches alongside three state-of-the-art fairness algorithms and shows how the former can outperform the latter.
arXiv Detail & Related papers (2023-03-25T09:34:05Z) - Deep Depth Completion: A Survey [26.09557446012222]
We provide a comprehensive literature review that helps readers better grasp the research trends and clearly understand the current advances.
We investigate the related studies from the design aspects of network architectures, loss functions, benchmark datasets, and learning strategies.
We present a quantitative comparison of model performance on two widely used benchmark datasets, including an indoor and an outdoor dataset.
arXiv Detail & Related papers (2022-05-11T08:24:00Z) - Perceptual Score: What Data Modalities Does Your Model Perceive? [73.75255606437808]
We introduce the perceptual score, a metric that assesses the degree to which a model relies on the different subsets of the input features.
We find that recent, more accurate multi-modal models for visual question-answering tend to perceive the visual data less than their predecessors.
Using the perceptual score also helps to analyze model biases by decomposing the score into data subset contributions.
arXiv Detail & Related papers (2021-10-27T12:19:56Z) - Deep Long-Tailed Learning: A Survey [163.16874896812885]
Deep long-tailed learning aims to train well-performing deep models from a large number of images that follow a long-tailed class distribution.
Long-tailed class imbalance is a common problem in practical visual recognition tasks.
This paper provides a comprehensive survey on recent advances in deep long-tailed learning.
arXiv Detail & Related papers (2021-10-09T15:25:22Z) - From Handcrafted to Deep Features for Pedestrian Detection: A Survey [148.35460817092908]
Pedestrian detection is an important but challenging problem in computer vision.
Over the past decade, significant improvement has been witnessed with the help of handcrafted features and deep features.
In addition to single-spectral pedestrian detection, we also review multi-spectral pedestrian detection.
arXiv Detail & Related papers (2020-10-01T14:51:10Z) - CNN-based Density Estimation and Crowd Counting: A Survey [65.06491415951193]
This paper comprehensively studies the crowd counting models, mainly CNN-based density map estimation methods.
According to the evaluation metrics, we select the top three performers on their crowd counting datasets.
We expect to make reasonable inference and prediction for the future development of crowd counting.
arXiv Detail & Related papers (2020-03-28T13:17:30Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.