Semi-supervised Counting via Pixel-by-pixel Density Distribution
Modelling
- URL: http://arxiv.org/abs/2402.15297v1
- Date: Fri, 23 Feb 2024 12:48:02 GMT
- Title: Semi-supervised Counting via Pixel-by-pixel Density Distribution
Modelling
- Authors: Hui Lin and Zhiheng Ma and Rongrong Ji and Yaowei Wang and Zhou Su and
Xiaopeng Hong and Deyu Meng
- Abstract summary: This paper focuses on semi-supervised crowd counting, where only a small portion of the training data are labeled.
We formulate the pixel-wise density value to regress as a probability distribution, instead of a single deterministic value.
Our method clearly outperforms the competitors by a large margin under various labeled ratio settings.
- Score: 135.66138766927716
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: This paper focuses on semi-supervised crowd counting, where only a small
portion of the training data are labeled. We formulate the pixel-wise density
value to regress as a probability distribution, instead of a single
deterministic value. On this basis, we propose a semi-supervised crowd-counting
model. Firstly, we design a pixel-wise distribution matching loss to measure
the differences in the pixel-wise density distributions between the prediction
and the ground truth; Secondly, we enhance the transformer decoder by using
density tokens to specialize the forwards of decoders w.r.t. different density
intervals; Thirdly, we design the interleaving consistency self-supervised
learning mechanism to learn from unlabeled data efficiently. Extensive
experiments on four datasets are performed to show that our method clearly
outperforms the competitors by a large margin under various labeled ratio
settings. Code will be released at
https://github.com/LoraLinH/Semi-supervised-Counting-via-Pixel-by-pixel-Density-Distribution-Modelli ng.
Related papers
- Rethinking Polyp Segmentation from an Out-of-Distribution Perspective [37.1338930936671]
We leverage the ability of masked autoencoders -- self-supervised vision transformers trained on a reconstruction task -- to learn in-distribution representations.
We perform out-of-distribution reconstruction and inference, with feature space standardisation to align the latent distribution of the diverse abnormal samples with the statistics of the healthy samples.
Experimental results on six benchmarks show that our model has excellent segmentation performance and generalises across datasets.
arXiv Detail & Related papers (2023-06-13T14:13:16Z) - Using Unreliable Pseudo-Labels for Label-Efficient Semantic Segmentation [78.56076985502291]
We argue that every pixel matters to the model training, even those unreliable and ambiguous pixels.
We separate reliable and unreliable pixels via the entropy of predictions, push each unreliable pixel to a category-wise queue that consists of negative keys.
Considering the training evolution, we adaptively adjust the threshold for the reliable-unreliable partition.
arXiv Detail & Related papers (2023-06-04T09:40:25Z) - Probabilistic Deep Metric Learning for Hyperspectral Image
Classification [91.5747859691553]
This paper proposes a probabilistic deep metric learning framework for hyperspectral image classification.
It aims to predict the category of each pixel for an image captured by hyperspectral sensors.
Our framework can be readily applied to existing hyperspectral image classification methods.
arXiv Detail & Related papers (2022-11-15T17:57:12Z) - Localized Randomized Smoothing for Collective Robustness Certification [60.83383487495282]
We propose a more general collective robustness certificate for all types of models.
We show that this approach is beneficial for the larger class of softly local models.
The certificate is based on our novel localized randomized smoothing approach.
arXiv Detail & Related papers (2022-10-28T14:10:24Z) - Class Balanced PixelNet for Neurological Image Segmentation [20.56747443955369]
We propose an automatic brain tumor segmentation approach (e.g., PixelNet) using a pixel-level convolutional neural network (CNN)
The proposed model has achieved promising results in brain tumor and ischemic stroke segmentation datasets.
arXiv Detail & Related papers (2022-04-23T10:57:54Z) - SePiCo: Semantic-Guided Pixel Contrast for Domain Adaptive Semantic
Segmentation [52.62441404064957]
Domain adaptive semantic segmentation attempts to make satisfactory dense predictions on an unlabeled target domain by utilizing the model trained on a labeled source domain.
Many methods tend to alleviate noisy pseudo labels, however, they ignore intrinsic connections among cross-domain pixels with similar semantic concepts.
We propose Semantic-Guided Pixel Contrast (SePiCo), a novel one-stage adaptation framework that highlights the semantic concepts of individual pixel.
arXiv Detail & Related papers (2022-04-19T11:16:29Z) - Completely Self-Supervised Crowd Counting via Distribution Matching [92.09218454377395]
We propose a complete self-supervision approach to training models for dense crowd counting.
The only input required to train, apart from a large set of unlabeled crowd images, is the approximate upper limit of the crowd count.
Our method dwells on the idea that natural crowds follow a power law distribution, which could be leveraged to yield error signals for backpropagation.
arXiv Detail & Related papers (2020-09-14T13:20:12Z) - Active Crowd Counting with Limited Supervision [13.09054893296829]
We present an active learning framework which enables accurate crowd counting with limited supervision.
We first introduce an active labeling strategy to annotate the most informative images in the dataset and learn the counting model upon them.
In the last cycle when the labeling budget is met, the large amount of unlabeled data are also utilized.
arXiv Detail & Related papers (2020-07-13T12:07:25Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.