Tolerating Annotation Displacement in Dense Object Counting via Point
Annotation Probability Map
- URL: http://arxiv.org/abs/2308.00530v2
- Date: Wed, 8 Nov 2023 07:43:06 GMT
- Title: Tolerating Annotation Displacement in Dense Object Counting via Point
Annotation Probability Map
- Authors: Yuehai Chen, Jing Yang, Badong Chen, Hua Gang, Shaoyi Du
- Abstract summary: Counting objects in crowded scenes remains a challenge to computer vision.
We present a learning target point annotation probability map (PAPM)
We also propose an adaptively learned PAPM method (AL-PAPM)
- Score: 25.203803417049528
- License: http://creativecommons.org/licenses/by-nc-nd/4.0/
- Abstract: Counting objects in crowded scenes remains a challenge to computer vision.
The current deep learning based approach often formulate it as a Gaussian
density regression problem. Such a brute-force regression, though effective,
may not consider the annotation displacement properly which arises from the
human annotation process and may lead to different distributions. We conjecture
that it would be beneficial to consider the annotation displacement in the
dense object counting task. To obtain strong robustness against annotation
displacement, generalized Gaussian distribution (GGD) function with a tunable
bandwidth and shape parameter is exploited to form the learning target point
annotation probability map, PAPM. Specifically, we first present a
hand-designed PAPM method (HD-PAPM), in which we design a function based on GGD
to tolerate the annotation displacement. For end-to-end training, the
hand-designed PAPM may not be optimal for the particular network and dataset.
An adaptively learned PAPM method (AL-PAPM) is proposed. To improve the
robustness to annotation displacement, we design an effective transport cost
function based on GGD. The proposed PAPM is capable of integration with other
methods. We also combine PAPM with P2PNet through modifying the matching cost
matrix, forming P2P-PAPM. This could also improve the robustness to annotation
displacement of P2PNet. Extensive experiments show the superiority of our
proposed methods.
Related papers
- PointOBB-v2: Towards Simpler, Faster, and Stronger Single Point Supervised Oriented Object Detection [14.396629790635474]
Single point supervised oriented object detection has gained attention and made initial progress within the community.
We propose PointOBB-v2, a simpler, faster, and stronger method to generate pseudo rotated boxes from points without relying on any other prior.
arXiv Detail & Related papers (2024-10-10T17:59:56Z) - CPR++: Object Localization via Single Coarse Point Supervision [55.8671776333499]
coarse point refinement (CPR) is first attempt to alleviate semantic variance from an algorithmic perspective.
CPR reduces semantic variance by selecting a semantic centre point in a neighbourhood region to replace the initial annotated point.
CPR++ can obtain scale information and further reduce the semantic variance in a global region.
arXiv Detail & Related papers (2024-01-30T17:38:48Z) - Regressor-Segmenter Mutual Prompt Learning for Crowd Counting [70.49246560246736]
We propose mutual prompt learning (mPrompt) to solve bias and inaccuracy caused by annotation variance.
Experiments show that mPrompt significantly reduces the Mean Average Error (MAE)
arXiv Detail & Related papers (2023-12-04T07:53:59Z) - Robust Face Anti-Spoofing with Dual Probabilistic Modeling [49.14353429234298]
We propose a unified framework called Dual Probabilistic Modeling (DPM), with two dedicated modules, DPM-LQ (Label Quality aware learning) and DPM-DQ (Data Quality aware learning)
DPM-LQ is able to produce robust feature representations without overfitting to the distribution of noisy semantic labels.
DPM-DQ can eliminate data noise from False Reject' and False Accept' during inference by correcting the prediction confidence of noisy data based on its quality distribution.
arXiv Detail & Related papers (2022-04-27T03:44:18Z) - Partial Wasserstein Adversarial Network for Non-rigid Point Set
Registration [33.70389309762202]
Given two point sets, the problem of registration is to recover a transformation that matches one set to the other.
We formulate the registration problem as a partial distribution matching (PDM) problem, where the goal is to partially match the distributions represented by point sets in a metric space.
We propose a partial Wasserstein adversarial network (PWAN), which is able to approximate the PW discrepancy by a neural network, and minimize it by gradient descent.
arXiv Detail & Related papers (2022-03-04T10:23:48Z) - Layout-to-Image Translation with Double Pooling Generative Adversarial
Networks [76.83075646527521]
We propose a novel Double Pooing GAN (DPGAN) for generating photo-realistic and semantically-consistent results from the input layout.
We also propose a novel Double Pooling Module (DPM), which consists of the Square-shape Pooling Module (SPM) and the Rectangle-shape Pooling Module ( RPM)
arXiv Detail & Related papers (2021-08-29T19:55:14Z) - Regressive Domain Adaptation for Unsupervised Keypoint Detection [67.2950306888855]
Domain adaptation (DA) aims at transferring knowledge from a labeled source domain to an unlabeled target domain.
We present a method of regressive domain adaptation (RegDA) for unsupervised keypoint detection.
Our method brings large improvement by 8% to 11% in terms of PCK on different datasets.
arXiv Detail & Related papers (2021-03-10T16:45:22Z) - Augmented Parallel-Pyramid Net for Attention Guided Pose-Estimation [90.28365183660438]
This paper proposes an augmented parallel-pyramid net with attention partial module and differentiable auto-data augmentation.
We define a new pose search space where the sequences of data augmentations are formulated as a trainable and operational CNN component.
Notably, our method achieves the top-1 accuracy on the challenging COCO keypoint benchmark and the state-of-the-art results on the MPII datasets.
arXiv Detail & Related papers (2020-03-17T03:52:17Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.