Which to Match? Selecting Consistent GT-Proposal Assignment for
Pedestrian Detection
- URL: http://arxiv.org/abs/2103.10091v1
- Date: Thu, 18 Mar 2021 08:54:51 GMT
- Title: Which to Match? Selecting Consistent GT-Proposal Assignment for
Pedestrian Detection
- Authors: Yan Luo, Chongyang Zhang, Muming Zhao, Hao Zhou, Jun Sun
- Abstract summary: The fixed Intersection over Union (IoU) based assignment-regression manner still limits their performance.
We introduce one geometric sensitive search algorithm as a new assignment and regression metric.
Specifically, we boost the MR-FPPI under R$_75$ by 8.8% on Citypersons dataset.
- Score: 23.92066492219922
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Accurate pedestrian classification and localization have received
considerable attention due to their wide applications such as security
monitoring, autonomous driving, etc. Although pedestrian detectors have made
great progress in recent years, the fixed Intersection over Union (IoU) based
assignment-regression manner still limits their performance. Two main factors
are responsible for this: 1) the IoU threshold faces a dilemma that a lower one
will result in more false positives, while a higher one will filter out the
matched positives; 2) the IoU-based GT-Proposal assignment suffers from the
inconsistent supervision problem that spatially adjacent proposals with similar
features are assigned to different ground-truth boxes, which means some very
similar proposals may be forced to regress towards different targets, and thus
confuses the bounding-box regression when predicting the location results. In
this paper, we first put forward the question that \textbf{Regression
Direction} would affect the performance for pedestrian detection. Consequently,
we address the weakness of IoU by introducing one geometric sensitive search
algorithm as a new assignment and regression metric. Different from the
previous IoU-based \textbf{one-to-one} assignment manner of one proposal to one
ground-truth box, the proposed method attempts to seek a reasonable matching
between the sets of proposals and ground-truth boxes. Specifically, we boost
the MR-FPPI under R$_{75}$ by 8.8\% on Citypersons dataset. Furthermore, by
incorporating this method as a metric into the state-of-the-art pedestrian
detectors, we show a consistent improvement.
Related papers
- Prototypical Contrastive Learning through Alignment and Uniformity for
Recommendation [6.790779112538357]
We present underlinePrototypical contrastive learning through underlineAlignment and underlineUniformity for recommendation.
Specifically, we first propose prototypes as a latent space to ensure consistency across different augmentations from the origin graph.
The absence of explicit negatives means that directly optimizing the consistency loss between instance and prototype could easily result in dimensional collapse issues.
arXiv Detail & Related papers (2024-02-03T08:19:26Z) - Rank-DETR for High Quality Object Detection [52.82810762221516]
A highly performant object detector requires accurate ranking for the bounding box predictions.
In this work, we introduce a simple and highly performant DETR-based object detector by proposing a series of rank-oriented designs.
arXiv Detail & Related papers (2023-10-13T04:48:32Z) - Hausdorff Distance Matching with Adaptive Query Denoising for Rotated Detection Transformer [4.137346786534721]
We introduce a Hausdorff distance-based cost for bipartite matching, which more accurately quantifies the discrepancy between predictions and ground truths.
We propose an adaptive query denoising method that employs bipartite matching to selectively eliminate noised queries that detract from model improvement.
arXiv Detail & Related papers (2023-05-12T16:42:54Z) - Ranking-Based Siamese Visual Tracking [31.2428211299895]
Siamese-based trackers mainly formulate the visual tracking into two independent subtasks, including classification and localization.
This paper proposes a ranking-based optimization algorithm to explore the relationship among different proposals.
The proposed two ranking losses are compatible with most Siamese trackers and incur no additional computation for inference.
arXiv Detail & Related papers (2022-05-24T03:46:40Z) - Decoupled IoU Regression for Object Detection [31.9114940121939]
Non-maximum suppression (NMS) is widely used in object detection pipelines for removing duplicated bounding boxes.
Inconsistency between the confidence for NMS and the real localization confidence seriously affects detection performance.
We propose a novel Decoupled IoU Regression model to handle these problems.
arXiv Detail & Related papers (2022-02-02T04:01:11Z) - Higher Performance Visual Tracking with Dual-Modal Localization [106.91097443275035]
Visual Object Tracking (VOT) has synchronous needs for both robustness and accuracy.
We propose a dual-modal framework for target localization, consisting of robust localization suppressingors via ONR and the accurate localization attending to the target center precisely via OFC.
arXiv Detail & Related papers (2021-03-18T08:47:56Z) - Regressive Domain Adaptation for Unsupervised Keypoint Detection [67.2950306888855]
Domain adaptation (DA) aims at transferring knowledge from a labeled source domain to an unlabeled target domain.
We present a method of regressive domain adaptation (RegDA) for unsupervised keypoint detection.
Our method brings large improvement by 8% to 11% in terms of PCK on different datasets.
arXiv Detail & Related papers (2021-03-10T16:45:22Z) - Collaborative Training between Region Proposal Localization and
Classification for Domain Adaptive Object Detection [121.28769542994664]
Domain adaptation for object detection tries to adapt the detector from labeled datasets to unlabeled ones for better performance.
In this paper, we are the first to reveal that the region proposal network (RPN) and region proposal classifier(RPC) demonstrate significantly different transferability when facing large domain gap.
arXiv Detail & Related papers (2020-09-17T07:39:52Z) - Probabilistic Anchor Assignment with IoU Prediction for Object Detection [9.703212439661097]
In object detection, determining which anchors to assign as positive or negative samples, known as anchor assignment, has been revealed as a core procedure that can significantly affect a model's performance.
We propose a novel anchor assignment strategy that adaptively separates anchors into positive and negative samples for a ground truth bounding box according to the model's learning status.
arXiv Detail & Related papers (2020-07-16T04:26:57Z) - Scope Head for Accurate Localization in Object Detection [135.9979405835606]
We propose a novel detector coined as ScopeNet, which models anchors of each location as a mutually dependent relationship.
With our concise and effective design, the proposed ScopeNet achieves state-of-the-art results on COCO.
arXiv Detail & Related papers (2020-05-11T04:00:09Z) - Detection in Crowded Scenes: One Proposal, Multiple Predictions [79.28850977968833]
We propose a proposal-based object detector, aiming at detecting highly-overlapped instances in crowded scenes.
The key of our approach is to let each proposal predict a set of correlated instances rather than a single one in previous proposal-based frameworks.
Our detector can obtain 4.9% AP gains on challenging CrowdHuman dataset and 1.0% $textMR-2$ improvements on CityPersons dataset.
arXiv Detail & Related papers (2020-03-20T09:48:53Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.