NMS by Representative Region: Towards Crowded Pedestrian Detection by
Proposal Pairing
- URL: http://arxiv.org/abs/2003.12729v2
- Date: Tue, 21 Apr 2020 09:05:54 GMT
- Title: NMS by Representative Region: Towards Crowded Pedestrian Detection by
Proposal Pairing
- Authors: Xin Huang, Zheng Ge, Zequn Jie and Osamu Yoshie
- Abstract summary: The heavy occlusion between pedestrians imposes great challenges to the standard Non-Maximum Suppression (NMS)
This paper proposes a novel Representative Region NMS approach leveraging the less occluded visible parts, effectively removing the redundant boxes without bringing in many false positives.
Experiments on the challenging CrowdHuman and CityPersons benchmarks sufficiently validate the effectiveness of the proposed approach on pedestrian detection in the crowded situation.
- Score: 25.050500817717108
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Although significant progress has been made in pedestrian detection recently,
pedestrian detection in crowded scenes is still challenging. The heavy
occlusion between pedestrians imposes great challenges to the standard
Non-Maximum Suppression (NMS). A relative low threshold of intersection over
union (IoU) leads to missing highly overlapped pedestrians, while a higher one
brings in plenty of false positives. To avoid such a dilemma, this paper
proposes a novel Representative Region NMS approach leveraging the less
occluded visible parts, effectively removing the redundant boxes without
bringing in many false positives. To acquire the visible parts, a novel
Paired-Box Model (PBM) is proposed to simultaneously predict the full and
visible boxes of a pedestrian. The full and visible boxes constitute a pair
serving as the sample unit of the model, thus guaranteeing a strong
correspondence between the two boxes throughout the detection pipeline.
Moreover, convenient feature integration of the two boxes is allowed for the
better performance on both full and visible pedestrian detection tasks.
Experiments on the challenging CrowdHuman and CityPersons benchmarks
sufficiently validate the effectiveness of the proposed approach on pedestrian
detection in the crowded situation.
Related papers
- A PST Algorithm for FPs Suppression in Two-stage CNN Detection Methods [2.288618928064061]
This paper proposes a pedestrian-sensitive training algorithm to help two-stage CNN detection methods learn to distinguish the pedestrian and non-pedestrian samples.
With the help of the proposed algorithm, the detection accuracy of the MetroNext, an smaller and accurate metro passenger detector, is further improved.
arXiv Detail & Related papers (2024-05-24T08:26:14Z) - PSDiff: Diffusion Model for Person Search with Iterative and
Collaborative Refinement [59.6260680005195]
We present a novel Person Search framework based on the Diffusion model, PSDiff.
PSDiff formulates the person search as a dual denoising process from noisy boxes and ReID embeddings to ground truths.
Following the new paradigm, we further design a new Collaborative Denoising Layer (CDL) to optimize detection and ReID sub-tasks in an iterative and collaborative way.
arXiv Detail & Related papers (2023-09-20T08:16:39Z) - Multi-view Tracking Using Weakly Supervised Human Motion Prediction [60.972708589814125]
We argue that an even more effective approach is to predict people motion over time and infer people's presence in individual frames from these.
This enables to enforce consistency both over time and across views of a single temporal frame.
We validate our approach on the PETS2009 and WILDTRACK datasets and demonstrate that it outperforms state-of-the-art methods.
arXiv Detail & Related papers (2022-10-19T17:58:23Z) - STCrowd: A Multimodal Dataset for Pedestrian Perception in Crowded
Scenes [78.95447086305381]
Accurately detecting and tracking pedestrians in 3D space is challenging due to large variations in rotations, poses and scales.
Existing benchmarks either only provide 2D annotations, or have limited 3D annotations with low-density pedestrian distribution.
We introduce a large-scale multimodal dataset, STCrowd, to better evaluate pedestrian perception algorithms in crowded scenarios.
arXiv Detail & Related papers (2022-04-03T08:26:07Z) - DETR for Crowd Pedestrian Detection [114.00860636622949]
The proposed detector PED(Pedestrian End-to-end Detector) outperforms both previous EDs and the baseline Faster-RCNN on CityPersons and CrowdHuman.
It also achieves comparable performance with state-of-the-art pedestrian detection methods.
arXiv Detail & Related papers (2020-12-12T11:02:05Z) - Mutual-Supervised Feature Modulation Network for Occluded Pedestrian
Detection [10.497367073305806]
We propose a novel Mutual-Supervised Feature Modulation (MSFM) network to better handle occluded pedestrian detection.
MSFM module calculates the similarity loss of full body boxes and visible body boxes corresponding to the same pedestrian.
Our approach achieves superior performance compared to other state-of-the-art methods on two challenging pedestrian datasets.
arXiv Detail & Related papers (2020-10-21T03:42:22Z) - Visible Feature Guidance for Crowd Pedestrian Detection [12.8128512764041]
We propose Visible Feature Guidance (VFG) for both training and inference.
During training, we adopt visible feature to regress the simultaneous outputs of visible bounding box and full bounding box.
Then we perform NMS only on visible bounding boxes to achieve the best fitting full box in inference.
arXiv Detail & Related papers (2020-08-23T08:52:52Z) - Anchor-free Small-scale Multispectral Pedestrian Detection [88.7497134369344]
We propose a method for effective and efficient multispectral fusion of the two modalities in an adapted single-stage anchor-free base architecture.
We aim at learning pedestrian representations based on object center and scale rather than direct bounding box predictions.
Results show our method's effectiveness in detecting small-scaled pedestrians.
arXiv Detail & Related papers (2020-08-19T13:13:01Z) - Do Not Disturb Me: Person Re-identification Under the Interference of
Other Pedestrians [97.45805377769354]
This paper presents a novel deep network termed Pedestrian-Interference Suppression Network (PISNet)
PISNet leverages a Query-Guided Attention Block (QGAB) to enhance the feature of the target in the gallery, under the guidance of the query.
Our method is evaluated on two new pedestrian-interference datasets and the results show that the proposed method performs favorably against existing Re-ID methods.
arXiv Detail & Related papers (2020-08-16T17:45:14Z) - Resisting Crowd Occlusion and Hard Negatives for Pedestrian Detection in
the Wild [36.39830329023875]
Crowd and hard negatives are still challenging state-of-the-art pedestrian detectors.
We offer two approaches based on the general region-based detection framework to tackle these challenges.
We consistently achieve high performance on the Caltech-USA and CityPersons benchmarks.
arXiv Detail & Related papers (2020-05-15T03:47:32Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.