Feature Calibration Network for Occluded Pedestrian Detection
- URL: http://arxiv.org/abs/2212.05717v1
- Date: Mon, 12 Dec 2022 05:48:34 GMT
- Title: Feature Calibration Network for Occluded Pedestrian Detection
- Authors: Tianliang Zhang, Qixiang Ye, Baochang Zhang, Jianzhuang Liu, Xiaopeng
Zhang, Qi Tian
- Abstract summary: We propose a novel feature learning method in the deep learning framework, referred to as Feature Network (FC-Net)
FC-Net is based on the observation that the visible parts of pedestrians are selective and decisive for detection.
Experiments on CityPersons and Caltech datasets demonstrate that FC-Net improves detection performance on occluded pedestrians up to 10%.
- Score: 137.37275165635882
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Pedestrian detection in the wild remains a challenging problem especially for
scenes containing serious occlusion. In this paper, we propose a novel feature
learning method in the deep learning framework, referred to as Feature
Calibration Network (FC-Net), to adaptively detect pedestrians under various
occlusions. FC-Net is based on the observation that the visible parts of
pedestrians are selective and decisive for detection, and is implemented as a
self-paced feature learning framework with a self-activation (SA) module and a
feature calibration (FC) module. In a new self-activated manner, FC-Net learns
features which highlight the visible parts and suppress the occluded parts of
pedestrians. The SA module estimates pedestrian activation maps by reusing
classifier weights, without any additional parameter involved, therefore
resulting in an extremely parsimony model to reinforce the semantics of
features, while the FC module calibrates the convolutional features for
adaptive pedestrian representation in both pixel-wise and region-based ways.
Experiments on CityPersons and Caltech datasets demonstrate that FC-Net
improves detection performance on occluded pedestrians up to 10% while
maintaining excellent performance on non-occluded instances.
Related papers
- Imagine the Unseen: Occluded Pedestrian Detection via Adversarial Feature Completion [31.488897675973657]
We propose to complete features for occluded regions so as to align the features of pedestrians across different occlusion patterns.
In order to narrow down the gap between completed features and real fully visible ones, we propose an adversarial learning method.
We report experimental results on the CityPersons, Caltech and CrowdHuman datasets.
arXiv Detail & Related papers (2024-05-02T14:20:20Z) - DARTH: Holistic Test-time Adaptation for Multiple Object Tracking [87.72019733473562]
Multiple object tracking (MOT) is a fundamental component of perception systems for autonomous driving.
Despite the urge of safety in driving systems, no solution to the MOT adaptation problem to domain shift in test-time conditions has ever been proposed.
We introduce DARTH, a holistic test-time adaptation framework for MOT.
arXiv Detail & Related papers (2023-10-03T10:10:42Z) - Frequency Perception Network for Camouflaged Object Detection [51.26386921922031]
We propose a novel learnable and separable frequency perception mechanism driven by the semantic hierarchy in the frequency domain.
Our entire network adopts a two-stage model, including a frequency-guided coarse localization stage and a detail-preserving fine localization stage.
Compared with the currently existing models, our proposed method achieves competitive performance in three popular benchmark datasets.
arXiv Detail & Related papers (2023-08-17T11:30:46Z) - PARFormer: Transformer-based Multi-Task Network for Pedestrian Attribute
Recognition [23.814762073093153]
We propose a pure transformer-based multi-task PAR network named PARFormer, which includes four modules.
In the feature extraction module, we build a strong baseline for feature extraction, which achieves competitive results on several PAR benchmarks.
In the viewpoint perception module, we explore the impact of viewpoints on pedestrian attributes, and propose a multi-view contrastive loss.
In the attribute recognition module, we alleviate the negative-positive imbalance problem to generate the attribute predictions.
arXiv Detail & Related papers (2023-04-14T16:27:56Z) - Semantic Feature Integration network for Fine-grained Visual
Classification [5.182627302449368]
We propose the Semantic Feature Integration network (SFI-Net) to address the above difficulties.
By eliminating unnecessary features and reconstructing the semantic relations among discriminative features, our SFI-Net has achieved satisfying performance.
arXiv Detail & Related papers (2023-02-13T07:32:25Z) - CircleNet: Reciprocating Feature Adaptation for Robust Pedestrian
Detection [62.41288479917261]
We propose a novel feature learning model, referred to as CircleNet, to achieve feature adaptation by mimicking the process humans look at low resolution and occluded objects.
Experiments on two pedestrian detection datasets, Caltech and CityPersons, show that CircleNet improves the performance of occluded and low-resolution pedestrians with significant margins.
arXiv Detail & Related papers (2022-12-12T04:10:24Z) - NL-FCOS: Improving FCOS through Non-Local Modules for Object Detection [0.0]
We show that non-local modules combined with an FCOS head (NL-FCOS) are practical and efficient.
We establish state-of-the-art performance in clothing detection and handwritten amount recognition problems.
arXiv Detail & Related papers (2022-03-29T15:00:14Z) - iffDetector: Inference-aware Feature Filtering for Object Detection [70.8678270164057]
We introduce a generic Inference-aware Feature Filtering (IFF) module that can easily be combined with modern detectors.
IFF performs closed-loop optimization by leveraging high-level semantics to enhance the convolutional features.
IFF can be fused with CNN-based object detectors in a plug-and-play manner with negligible computational cost overhead.
arXiv Detail & Related papers (2020-06-23T02:57:29Z) - PSC-Net: Learning Part Spatial Co-occurrence for Occluded Pedestrian
Detection [144.19392893747582]
This paper introduces a novel approach, termed as PSC-Net, for occluded pedestrian detection.
PSC-Net captures both inter and intra-part co-occurrence information of different pedestrian body parts through a Graph Convolutional Network (GCN)
Our PSC-Net exploits the topological structure of pedestrian and does not require part-based annotations or additional visible bounding-box (VBB) information to learn part spatial co-occurrence.
arXiv Detail & Related papers (2020-01-25T02:03:17Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.