FMT:Fusing Multi-task Convolutional Neural Network for Person Search
- URL: http://arxiv.org/abs/2003.00406v1
- Date: Sun, 1 Mar 2020 05:20:47 GMT
- Title: FMT:Fusing Multi-task Convolutional Neural Network for Person Search
- Authors: Sulan Zhai, Shunqiang Liu, Xiao Wang, Jin Tang
- Abstract summary: We propose a fusing multi-task convolutional neural network(FMT-CNN) to tackle the correlation and heterogeneity of detection and re-identification.
Experiment results on CUHK-SYSU Person Search dataset show that the performance of our proposed method is superior to state-of-the-art approaches.
- Score: 33.91664470686695
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Person search is to detect all persons and identify the query persons from
detected persons in the image without proposals and bounding boxes, which is
different from person re-identification. In this paper, we propose a fusing
multi-task convolutional neural network(FMT-CNN) to tackle the correlation and
heterogeneity of detection and re-identification with a single convolutional
neural network. We focus on how the interplay of person detection and person
re-identification affects the overall performance. We employ person labels in
region proposal network to produce features for person re-identification and
person detection network, which can improve the accuracy of detection and
re-identification simultaneously. We also use a multiple loss to train our
re-identification network. Experiment results on CUHK-SYSU Person Search
dataset show that the performance of our proposed method is superior to
state-of-the-art approaches in both mAP and top-1.
Related papers
- Joint Neural Networks for One-shot Object Recognition and Detection [5.389851588398047]
This paper presents a novel joint neural networks approach to address the challenging one-shot object recognition and detection tasks.
Inspired by Siamese neural networks and state-of-art multi-box detection approaches, the joint neural networks are able to perform object recognition and detection for categories that remain unseen during the training process.
The proposed approach achieves 61.41% accuracy for one-shot object recognition on the MiniImageNet dataset and 47.1% mAP for one-shot object detection when trained on the dataset and tested.
arXiv Detail & Related papers (2024-08-01T16:48:03Z) - PSDiff: Diffusion Model for Person Search with Iterative and
Collaborative Refinement [59.6260680005195]
We present a novel Person Search framework based on the Diffusion model, PSDiff.
PSDiff formulates the person search as a dual denoising process from noisy boxes and ReID embeddings to ground truths.
Following the new paradigm, we further design a new Collaborative Denoising Layer (CDL) to optimize detection and ReID sub-tasks in an iterative and collaborative way.
arXiv Detail & Related papers (2023-09-20T08:16:39Z) - Sequential Transformer for End-to-End Person Search [4.920657401819193]
Person search aims to simultaneously localize and recognize a target person from realistic and uncropped gallery images.
In this paper, we propose a novel Sequential Transformer (SeqTR) for end-to-end person search to deal with this challenge.
Our SeqTR contains a detection transformer and a novel re-ID transformer that sequentially addresses detection and re-ID tasks.
arXiv Detail & Related papers (2022-11-06T09:32:30Z) - Subtask-dominated Transfer Learning for Long-tail Person Search [12.311100923753449]
Person search unifies person detection and person re-identification (Re-ID) to locate query persons from panoramic gallery images.
One major challenge comes from the imbalanced long-tail person identity distributions.
We propose a Subtask-dominated Transfer Learning (STL) method to solve this problem.
arXiv Detail & Related papers (2021-12-01T14:34:48Z) - Decoupled and Memory-Reinforced Networks: Towards Effective Feature
Learning for One-Step Person Search [65.51181219410763]
One-step methods have been developed to handle pedestrian detection and identification sub-tasks using a single network.
There are two major challenges in the current one-step approaches.
We propose a decoupled and memory-reinforced network (DMRNet) to overcome these problems.
arXiv Detail & Related papers (2021-02-22T06:19:45Z) - Robust Facial Landmark Detection by Cross-order Cross-semantic Deep
Network [58.843211405385205]
We propose a cross-order cross-semantic deep network (CCDN) to boost the semantic features learning for robust facial landmark detection.
Specifically, a cross-order two-squeeze multi-excitation (CTM) module is proposed to introduce the cross-order channel correlations for more discriminative representations learning.
A novel cross-order cross-semantic (COCS) regularizer is designed to drive the network to learn cross-order cross-semantic features from different activation for facial landmark detection.
arXiv Detail & Related papers (2020-11-16T08:19:26Z) - An Improved Person Re-identification Method by light-weight
convolutional neural network [0.0]
Person Re-identification is faced with challenges such as low resolution, varying poses, illumination, background clutter, and occlusion.
The present paper aims to improve Person Re-identification using transfer learning and application of verification loss function.
Experiments showed that the proposed model performs better than state-of-the-art methods on the CUHK01 dataset.
arXiv Detail & Related papers (2020-08-21T12:34:15Z) - Vision Meets Wireless Positioning: Effective Person Re-identification
with Recurrent Context Propagation [120.18969251405485]
Existing person re-identification methods rely on the visual sensor to capture the pedestrians.
Mobile phone can be sensed by WiFi and cellular networks in the form of a wireless positioning signal.
We propose a novel recurrent context propagation module that enables information to propagate between visual data and wireless positioning data.
arXiv Detail & Related papers (2020-08-10T14:19:15Z) - Ventral-Dorsal Neural Networks: Object Detection via Selective Attention [51.79577908317031]
We propose a new framework called Ventral-Dorsal Networks (VDNets)
Inspired by the structure of the human visual system, we propose the integration of a "Ventral Network" and a "Dorsal Network"
Our experimental results reveal that the proposed method outperforms state-of-the-art object detection approaches.
arXiv Detail & Related papers (2020-05-15T23:57:36Z) - FairMOT: On the Fairness of Detection and Re-Identification in Multiple
Object Tracking [92.48078680697311]
Multi-object tracking (MOT) is an important problem in computer vision.
We present a simple yet effective approach termed as FairMOT based on the anchor-free object detection architecture CenterNet.
The approach achieves high accuracy for both detection and tracking.
arXiv Detail & Related papers (2020-04-04T08:18:00Z) - A Convolutional Baseline for Person Re-Identification Using Vision and
Language Descriptions [24.794592610444514]
In real-world surveillance scenarios, frequently no visual information will be available about the queried person.
A two stream deep convolutional neural network framework supervised by cross entropy loss is presented.
The learnt visual representations are more robust and perform 22% better during retrieval as compared to a single modality system.
arXiv Detail & Related papers (2020-02-20T10:12:02Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.