Lightweight Multi-Branch Network for Person Re-Identification
- URL: http://arxiv.org/abs/2101.10774v1
- Date: Tue, 26 Jan 2021 13:28:46 GMT
- Title: Lightweight Multi-Branch Network for Person Re-Identification
- Authors: Fabian Herzog, Xunbo Ji, Torben Teepe, Stefan H\"ormann, Johannes
Gilg, Gerhard Rigoll
- Abstract summary: This paper proposes a lightweight network that combines global, part-based, and channel features in a unified multi-branch architecture that builds on the resource-efficient OSNet backbone.
Using a well-founded combination of training techniques and design choices, our final model achieves state-of-the-art results on CUHK03 labeled, CUHK03 detected, and Market-1501 with 85.1% mAP / 87.2% rank1, 82.4% mAP / 84.9% rank1, and 91.5% mAP / 96.3% rank1, respectively.
- Score: 6.353193172884524
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Person Re-Identification aims to retrieve person identities from images
captured by multiple cameras or the same cameras in different time instances
and locations. Because of its importance in many vision applications from
surveillance to human-machine interaction, person re-identification methods
need to be reliable and fast. While more and more deep architectures are
proposed for increasing performance, those methods also increase overall model
complexity. This paper proposes a lightweight network that combines global,
part-based, and channel features in a unified multi-branch architecture that
builds on the resource-efficient OSNet backbone. Using a well-founded
combination of training techniques and design choices, our final model achieves
state-of-the-art results on CUHK03 labeled, CUHK03 detected, and Market-1501
with 85.1% mAP / 87.2% rank1, 82.4% mAP / 84.9% rank1, and 91.5% mAP / 96.3%
rank1, respectively.
Related papers
- Parameter-Inverted Image Pyramid Networks for Visual Perception and Multimodal Understanding [49.218195440600354]
Current image pyramids use the same large-scale model to process multiple resolutions, leading to significant computational cost.
We propose a novel network architecture, called COCO-Inverted Image Pyramid Networks (PIIP)
PIIP uses pretrained models (ViTs or CNNs) as branches to process multi-scale images, where images of higher resolutions are processed by smaller network branches to balance computational cost and performance.
arXiv Detail & Related papers (2025-01-14T01:57:41Z) - Enhancing Person Re-Identification via Uncertainty Feature Fusion and Auto-weighted Measure Combination [1.183049138259841]
Person re-identification (Re-ID) is a challenging task that involves identifying the same person across different camera views in surveillance systems.
In this paper, a new approach is introduced that enhances the capability of ReID models through the Uncertain Feature Fusion Method (UFFM) and Auto-weighted Measure Combination (AMC)
Our method significantly improves Rank@1 accuracy and Mean Average Precision (mAP) when evaluated on person re-identification datasets.
arXiv Detail & Related papers (2024-05-02T09:09:48Z) - Raising the Bar of AI-generated Image Detection with CLIP [50.345365081177555]
The aim of this work is to explore the potential of pre-trained vision-language models (VLMs) for universal detection of AI-generated images.
We develop a lightweight detection strategy based on CLIP features and study its performance in a wide variety of challenging scenarios.
arXiv Detail & Related papers (2023-11-30T21:11:20Z) - AaP-ReID: Improved Attention-Aware Person Re-identification [2.5761958263376745]
AaP-ReID is a more effective method for person ReID that incorporates channel-wise attention into a ResNet-based architecture.
Our method incorporates the Channel-Wise Attention Bottleneck block and can learn discriminating features by dynamically adjusting the importance ofeach channel in the feature maps.
arXiv Detail & Related papers (2023-09-27T16:54:38Z) - Decoupled and Memory-Reinforced Networks: Towards Effective Feature
Learning for One-Step Person Search [65.51181219410763]
One-step methods have been developed to handle pedestrian detection and identification sub-tasks using a single network.
There are two major challenges in the current one-step approaches.
We propose a decoupled and memory-reinforced network (DMRNet) to overcome these problems.
arXiv Detail & Related papers (2021-02-22T06:19:45Z) - Multi-Attribute Enhancement Network for Person Search [7.85420914437147]
Person Search is designed to jointly solve the problems of Person Detection and Person Re-identification (Re-ID)
Visual character attributes play a key role in retrieving the query person, which has been explored in Re-ID but has been ignored in Person Search.
We introduce attribute learning into the model, allowing the use of attribute features for retrieval task.
arXiv Detail & Related papers (2021-02-16T05:43:47Z) - Camera-aware Proxies for Unsupervised Person Re-Identification [60.26031011794513]
This paper tackles the purely unsupervised person re-identification (Re-ID) problem that requires no annotations.
We propose to split each single cluster into multiple proxies and each proxy represents the instances coming from the same camera.
Based on the camera-aware proxies, we design both intra- and inter-camera contrastive learning components for our Re-ID model.
arXiv Detail & Related papers (2020-12-19T12:37:04Z) - Integrating Coarse Granularity Part-level Features with Supervised
Global-level Features for Person Re-identification [3.4758712821739426]
Part-level person Re-ID network (CGPN) integrates supervised global features for both holistic and partial person images.
CGPN learns to extract effective body part features for both holistic and partial person images.
Single model trained on three Re-ID datasets including Market-1501, DukeMTMC-reID and CUHK03 state-of-the-art performances.
arXiv Detail & Related papers (2020-10-15T11:49:20Z) - ResNeSt: Split-Attention Networks [86.25490825631763]
We present a modularized architecture, which applies the channel-wise attention on different network branches to leverage their success in capturing cross-feature interactions and learning diverse representations.
Our model, named ResNeSt, outperforms EfficientNet in accuracy and latency trade-off on image classification.
arXiv Detail & Related papers (2020-04-19T20:40:31Z) - Intra-Camera Supervised Person Re-Identification [87.88852321309433]
We propose a novel person re-identification paradigm based on an idea of independent per-camera identity annotation.
This eliminates the most time-consuming and tedious inter-camera identity labelling process.
We formulate a Multi-tAsk mulTi-labEl (MATE) deep learning method for Intra-Camera Supervised (ICS) person re-id.
arXiv Detail & Related papers (2020-02-12T15:26:33Z) - VMRFANet:View-Specific Multi-Receptive Field Attention Network for
Person Re-identification [3.1498833540989413]
We propose a novel multi-receptive field attention (MRFA) module that utilizes filters of various sizes to help network focusing on informative pixels.
We present a view-specific mechanism that guides attention module to handle the variation of view conditions.
Our method achieves 95.5% / 88.1% in rank-1 / mAP on Market-1501, 88.9% / 80.0% on DukeMTMC-reID, 81.1% / 78.8% on CUHK03 labeled dataset and 78.9% / 75.3% on CUHK03 detected dataset.
arXiv Detail & Related papers (2020-01-21T06:31:18Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.