Spectrum-guided Feature Enhancement Network for Event Person
Re-Identification
- URL: http://arxiv.org/abs/2402.01269v1
- Date: Fri, 2 Feb 2024 09:47:26 GMT
- Title: Spectrum-guided Feature Enhancement Network for Event Person
Re-Identification
- Authors: Hongchen Tan, Yi Zhang, Xiuping Liu, Baocai Yin, Nan Ma, Xin Li,
Huchuan Lu
- Abstract summary: We introduce the Spectrum-guided Feature Enhancement Network (SFE-Net)
The SFE-Net consists of two innovative components: the Multi-grain Spectrum Attention Mechanism (MSAM) and the Consecutive Patch Dropout Module (CPDM)
- Score: 82.52960675574353
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: As a cutting-edge biosensor, the event camera holds significant potential in
the field of computer vision, particularly regarding privacy preservation.
However, compared to traditional cameras, event streams often contain noise and
possess extremely sparse semantics, posing a formidable challenge for
event-based person re-identification (event Re-ID). To address this, we
introduce a novel event person re-identification network: the Spectrum-guided
Feature Enhancement Network (SFE-Net). This network consists of two innovative
components: the Multi-grain Spectrum Attention Mechanism (MSAM) and the
Consecutive Patch Dropout Module (CPDM). MSAM employs a fourier spectrum
transform strategy to filter event noise, while also utilizing an event-guided
multi-granularity attention strategy to enhance and capture discriminative
person semantics. CPDM employs a consecutive patch dropout strategy to generate
multiple incomplete feature maps, encouraging the deep Re-ID model to equally
perceive each effective region of the person's body and capture robust person
descriptors. Extensive experiments on Event Re-ID datasets demonstrate that our
SFE-Net achieves the best performance in this task.
Related papers
- MambaPupil: Bidirectional Selective Recurrent model for Event-based Eye tracking [50.26836546224782]
Event-based eye tracking has shown great promise with the high temporal resolution and low redundancy.
The diversity and abruptness of eye movement patterns, including blinking, fixating, saccades, and smooth pursuit, pose significant challenges for eye localization.
This paper proposes a bidirectional long-term sequence modeling and time-varying state selection mechanism to fully utilize contextual temporal information.
arXiv Detail & Related papers (2024-04-18T11:09:25Z) - Cross-Modality Perturbation Synergy Attack for Person Re-identification [70.44850060727474]
The main challenge in cross-modality ReID lies in effectively dealing with visual differences between different modalities.
Existing attack methods have primarily focused on the characteristics of the visible image modality.
This study proposes a universal perturbation attack specifically designed for cross-modality ReID.
arXiv Detail & Related papers (2024-01-18T15:56:23Z) - SpikeMOT: Event-based Multi-Object Tracking with Sparse Motion Features [52.213656737672935]
SpikeMOT is an event-based multi-object tracker.
SpikeMOT uses spiking neural networks to extract sparsetemporal features from event streams associated with objects.
arXiv Detail & Related papers (2023-09-29T05:13:43Z) - HiDAnet: RGB-D Salient Object Detection via Hierarchical Depth Awareness [2.341385717236931]
We propose a novel Hierarchical Depth Awareness network (HiDAnet) for RGB-D saliency detection.
Our motivation comes from the observation that the multi-granularity properties of geometric priors correlate well with the neural network hierarchies.
Our HiDAnet performs favorably over the state-of-the-art methods by large margins.
arXiv Detail & Related papers (2023-01-18T10:00:59Z) - Feature Disentanglement Learning with Switching and Aggregation for
Video-based Person Re-Identification [9.068045610800667]
In video person re-identification (Re-ID), the network must consistently extract features of the target person from successive frames.
Existing methods tend to focus only on how to use temporal information, which often leads to networks being fooled by similar appearances and same backgrounds.
We propose a Disentanglement and Switching and Aggregation Network (DSANet), which segregates the features representing identity and features based on camera characteristics, and pays more attention to ID information.
arXiv Detail & Related papers (2022-12-16T04:27:56Z) - Dynamic Prototype Mask for Occluded Person Re-Identification [88.7782299372656]
Existing methods mainly address this issue by employing body clues provided by an extra network to distinguish the visible part.
We propose a novel Dynamic Prototype Mask (DPM) based on two self-evident prior knowledge.
Under this condition, the occluded representation could be well aligned in a selected subspace spontaneously.
arXiv Detail & Related papers (2022-07-19T03:31:13Z) - Multi-Level Attention for Unsupervised Person Re-Identification [9.529435737056179]
In unsupervised person re-identification, the attention module represented by multi-headed self-attention suffers from attention spreading in the condition of non-ground truth.
We design pixel-level attention module to provide constraints for multi-headed self-attention.
For the trait that the identification targets of person re-identification data are all pedestrians, we design domain-level attention module.
arXiv Detail & Related papers (2022-01-10T02:47:06Z) - Specificity-preserving RGB-D Saliency Detection [103.3722116992476]
We propose a specificity-preserving network (SP-Net) for RGB-D saliency detection.
Two modality-specific networks and a shared learning network are adopted to generate individual and shared saliency maps.
Experiments on six benchmark datasets demonstrate that our SP-Net outperforms other state-of-the-art methods.
arXiv Detail & Related papers (2021-08-18T14:14:22Z) - Semantic Consistency and Identity Mapping Multi-Component Generative
Adversarial Network for Person Re-Identification [39.605062525247135]
We propose a semantic consistency and identity mapping multi-component generative adversarial network (SC-IMGAN) which provides style adaptation from one to many domains.
Our proposed method outperforms state-of-the-art techniques on six challenging person Re-ID datasets.
arXiv Detail & Related papers (2021-04-28T14:12:29Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.