Multi-Branch with Attention Network for Hand-Based Person Recognition
- URL: http://arxiv.org/abs/2108.02234v1
- Date: Wed, 4 Aug 2021 18:25:08 GMT
- Title: Multi-Branch with Attention Network for Hand-Based Person Recognition
- Authors: Nathanael L. Baisa, Bryan Williams, Hossein Rahmani, Plamen Angelov,
Sue Black
- Abstract summary: We propose a novel hand-based person recognition method for the purpose of criminal investigations.
Our proposed method, Multi-Branch with Attention Network (MBA-Net), incorporates both channel and spatial attention modules.
Our proposed method achieves state-of-the-art performance, surpassing the existing hand-based identification methods.
- Score: 5.162308830328819
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: In this paper, we propose a novel hand-based person recognition method for
the purpose of criminal investigations since the hand image is often the only
available information in cases of serious crime such as sexual abuse. Our
proposed method, Multi-Branch with Attention Network (MBA-Net), incorporates
both channel and spatial attention modules in branches in addition to a global
(without attention) branch to capture global structural information for
discriminative feature learning. The attention modules focus on the relevant
features of the hand image while suppressing the irrelevant backgrounds. In
order to overcome the weakness of the attention mechanisms, equivariant to
pixel shuffling, we integrate relative positional encodings into the spatial
attention module to capture the spatial positions of pixels. Extensive
evaluations on two large multi-ethnic and publicly available hand datasets
demonstrate that our proposed method achieves state-of-the-art performance,
surpassing the existing hand-based identification methods.
Related papers
- SACANet: scene-aware class attention network for semantic segmentation
of remote sensing images [4.124381172041927]
We propose a scene-aware class attention network (SACANet) for semantic segmentation of remote sensing images.
Experimental results on three datasets show that SACANet outperforms other state-of-the-art methods and validate its effectiveness.
arXiv Detail & Related papers (2023-04-22T14:54:31Z) - Towards Effective Image Manipulation Detection with Proposal Contrastive
Learning [61.5469708038966]
We propose Proposal Contrastive Learning (PCL) for effective image manipulation detection.
Our PCL consists of a two-stream architecture by extracting two types of global features from RGB and noise views respectively.
Our PCL can be easily adapted to unlabeled data in practice, which can reduce manual labeling costs and promote more generalizable features.
arXiv Detail & Related papers (2022-10-16T13:30:13Z) - Local-Aware Global Attention Network for Person Re-Identification Based on Body and Hand Images [0.0]
We propose a compound approach for end-to-end discriminative deep feature learning for person Re-Id based on both body and hand images.
The proposed method consistently outperforms existing state-of-the-art methods.
arXiv Detail & Related papers (2022-09-11T09:43:42Z) - Multi-Level Attention for Unsupervised Person Re-Identification [9.529435737056179]
In unsupervised person re-identification, the attention module represented by multi-headed self-attention suffers from attention spreading in the condition of non-ground truth.
We design pixel-level attention module to provide constraints for multi-headed self-attention.
For the trait that the identification targets of person re-identification data are all pedestrians, we design domain-level attention module.
arXiv Detail & Related papers (2022-01-10T02:47:06Z) - Alignment Attention by Matching Key and Query Distributions [48.93793773929006]
This paper introduces alignment attention that explicitly encourages self-attention to match the distributions of the key and query within each head.
It is simple to convert any models with self-attention, including pre-trained ones, to the proposed alignment attention.
On a variety of language understanding tasks, we show the effectiveness of our method in accuracy, uncertainty estimation, generalization across domains, and robustness to adversarial attacks.
arXiv Detail & Related papers (2021-10-25T00:54:57Z) - All the attention you need: Global-local, spatial-channel attention for
image retrieval [11.150896867058902]
We address representation learning for large-scale instance-level image retrieval.
We present global-local attention module (GLAM), which is attached at the end of a backbone network.
We obtain a new feature tensor and, by spatial pooling, we learn a powerful embedding for image retrieval.
arXiv Detail & Related papers (2021-07-16T16:39:13Z) - Watching You: Global-guided Reciprocal Learning for Video-based Person
Re-identification [82.6971648465279]
We propose a novel Global-guided Reciprocal Learning framework for video-based person Re-ID.
Our approach can achieve better performance than other state-of-the-art approaches.
arXiv Detail & Related papers (2021-03-07T12:27:42Z) - Hand-Based Person Identification using Global and Part-Aware Deep
Feature Representation Learning [6.144554939661599]
We propose hand-based person identification by learning both global and local deep feature representation.
Our proposed method, Global and Part-Aware Network (GPA-Net), creates global and local branches on the conv-layer.
We make extensive evaluations on two large multi-ethnic and publicly available hand datasets, demonstrating that our proposed method significantly outperforms competing approaches.
arXiv Detail & Related papers (2021-01-13T18:47:47Z) - Self-supervised Human Detection and Segmentation via Multi-view
Consensus [116.92405645348185]
We propose a multi-camera framework in which geometric constraints are embedded in the form of multi-view consistency during training.
We show that our approach outperforms state-of-the-art self-supervised person detection and segmentation techniques on images that visually depart from those of standard benchmarks.
arXiv Detail & Related papers (2020-12-09T15:47:21Z) - Rethinking of the Image Salient Object Detection: Object-level Semantic
Saliency Re-ranking First, Pixel-wise Saliency Refinement Latter [62.26677215668959]
We propose a lightweight, weakly supervised deep network to coarsely locate semantically salient regions.
We then fuse multiple off-the-shelf deep models on these semantically salient regions as the pixel-wise saliency refinement.
Our method is simple yet effective, which is the first attempt to consider the salient object detection mainly as an object-level semantic re-ranking problem.
arXiv Detail & Related papers (2020-08-10T07:12:43Z) - Ventral-Dorsal Neural Networks: Object Detection via Selective Attention [51.79577908317031]
We propose a new framework called Ventral-Dorsal Networks (VDNets)
Inspired by the structure of the human visual system, we propose the integration of a "Ventral Network" and a "Dorsal Network"
Our experimental results reveal that the proposed method outperforms state-of-the-art object detection approaches.
arXiv Detail & Related papers (2020-05-15T23:57:36Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.