Learning landmark guided embeddings for animal re-identification
- URL: http://arxiv.org/abs/2001.02801v1
- Date: Thu, 9 Jan 2020 01:31:00 GMT
- Title: Learning landmark guided embeddings for animal re-identification
- Authors: Olga Moskvyak, Frederic Maire, Feras Dayoub and Mahsa Baktashmotlagh
- Abstract summary: We propose to improve embedding learning by exploiting body landmarks information explicitly.
Body landmarks are provided to the input of a CNN as confidence heatmaps.
We evaluate the proposed method on a large synthetic dataset and a small real dataset.
- Score: 15.356786390476591
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Re-identification of individual animals in images can be ambiguous due to
subtle variations in body markings between different individuals and no
constraints on the poses of animals in the wild. Person re-identification is a
similar task and it has been approached with a deep convolutional neural
network (CNN) that learns discriminative embeddings for images of people.
However, learning discriminative features for an individual animal is more
challenging than for a person's appearance due to the relatively small size of
ecological datasets compared to labelled datasets of person's identities. We
propose to improve embedding learning by exploiting body landmarks information
explicitly. Body landmarks are provided to the input of a CNN as confidence
heatmaps that can be obtained from a separate body landmark predictor. The
model is encouraged to use heatmaps by learning an auxiliary task of
reconstructing input heatmaps. Body landmarks guide a feature extraction
network to learn the representation of a distinctive pattern and its position
on the body. We evaluate the proposed method on a large synthetic dataset and a
small real dataset. Our method outperforms the same model without body
landmarks input by 26% and 18% on the synthetic and the real datasets
respectively. The method is robust to noise in input coordinates and can
tolerate an error in coordinates up to 10% of the image size.
Related papers
- Universal Bovine Identification via Depth Data and Deep Metric Learning [1.6605913858547239]
This paper proposes and evaluates, for the first time, a depth-only deep learning system for accurately identifying individual cattle.
An increase in herd size skews the cow-to-human ratio at the farm and makes the manual monitoring of individuals more challenging.
Underpinned by our previous work, this paper introduces a deep-metric learning method for cattle identification using depth data from an off-the-shelf 3D camera.
arXiv Detail & Related papers (2024-03-29T22:03:53Z) - Learning 3D Human Pose Estimation from Dozens of Datasets using a
Geometry-Aware Autoencoder to Bridge Between Skeleton Formats [80.12253291709673]
We propose a novel affine-combining autoencoder (ACAE) method to perform dimensionality reduction on the number of landmarks.
Our approach scales to an extreme multi-dataset regime, where we use 28 3D human pose datasets to supervise one model.
arXiv Detail & Related papers (2022-12-29T22:22:49Z) - Body Part-Based Representation Learning for Occluded Person
Re-Identification [102.27216744301356]
Occluded person re-identification (ReID) is a person retrieval task which aims at matching occluded person images with holistic ones.
Part-based methods have been shown beneficial as they offer fine-grained information and are well suited to represent partially visible human bodies.
We propose BPBreID, a body part-based ReID model for solving the above issues.
arXiv Detail & Related papers (2022-11-07T16:48:41Z) - PGGANet: Pose Guided Graph Attention Network for Person
Re-identification [0.0]
Person re-identification (ReID) aims at retrieving a person from images captured by different cameras.
It has been proved that using local features together with global feature of person image could help to give robust feature representations for person retrieval.
We propose a pose guided graph attention network, a multi-branch architecture consisting of one branch for global feature, one branch for mid-granular body features and one branch for fine-granular key point features.
arXiv Detail & Related papers (2021-11-29T09:47:39Z) - Keypoint Message Passing for Video-based Person Re-Identification [106.41022426556776]
Video-based person re-identification (re-ID) is an important technique in visual surveillance systems which aims to match video snippets of people captured by different cameras.
Existing methods are mostly based on convolutional neural networks (CNNs), whose building blocks either process local neighbor pixels at a time, or, when 3D convolutions are used to model temporal information, suffer from the misalignment problem caused by person movement.
In this paper, we propose to overcome the limitations of normal convolutions with a human-oriented graph method. Specifically, features located at person joint keypoints are extracted and connected as a spatial-temporal graph
arXiv Detail & Related papers (2021-11-16T08:01:16Z) - Pose Recognition in the Wild: Animal pose estimation using Agglomerative
Clustering and Contrastive Learning [0.0]
We introduce a novel architecture that is able to recognize the pose of multiple animals fromunlabelled data.
We are able to distinguish between body parts of the animal, based on their visual behavior, instead of the underlying anatomy.
arXiv Detail & Related papers (2021-11-16T07:00:31Z) - 3D Convolution Neural Network based Person Identification using Gait
cycles [0.0]
In this work, gait features are used to identify an individual. The steps involve object detection, background subtraction, silhouettes extraction, skeletonization, and training 3D Convolution Neural Network on these gait features.
The proposed method focuses more on the lower body part to extract features such as the angle between knee and thighs, hip angle, angle of contact, and many other features.
arXiv Detail & Related papers (2021-06-06T14:27:06Z) - Pretrained equivariant features improve unsupervised landmark discovery [69.02115180674885]
We formulate a two-step unsupervised approach that overcomes this challenge by first learning powerful pixel-based features.
Our method produces state-of-the-art results in several challenging landmark detection datasets.
arXiv Detail & Related papers (2021-04-07T05:42:11Z) - Hidden Footprints: Learning Contextual Walkability from 3D Human Trails [70.01257397390361]
Current datasets only tell you where people are, not where they could be.
We first augment the set of valid, labeled walkable regions by propagating person observations between images, utilizing 3D information to create what we call hidden footprints.
We devise a training strategy designed for such sparse labels, combining a class-balanced classification loss with a contextual adversarial loss.
arXiv Detail & Related papers (2020-08-19T23:19:08Z) - Whole-Body Human Pose Estimation in the Wild [88.09875133989155]
COCO-WholeBody extends COCO dataset with whole-body annotations.
It is the first benchmark that has manual annotations on the entire human body.
A single-network model, named ZoomNet, is devised to take into account the hierarchical structure of the full human body.
arXiv Detail & Related papers (2020-07-23T08:35:26Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.