Automatic Detection and Recognition of Individuals in Patterned Species
- URL: http://arxiv.org/abs/2005.02905v1
- Date: Wed, 6 May 2020 15:29:21 GMT
- Title: Automatic Detection and Recognition of Individuals in Patterned Species
- Authors: Gullal Singh Cheema, Saket Anand
- Abstract summary: We develop a framework for automatic detection and recognition of individuals in different patterned species.
We use the recently proposed Faster-RCNN object detection framework to efficiently detect animals in images.
We evaluate our recognition system on zebra and jaguar images to show generalization to other patterned species.
- Score: 4.163860911052052
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Visual animal biometrics is rapidly gaining popularity as it enables a
non-invasive and cost-effective approach for wildlife monitoring applications.
Widespread usage of camera traps has led to large volumes of collected images,
making manual processing of visual content hard to manage. In this work, we
develop a framework for automatic detection and recognition of individuals in
different patterned species like tigers, zebras and jaguars. Most existing
systems primarily rely on manual input for localizing the animal, which does
not scale well to large datasets. In order to automate the detection process
while retaining robustness to blur, partial occlusion, illumination and pose
variations, we use the recently proposed Faster-RCNN object detection framework
to efficiently detect animals in images. We further extract features from
AlexNet of the animal's flank and train a logistic regression (or Linear SVM)
classifier to recognize the individuals. We primarily test and evaluate our
framework on a camera trap tiger image dataset that contains images that vary
in overall image quality, animal pose, scale and lighting. We also evaluate our
recognition system on zebra and jaguar images to show generalization to other
patterned species. Our framework gives perfect detection results in camera
trapped tiger images and a similar or better individual recognition performance
when compared with state-of-the-art recognition techniques.
Related papers
- Multimodal Foundation Models for Zero-shot Animal Species Recognition in
Camera Trap Images [57.96659470133514]
Motion-activated camera traps constitute an efficient tool for tracking and monitoring wildlife populations across the globe.
Supervised learning techniques have been successfully deployed to analyze such imagery, however training such techniques requires annotations from experts.
Reducing the reliance on costly labelled data has immense potential in developing large-scale wildlife tracking solutions with markedly less human labor.
arXiv Detail & Related papers (2023-11-02T08:32:00Z) - Improving Image Recognition by Retrieving from Web-Scale Image-Text Data [68.63453336523318]
We introduce an attention-based memory module, which learns the importance of each retrieved example from the memory.
Compared to existing approaches, our method removes the influence of the irrelevant retrieved examples, and retains those that are beneficial to the input query.
We show that it achieves state-of-the-art accuracies in ImageNet-LT, Places-LT and Webvision datasets.
arXiv Detail & Related papers (2023-04-11T12:12:05Z) - Choosing an Appropriate Platform and Workflow for Processing Camera Trap
Data using Artificial Intelligence [0.18350044465969417]
Camera traps have transformed how ecologists study wildlife species distributions, activity patterns, and interspecific interactions.
The potential of Artificial Intelligence (AI), specifically Deep Learning (DL), to process camera-trap data has gained considerable attention.
Using DL for these applications involves training algorithms, such as Convolutional Neural Networks (CNNs) to automatically detect objects and classify species.
arXiv Detail & Related papers (2022-02-04T18:13:09Z) - Ensembling with Deep Generative Views [72.70801582346344]
generative models can synthesize "views" of artificial images that mimic real-world variations, such as changes in color or pose.
Here, we investigate whether such views can be applied to real images to benefit downstream analysis tasks such as image classification.
We use StyleGAN2 as the source of generative augmentations and investigate this setup on classification tasks involving facial attributes, cat faces, and cars.
arXiv Detail & Related papers (2021-04-29T17:58:35Z) - A first step towards automated species recognition from camera trap
images of mammals using AI in a European temperate forest [0.0]
This paper presents the implementation of the YOLOv5 architecture for automated labeling of camera trap images of mammals in the Bialowieza Forest (BF), Poland.
The camera trapping data were organized and harmonized using TRAPPER software, an open source application for managing large-scale wildlife monitoring projects.
The proposed image recognition pipeline achieved an average accuracy of 85% F1-score in the identification of the 12 most commonly occurring medium-size and large mammal species in BF.
arXiv Detail & Related papers (2021-03-19T22:48:03Z) - Instance Localization for Self-supervised Detection Pretraining [68.24102560821623]
We propose a new self-supervised pretext task, called instance localization.
We show that integration of bounding boxes into pretraining promotes better task alignment and architecture alignment for transfer learning.
Experimental results demonstrate that our approach yields state-of-the-art transfer learning results for object detection.
arXiv Detail & Related papers (2021-02-16T17:58:57Z) - Exploiting Depth Information for Wildlife Monitoring [0.0]
We propose an automated camera trap-based approach to detect and identify animals using depth estimation.
To detect and identify individual animals, we propose a novel method D-Mask R-CNN for the so-called instance segmentation.
An experimental evaluation shows the benefit of the additional depth estimation in terms of improved average precision scores of the animal detection.
arXiv Detail & Related papers (2021-02-10T18:10:34Z) - Self-supervised Human Detection and Segmentation via Multi-view
Consensus [116.92405645348185]
We propose a multi-camera framework in which geometric constraints are embedded in the form of multi-view consistency during training.
We show that our approach outperforms state-of-the-art self-supervised person detection and segmentation techniques on images that visually depart from those of standard benchmarks.
arXiv Detail & Related papers (2020-12-09T15:47:21Z) - An explainable deep vision system for animal classification and
detection in trail-camera images with automatic post-deployment retraining [0.0]
This paper introduces an automated vision system for animal detection in trail-camera images taken from a field under the administration of the Texas Parks and Wildlife Department.
We implement a two-stage deep convolutional neural network pipeline to find animal-containing images in the first stage and then process these images to detect birds in the second stage.
The animal classification system classifies animal images with overall 93% sensitivity and 96% specificity. The bird detection system achieves better than 93% sensitivity, 92% specificity, and 68% average Intersection-over-Union rate.
arXiv Detail & Related papers (2020-10-22T06:29:55Z) - WhoAmI: An Automatic Tool for Visual Recognition of Tiger and Leopard
Individuals in the Wild [3.1708876837195157]
We develop automatic algorithms that are able to detect animals, identify the species of animals and to recognize individual animals for two species.
We demonstrate the effectiveness of our approach on a data set of camera-trap images recorded in the jungles of Southern India.
arXiv Detail & Related papers (2020-06-17T16:17:46Z) - Automatic image-based identification and biomass estimation of
invertebrates [70.08255822611812]
Time-consuming sorting and identification of taxa pose strong limitations on how many insect samples can be processed.
We propose to replace the standard manual approach of human expert-based sorting and identification with an automatic image-based technology.
We use state-of-the-art Resnet-50 and InceptionV3 CNNs for the classification task.
arXiv Detail & Related papers (2020-02-05T21:38:57Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.