A contrastive learning approach for individual re-identification in a
wild fish population
- URL: http://arxiv.org/abs/2301.00596v1
- Date: Mon, 2 Jan 2023 11:03:39 GMT
- Title: A contrastive learning approach for individual re-identification in a
wild fish population
- Authors: {\O}rjan Lang{\o}y Olsen and Tonje Knutsen S{\o}rdalen and Morten
Goodwin and Ketil Malde and Kristian Muri Knausg{\aa}rd and Kim Tallaksen
Halvorsen
- Abstract summary: This paper introduces a contrastive learning-based model for identifying individuals.
We use contrastive learning to find similar or dissimilar image pairs from a collection of uniform photographs.
Our model achieves a one-shot accuracy of 0.35, a 5-shot accuracy of 0.56, and a 100-shot accuracy of 0.88, on our dataset.
- Score: 2.626095252463179
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: In both terrestrial and marine ecology, physical tagging is a frequently used
method to study population dynamics and behavior. However, such tagging
techniques are increasingly being replaced by individual re-identification
using image analysis.
This paper introduces a contrastive learning-based model for identifying
individuals. The model uses the first parts of the Inception v3 network,
supported by a projection head, and we use contrastive learning to find similar
or dissimilar image pairs from a collection of uniform photographs. We apply
this technique for corkwing wrasse, Symphodus melops, an ecologically and
commercially important fish species. Photos are taken during repeated catches
of the same individuals from a wild population, where the intervals between
individual sightings might range from a few days to several years.
Our model achieves a one-shot accuracy of 0.35, a 5-shot accuracy of 0.56,
and a 100-shot accuracy of 0.88, on our dataset.
Related papers
- Unconstrained Body Recognition at Altitude and Range: Comparing Four Approaches [0.0]
We focus on learning persistent body shape characteristics that remain stable over time.
We introduce a body identification model based on a Vision Transformer (ViT) and on a Swin-ViT model.
All models are trained on a large and diverse dataset of over 1.9 million images of approximately 5k identities across 9 databases.
arXiv Detail & Related papers (2025-02-10T23:49:06Z) - Deep Learning for Leopard Individual Identification: An Adaptive Angular Margin Approach [0.0]
This paper introduces a deep learning framework to distinguish between individual leopards based on their unique spot patterns.
I propose a preprocessing pipeline that combines RGB channels with an edge detection channel to underscore the critical features learned by the model.
arXiv Detail & Related papers (2024-11-04T10:38:33Z) - Finding the Subjective Truth: Collecting 2 Million Votes for Comprehensive Gen-AI Model Evaluation [0.0]
Our study collected over 2 million annotations across 4,512 images, evaluating four prominent models on style preference, coherence, and text-to-image alignment.
We show that our approach makes it feasible to comprehensively rank image generation models based on a vast pool of annotators.
arXiv Detail & Related papers (2024-09-18T12:02:20Z) - Evaluating Multiview Object Consistency in Humans and Image Models [68.36073530804296]
We leverage an experimental design from the cognitive sciences which requires zero-shot visual inferences about object shape.
We collect 35K trials of behavioral data from over 500 participants.
We then evaluate the performance of common vision models.
arXiv Detail & Related papers (2024-09-09T17:59:13Z) - Spatial Implicit Neural Representations for Global-Scale Species Mapping [72.92028508757281]
Given a set of locations where a species has been observed, the goal is to build a model to predict whether the species is present or absent at any location.
Traditional methods struggle to take advantage of emerging large-scale crowdsourced datasets.
We use Spatial Implicit Neural Representations (SINRs) to jointly estimate the geographical range of 47k species simultaneously.
arXiv Detail & Related papers (2023-06-05T03:36:01Z) - Bag of Tricks for Long-Tail Visual Recognition of Animal Species in
Camera Trap Images [2.294014185517203]
We evaluate recently proposed techniques to address the long-tail visual recognition of animal species in camera trap images.
In general, the square-root sampling was the method that most improved the performance for minority classes by around 10%.
The proposed approach achieved the best trade-off between the performance of the tail class and the cost of the head classes' accuracy.
arXiv Detail & Related papers (2022-06-24T18:30:26Z) - Unravelling the Effect of Image Distortions for Biased Prediction of
Pre-trained Face Recognition Models [86.79402670904338]
We evaluate the performance of four state-of-the-art deep face recognition models in the presence of image distortions.
We have observed that image distortions have a relationship with the performance gap of the model across different subgroups.
arXiv Detail & Related papers (2021-08-14T16:49:05Z) - Focus on the Positives: Self-Supervised Learning for Biodiversity
Monitoring [9.086207853136054]
We address the problem of learning self-supervised representations from unlabeled image collections.
We exploit readily available context data that encodes information such as the spatial and temporal relationships between the input images.
For the critical task of global biodiversity monitoring, this results in image features that can be adapted to challenging visual species classification tasks with limited human supervision.
arXiv Detail & Related papers (2021-08-14T01:12:41Z) - An Empirical Study of the Collapsing Problem in Semi-Supervised 2D Human
Pose Estimation [80.02124918255059]
Semi-supervised learning aims to boost the accuracy of a model by exploring unlabeled images.
We learn two networks to mutually teach each other.
The more reliable predictions on easy images in each network are used to teach the other network to learn about the corresponding hard images.
arXiv Detail & Related papers (2020-11-25T03:29:52Z) - Deep learning for lithological classification of carbonate rock micro-CT
images [52.77024349608834]
This work intends to present an application of deep learning techniques to identify patterns in Brazilian pre-salt carbonate rock microtomographic images.
Four convolutional neural network models were proposed.
According to accuracy, Model 2 trained on resized images achieved the best results, reaching an average of 75.54% for the first evaluation approach and an average of 81.33% for the second.
arXiv Detail & Related papers (2020-07-30T19:14:00Z) - Temperate Fish Detection and Classification: a Deep Learning based
Approach [6.282069822653608]
We propose a two-step deep learning approach for the detection and classification of temperate fishes without pre-filtering.
The first step is to detect each single fish in an image, independent of species and sex.
In the second step, we adopt a Convolutional Neural Network (CNN) with the Squeeze-and-Excitation (SE) architecture for classifying each fish in the image without pre-filtering.
arXiv Detail & Related papers (2020-05-14T12:40:57Z) - Automatic image-based identification and biomass estimation of
invertebrates [70.08255822611812]
Time-consuming sorting and identification of taxa pose strong limitations on how many insect samples can be processed.
We propose to replace the standard manual approach of human expert-based sorting and identification with an automatic image-based technology.
We use state-of-the-art Resnet-50 and InceptionV3 CNNs for the classification task.
arXiv Detail & Related papers (2020-02-05T21:38:57Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.