SeaTurtleID2022: A long-span dataset for reliable sea turtle re-identification
- URL: http://arxiv.org/abs/2311.05524v2
- Date: Tue, 30 Apr 2024 14:50:59 GMT
- Title: SeaTurtleID2022: A long-span dataset for reliable sea turtle re-identification
- Authors: Lukáš Adam, Vojtěch Čermák, Kostas Papafitsoros, Lukáš Picek,
- Abstract summary: This paper introduces the first large-scale, long-span dataset with sea turtle photographs captured in the wild -- SeaTurtleID2022.
The dataset contains 8729 photographs of 438 unique individuals collected within 13 years.
Instead of standard "random" splits, the dataset allows for two realistic and ecologically motivated splits.
- License: http://creativecommons.org/licenses/by-nc-nd/4.0/
- Abstract: This paper introduces the first public large-scale, long-span dataset with sea turtle photographs captured in the wild -- SeaTurtleID2022 (https://www.kaggle.com/datasets/wildlifedatasets/seaturtleid2022). The dataset contains 8729 photographs of 438 unique individuals collected within 13 years, making it the longest-spanning dataset for animal re-identification. All photographs include various annotations, e.g., identity, encounter timestamp, and body-part segmentation masks. Instead of standard "random" splits, the dataset allows for two realistic and ecologically motivated splits: (i) a time-aware closed-set split with training, validation, and test data from different days/years, and (ii) a time-aware open-set split with new unknown individuals in the test and validation sets. We show that time-aware splits are essential for benchmarking re-identification methods, as random splits lead to performance overestimation. Furthermore, baseline instance segmentation and re-identification performance over various body parts is provided. Finally, an end-to-end system for sea turtle re-identification is proposed and evaluated. The proposed system, based on Hybrid Task Cascade for head instance segmentation and an ArcFace-trained feature extractor, achieved an accuracy of 86.8%.
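The two time-aware splits described in the abstract can be illustrated with a minimal sketch. This is not the authors' splitting code: the single cutoff date, the record layout (image name, identity, encounter date), the function names, and the "unknown" label are all assumptions made for illustration, and the validation set is omitted for brevity.

```python
from datetime import date

# Hypothetical toy records: (image name, turtle identity, encounter date).
records = [
    ("a.jpg", "t001", date(2015, 6, 1)),
    ("b.jpg", "t001", date(2020, 7, 3)),
    ("c.jpg", "t002", date(2016, 8, 9)),
    ("d.jpg", "t003", date(2021, 6, 20)),
]
cutoff = date(2019, 1, 1)  # assumed boundary between training and test periods


def time_aware_closed_set_split(records, cutoff):
    """Train on encounters before the cutoff; test on later encounters
    of identities that already appear in the training set."""
    train = [r for r in records if r[2] < cutoff]
    known = {identity for _, identity, _ in train}
    test = [r for r in records if r[2] >= cutoff and r[1] in known]
    return train, test


def time_aware_open_set_split(records, cutoff):
    """Same time boundary, but later encounters of identities never seen
    in training are kept and relabelled as "unknown"."""
    train = [r for r in records if r[2] < cutoff]
    known = {identity for _, identity, _ in train}
    test = [
        (image, identity if identity in known else "unknown", day)
        for image, identity, day in records
        if day >= cutoff
    ]
    return train, test


closed_train, closed_test = time_aware_closed_set_split(records, cutoff)
open_train, open_test = time_aware_open_set_split(records, cutoff)
print(closed_test)
print(open_test)
```

Unlike a random split, neither function can place photographs from the same day or encounter on both sides of the boundary, which is the kind of leakage the abstract associates with performance overestimation.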
Related papers
- A Dataset for Semantic Segmentation in the Presence of Unknowns [49.795683850385956]
Existing datasets allow evaluation of only knowns or unknowns - but not both.
We propose a novel anomaly segmentation dataset, ISSU, that features a diverse set of anomaly inputs from cluttered real-world environments.
The dataset is twice as large as existing anomaly segmentation datasets.
arXiv Detail & Related papers (2025-03-28T10:31:01Z) - Multispecies Animal Re-ID Using a Large Community-Curated Dataset [0.19418036471925312]
We construct a dataset that includes 49 species, 37K individual animals, and 225K images, using this data to train a single embedding network for all species.
Our model consistently outperforms models trained separately on each species, achieving an average gain of 12.5% in top-1 accuracy.
The model is already in production use for 60+ species in a large-scale wildlife monitoring system.
arXiv Detail & Related papers (2024-12-07T09:56:33Z) - WildlifeReID-10k: Wildlife re-identification dataset with 10k individual animals [0.0]
This paper introduces WildlifeReID-10k, a new large-scale re-identification benchmark with more than 10k animal identities of around 33 species across more than 140k images.
WildlifeReID-10k covers diverse animal species and poses significant challenges for SoTA methods.
The dataset and benchmark are publicly available on Kaggle, along with strong baselines for both closed-set and open-set evaluation.
arXiv Detail & Related papers (2024-06-13T15:15:07Z) - SwinFace: A Multi-task Transformer for Face Recognition, Expression Recognition, Age Estimation and Attribute Estimation [60.94239810407917]
This paper presents a multi-purpose algorithm for simultaneous face recognition, facial expression recognition, age estimation, and face attribute estimation based on a single Swin Transformer.
To address the conflicts among multiple tasks, a Multi-Level Channel Attention (MLCA) module is integrated into each task-specific analysis.
Experiments show that the proposed model has a better understanding of the face and achieves excellent performance for all tasks.
arXiv Detail & Related papers (2023-08-22T15:38:39Z) - Navya3DSeg -- Navya 3D Semantic Segmentation Dataset & split generation for autonomous vehicles [63.20765930558542]
3D semantic data are useful for core perception tasks such as obstacle detection and ego-vehicle localization.
We propose a new dataset, Navya 3D (Navya3DSeg), with a diverse label space corresponding to a large scale production grade operational domain.
It contains 23 labeled sequences and 25 supplementary sequences without labels, designed to explore self-supervised and semi-supervised semantic segmentation benchmarks on point clouds.
arXiv Detail & Related papers (2023-02-16T13:41:19Z) - SeaTurtleID2022: A long-span dataset for reliable sea turtle re-identification [0.0]
This paper introduces the first public large-scale, long-span dataset with sea turtle photographs captured in the wild.
The dataset contains 8729 photographs of 438 unique individuals collected within 13 years.
arXiv Detail & Related papers (2022-11-18T15:46:24Z) - TempNet: Temporal Attention Towards the Detection of Animal Behaviour in Videos [63.85815474157357]
We propose an efficient computer vision- and deep learning-based method for the detection of biological behaviours in videos.
TempNet uses an encoder bridge and residual blocks to maintain model performance with a two-staged, spatial, then temporal, encoder.
We demonstrate its application to the detection of sablefish (Anoplopoma fimbria) startle events.
arXiv Detail & Related papers (2022-11-17T23:55:12Z) - End-to-end Person Search Sequentially Trained on Aggregated Dataset [1.9766522384767227]
We propose a new end-to-end model that jointly computes detection and feature extraction steps.
We show that aggregating more pedestrian detection datasets without costly identity annotations makes the shared feature maps more generic.
arXiv Detail & Related papers (2022-01-24T11:22:15Z) - Multi-dataset Pretraining: A Unified Model for Semantic Segmentation [97.61605021985062]
We propose a unified framework, termed Multi-Dataset Pretraining, to take full advantage of the fragmented annotations of different datasets.
This is achieved by first pretraining the network via the proposed pixel-to-prototype contrastive loss over multiple datasets.
In order to better model the relationship among images and classes from different datasets, we extend the pixel level embeddings via cross dataset mixing.
arXiv Detail & Related papers (2021-06-08T06:13:11Z) - Large-Scale Spatio-Temporal Person Re-identification: Algorithm and Benchmark [100.77540900932763]
We contribute a novel Large-scale Spatio-Temporal (LaST) person re-ID dataset, including 10,860 identities with more than 224k images.
LaST presents more challenging and high-diversity reID settings, and significantly larger spatial and temporal ranges.
arXiv Detail & Related papers (2021-05-31T16:05:51Z) - EDEN: Deep Feature Distribution Pooling for Saimaa Ringed Seals Pattern Matching [0.17999333451993946]
Pelage pattern matching is considered to solve the individual re-identification of the Saimaa ringed seals.
We propose a novel feature pooling approach that allows aggregating the local pattern features to get a fixed-size embedding vector.
arXiv Detail & Related papers (2021-05-28T16:59:39Z) - Towards Self-Supervision for Video Identification of Individual Holstein-Friesian Cattle: The Cows2021 Dataset [10.698921107213554]
We publish the largest identity-annotated Holstein-Friesian cattle dataset Cows2021.
We propose exploiting the temporal coat pattern appearance across videos as a self-supervision signal for animal identity learning.
Results show a Top-1 accuracy of 57.0%, a Top-4 accuracy of 76.9%, and an Adjusted Rand Index of 0.53 compared to the ground truth.
arXiv Detail & Related papers (2021-05-05T09:08:19Z) - Unsupervised Pre-training for Person Re-identification [90.98552221699508]
We present a large-scale unlabeled person re-identification (Re-ID) dataset, "LUPerson".
We make the first attempt at performing unsupervised pre-training to improve the generalization ability of the learned person Re-ID feature representation.
arXiv Detail & Related papers (2020-12-07T14:48:26Z) - Deep Learning based Person Re-identification [2.9631016562930546]
We propose an efficient hierarchical re-identification approach in which color histogram based comparison is first employed to find the closest matches in the gallery set.
A silhouette part-based feature extraction scheme is adopted in each level of hierarchy to preserve the relative locations of the different body structures.
Results reveal that it outperforms most state-of-the-art approaches in terms of overall accuracy.
arXiv Detail & Related papers (2020-05-07T07:30:28Z)