SeaTurtleID2022: A long-span dataset for reliable sea turtle re-identification
- URL: http://arxiv.org/abs/2211.10307v4
- Date: Wed, 1 May 2024 13:16:09 GMT
- Title: SeaTurtleID2022: A long-span dataset for reliable sea turtle re-identification
- Authors: Lukáš Adam, Vojtěch Čermák, Kostas Papafitsoros, Lukáš Picek,
- Abstract summary: This paper introduces the first public large-scale, long-span dataset with sea turtle photographs captured in the wild.
The dataset contains 8729 photographs of 438 unique individuals collected within 13 years.
- Score: 0.0
- License: http://creativecommons.org/licenses/by-nc-nd/4.0/
- Abstract: This paper introduces the first public large-scale, long-span dataset with sea turtle photographs captured in the wild -- \href{https://www.kaggle.com/datasets/wildlifedatasets/seaturtleid2022}{SeaTurtleID2022}. The dataset contains 8729 photographs of 438 unique individuals collected within 13 years, making it the longest-spanned dataset for animal re-identification. All photographs include various annotations, e.g., identity, encounter timestamp, and body parts segmentation masks. Instead of standard "random" splits, the dataset allows for two realistic and ecologically motivated splits: (i) a \textit{time-aware closed-set} with training, validation, and test data from different days/years, and (ii) a \textit{time-aware open-set} with new unknown individuals in test and validation sets. We show that time-aware splits are essential for benchmarking re-identification methods, as random splits lead to performance overestimation. Furthermore, a baseline instance segmentation and re-identification performance over various body parts is provided. Finally, an end-to-end system for sea turtle re-identification is proposed and evaluated. The proposed system based on Hybrid Task Cascade for head instance segmentation and ArcFace-trained feature-extractor achieved an accuracy of 86.8\%.
Related papers
- A Dataset for Semantic Segmentation in the Presence of Unknowns [49.795683850385956]
Existing datasets allow evaluation of only knowns or unknowns - but not both.
We propose a novel anomaly segmentation dataset, ISSU, that features a diverse set of anomaly inputs from cluttered real-world environments.
The dataset is twice larger than existing anomaly segmentation datasets.
arXiv Detail & Related papers (2025-03-28T10:31:01Z) - Multispecies Animal Re-ID Using a Large Community-Curated Dataset [0.19418036471925312]
We construct a dataset that includes 49 species, 37K individual animals, and 225K images, using this data to train a single embedding network for all species.
Our model consistently outperforms models trained separately on each species, achieving an average gain of 12.5% in top-1 accuracy.
The model is already in production use for 60+ species in a large-scale wildlife monitoring system.
arXiv Detail & Related papers (2024-12-07T09:56:33Z) - FungiTastic: A multi-modal dataset and benchmark for image categorization [21.01939456569417]
We introduce a new benchmark and a dataset, FungiTastic, based on fungal records continuously collected over a twenty-year span.
The dataset is labeled and curated by experts and consists of about 350k multimodal observations of 5k fine-grained categories (species)
FungiTastic is one of the few benchmarks that include a test set with DNA-sequenced ground truth of unprecedented label reliability.
arXiv Detail & Related papers (2024-08-24T17:22:46Z) - WildlifeReID-10k: Wildlife re-identification dataset with 10k individual animals [0.0]
This paper introduces WildlifeReID-10k, a new large-scale re-identification benchmark with more than 10k animal identities of around 33 species across more than 140k images.
WildlifeReID-10k covers diverse animal species and poses significant challenges for SoTA methods.
The dataset and benchmark are publicly available on Kaggle, along with strong baselines for both closed-set and open-set evaluation.
arXiv Detail & Related papers (2024-06-13T15:15:07Z) - Universal Bovine Identification via Depth Data and Deep Metric Learning [1.6605913858547239]
This paper proposes and evaluates, for the first time, a depth-only deep learning system for accurately identifying individual cattle.
An increase in herd size skews the cow-to-human ratio at the farm and makes the manual monitoring of individuals more challenging.
Underpinned by our previous work, this paper introduces a deep-metric learning method for cattle identification using depth data from an off-the-shelf 3D camera.
arXiv Detail & Related papers (2024-03-29T22:03:53Z) - SeaTurtleID2022: A long-span dataset for reliable sea turtle re-identification [0.0]
This paper introduces the first large-scale, long-span dataset with sea turtle photographs captured in the wild -- SeaTurtleID2022.
The dataset contains 8729 photographs of 438 unique individuals collected within 13 years.
Instead of standard "random" splits, the dataset allows for two realistic and ecologically motivated splits.
arXiv Detail & Related papers (2023-11-09T17:10:20Z) - SwinFace: A Multi-task Transformer for Face Recognition, Expression
Recognition, Age Estimation and Attribute Estimation [60.94239810407917]
This paper presents a multi-purpose algorithm for simultaneous face recognition, facial expression recognition, age estimation, and face attribute estimation based on a single Swin Transformer.
To address the conflicts among multiple tasks, a Multi-Level Channel Attention (MLCA) module is integrated into each task-specific analysis.
Experiments show that the proposed model has a better understanding of the face and achieves excellent performance for all tasks.
arXiv Detail & Related papers (2023-08-22T15:38:39Z) - MomentDiff: Generative Video Moment Retrieval from Random to Real [71.40038773943638]
We provide a generative diffusion-based framework called MomentDiff.
MomentDiff simulates a typical human retrieval process from random browsing to gradual localization.
We show that MomentDiff consistently outperforms state-of-the-art methods on three public benchmarks.
arXiv Detail & Related papers (2023-07-06T09:12:13Z) - Navya3DSeg -- Navya 3D Semantic Segmentation Dataset & split generation
for autonomous vehicles [63.20765930558542]
3D semantic data are useful for core perception tasks such as obstacle detection and ego-vehicle localization.
We propose a new dataset, Navya 3D (Navya3DSeg), with a diverse label space corresponding to a large scale production grade operational domain.
It contains 23 labeled sequences and 25 supplementary sequences without labels, designed to explore self-supervised and semi-supervised semantic segmentation benchmarks on point clouds.
arXiv Detail & Related papers (2023-02-16T13:41:19Z) - End-to-end Person Search Sequentially Trained on Aggregated Dataset [1.9766522384767227]
We propose a new end-to-end model that jointly computes detection and feature extraction steps.
We show that aggregating more pedestrian detection datasets without costly identity annotations makes the shared feature maps more generic.
arXiv Detail & Related papers (2022-01-24T11:22:15Z) - Multi-dataset Pretraining: A Unified Model for Semantic Segmentation [97.61605021985062]
We propose a unified framework, termed as Multi-Dataset Pretraining, to take full advantage of the fragmented annotations of different datasets.
This is achieved by first pretraining the network via the proposed pixel-to-prototype contrastive loss over multiple datasets.
In order to better model the relationship among images and classes from different datasets, we extend the pixel level embeddings via cross dataset mixing.
arXiv Detail & Related papers (2021-06-08T06:13:11Z) - Large-Scale Spatio-Temporal Person Re-identification: Algorithm and
Benchmark [100.77540900932763]
We contribute a novel Large-scale Spatio-Temporal (LaST) person re-ID dataset, including 10,860 identities with more than 224k images.
LaST presents more challenging and high-diversity reID settings, and significantly larger spatial and temporal ranges.
arXiv Detail & Related papers (2021-05-31T16:05:51Z) - Unsupervised Pre-training for Person Re-identification [90.98552221699508]
We present a large scale unlabeled person re-identification (Re-ID) dataset "LUPerson"
We make the first attempt of performing unsupervised pre-training for improving the generalization ability of the learned person Re-ID feature representation.
arXiv Detail & Related papers (2020-12-07T14:48:26Z) - Deep Learning based Person Re-identification [2.9631016562930546]
We propose an efficient hierarchical re-identification approach in which color histogram based comparison is first employed to find the closest matches in the gallery set.
A silhouette part-based feature extraction scheme is adopted in each level of hierarchy to preserve the relative locations of the different body structures.
Results reveal that it outperforms most state-of-the-art approaches in terms of overall accuracy.
arXiv Detail & Related papers (2020-05-07T07:30:28Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.