WildlifeReID-10k: Wildlife re-identification dataset with 10k individual animals
- URL: http://arxiv.org/abs/2406.09211v3
- Date: Tue, 15 Apr 2025 06:21:22 GMT
- Title: WildlifeReID-10k: Wildlife re-identification dataset with 10k individual animals
- Authors: Lukáš Adam, Vojtěch Čermák, Kostas Papafitsoros, Lukas Picek,
- Abstract summary: This paper introduces WildlifeReID-10k, a new large-scale re-identification benchmark with more than 10k animal identities of around 33 species across more than 140k images.<n>WildlifeReID-10k covers diverse animal species and poses significant challenges for SoTA methods.<n>The dataset and benchmark are publicly available on Kaggle, along with strong baselines for both closed-set and open-set evaluation.
- Score: 0.0
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: This paper introduces WildlifeReID-10k, a new large-scale re-identification benchmark with more than 10k animal identities of around 33 species across more than 140k images, re-sampled from 37 existing datasets. WildlifeReID-10k covers diverse animal species and poses significant challenges for SoTA methods, ensuring fair and robust evaluation through its time-aware and similarity-aware split protocol. The latter is designed to address the common issue of training-to-test data leakage caused by visually similar images appearing in both training and test sets. The WildlifeReID-10k dataset and benchmark are publicly available on Kaggle, along with strong baselines for both closed-set and open-set evaluation, enabling fair, transparent, and standardized evaluation of not just multi-species animal re-identification models.
Related papers
- Multispecies Animal Re-ID Using a Large Community-Curated Dataset [0.19418036471925312]
We construct a dataset that includes 49 species, 37K individual animals, and 225K images, using this data to train a single embedding network for all species.
Our model consistently outperforms models trained separately on each species, achieving an average gain of 12.5% in top-1 accuracy.
The model is already in production use for 60+ species in a large-scale wildlife monitoring system.
arXiv Detail & Related papers (2024-12-07T09:56:33Z) - An Individual Identity-Driven Framework for Animal Re-Identification [15.381573249551181]
IndivAID is a framework specifically designed for Animal ReID.
It generates image-specific and individual-specific textual descriptions that fully capture the diverse visual concepts of each individual across animal images.
Evaluation against state-of-the-art methods across eight benchmark datasets and a real-world Stoat dataset demonstrates IndivAID's effectiveness and applicability.
arXiv Detail & Related papers (2024-10-30T11:34:55Z) - SPOTS-10: Animal Pattern Benchmark Dataset for Machine Learning Algorithms [0.08158530638728499]
SPOTS-10 is an extensive collection of grayscale images showcasing diverse patterns in ten animal species.
This dataset is a resource for evaluating machine learning algorithms in situ.
The training set comprises 40,000 images, while the test set contains 10,000 images.
arXiv Detail & Related papers (2024-10-28T14:00:02Z) - OpenAnimals: Revisiting Person Re-Identification for Animals Towards Better Generalization [10.176567936487364]
We conduct a study by revisiting several state-of-the-art person re-identification methods, including BoT, AGW, SBS, and MGN.
We evaluate their effectiveness on animal re-identification benchmarks such as HyenaID, LeopardID, SeaTurtleID, and WhaleSharkID.
Our findings reveal that while some techniques well, many do not generalize, underscoring the significant differences between the two tasks.
We propose ARBase, a strong textbfBase model tailored for textbfAnimal textbfRe-
arXiv Detail & Related papers (2024-09-30T20:07:14Z) - Addressing the Elephant in the Room: Robust Animal Re-Identification with Unsupervised Part-Based Feature Alignment [44.86310789545717]
Animal Re-ID is crucial for wildlife conservation, yet it faces unique challenges compared to person Re-ID.
This study addresses background biases by proposing a method to systematically remove backgrounds in both training and evaluation phases.
Our method achieves superior results on three key animal Re-ID datasets: ATRW, YakReID-103, and ELPephants.
arXiv Detail & Related papers (2024-05-22T16:08:06Z) - WildlifeDatasets: An open-source toolkit for animal re-identification [0.0]
WildlifeDatasets is an open-source toolkit for ecologists and computer-vision / machine-learning researchers.
WildlifeDatasets is written in Python and allows straightforward access to publicly available wildlife datasets.
We provide the first-ever foundation model for individual re-identification within a wide range of species - MegaDescriptor.
arXiv Detail & Related papers (2023-11-15T17:08:09Z) - SeaTurtleID2022: A long-span dataset for reliable sea turtle re-identification [0.0]
This paper introduces the first large-scale, long-span dataset with sea turtle photographs captured in the wild -- SeaTurtleID2022.
The dataset contains 8729 photographs of 438 unique individuals collected within 13 years.
Instead of standard "random" splits, the dataset allows for two realistic and ecologically motivated splits.
arXiv Detail & Related papers (2023-11-09T17:10:20Z) - Multimodal Foundation Models for Zero-shot Animal Species Recognition in
Camera Trap Images [57.96659470133514]
Motion-activated camera traps constitute an efficient tool for tracking and monitoring wildlife populations across the globe.
Supervised learning techniques have been successfully deployed to analyze such imagery, however training such techniques requires annotations from experts.
Reducing the reliance on costly labelled data has immense potential in developing large-scale wildlife tracking solutions with markedly less human labor.
arXiv Detail & Related papers (2023-11-02T08:32:00Z) - Wild Face Anti-Spoofing Challenge 2023: Benchmark and Results [73.98594459933008]
Face anti-spoofing (FAS) is an essential mechanism for safeguarding the integrity of automated face recognition systems.
This limitation can be attributed to the scarcity and lack of diversity in publicly available FAS datasets.
We introduce the Wild Face Anti-Spoofing dataset, a large-scale, diverse FAS dataset collected in unconstrained settings.
arXiv Detail & Related papers (2023-04-12T10:29:42Z) - SeaTurtleID2022: A long-span dataset for reliable sea turtle re-identification [0.0]
This paper introduces the first public large-scale, long-span dataset with sea turtle photographs captured in the wild.
The dataset contains 8729 photographs of 438 unique individuals collected within 13 years.
arXiv Detail & Related papers (2022-11-18T15:46:24Z) - Bugs in the Data: How ImageNet Misrepresents Biodiversity [98.98950914663813]
We analyze the 13,450 images from 269 classes that represent wild animals in the ImageNet-1k validation set.
We find that many of the classes are ill-defined or overlapping, and that 12% of the images are incorrectly labeled.
We also find that both the wildlife-related labels and images included in ImageNet-1k present significant geographical and cultural biases.
arXiv Detail & Related papers (2022-08-24T17:55:48Z) - APT-36K: A Large-scale Benchmark for Animal Pose Estimation and Tracking [77.87449881852062]
APT-36K is the first large-scale benchmark for animal pose estimation and tracking.
It consists of 2,400 video clips collected and filtered from 30 animal species with 15 frames for each video, resulting in 36,000 frames in total.
We benchmark several representative models on the following three tracks: (1) supervised animal pose estimation on a single frame under intra- and inter-domain transfer learning settings, (2) inter-species domain generalization test for unseen animals, and (3) animal pose estimation with animal tracking.
arXiv Detail & Related papers (2022-06-12T07:18:36Z) - AP-10K: A Benchmark for Animal Pose Estimation in the Wild [83.17759850662826]
We propose AP-10K, the first large-scale benchmark for general animal pose estimation.
AP-10K consists of 10,015 images collected and filtered from 23 animal families and 60 species.
Results provide sound empirical evidence on the superiority of learning from diverse animals species in terms of both accuracy and generalization ability.
arXiv Detail & Related papers (2021-08-28T10:23:34Z) - Florida Wildlife Camera Trap Dataset [48.99466876948454]
We introduce a challenging wildlife camera trap classification dataset collected from two different locations in Southwestern Florida.
The dataset consists of 104,495 images featuring visually similar species, varying illumination conditions, skewed class distribution, and including samples of endangered species.
arXiv Detail & Related papers (2021-06-23T18:53:15Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.