Hierarchical Re-Classification: Combining Animal Classification Models with Vision Transformers
- URL: http://arxiv.org/abs/2510.14594v1
- Date: Thu, 16 Oct 2025 11:57:07 GMT
- Title: Hierarchical Re-Classification: Combining Animal Classification Models with Vision Transformers
- Authors: Hugo Markoff, Jevgenijs Galaktionovs,
- Abstract summary: We present a hierarchical re-classification system for the Animal Detect platform.<n>Our five-stage pipeline is evaluated on a segment of the LILA BC Desert Lion Conservation dataset.<n>After recovering 761 bird detections from "blank" and "animal" labels, we re-classify 456 detections labeled animal, mammal, or blank with 96.5% accuracy.
- Score: 0.0
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: State-of-the-art animal classification models like SpeciesNet provide predictions across thousands of species but use conservative rollup strategies, resulting in many animals labeled at high taxonomic levels rather than species. We present a hierarchical re-classification system for the Animal Detect platform that combines SpeciesNet EfficientNetV2-M predictions with CLIP embeddings and metric learning to refine high-level taxonomic labels toward species-level identification. Our five-stage pipeline (high-confidence acceptance, bird override, centroid building, triplet-loss metric learning, and adaptive cosine-distance scoring) is evaluated on a segment of the LILA BC Desert Lion Conservation dataset (4,018 images, 15,031 detections). After recovering 761 bird detections from "blank" and "animal" labels, we re-classify 456 detections labeled animal, mammal, or blank with 96.5% accuracy, achieving species-level identification for 64.9 percent
Related papers
- Vision Transformers for Zero-Shot Clustering of Animal Images: A Comparative Benchmarking Study [0.19116784879310023]
Manual labeling of animal images remains a significant bottleneck in ecological research.<n>This study investigates whether state-of-the-art Vision Transformer (ViT) foundation models can reduce thousands of unlabeled animal images directly to species-level clusters.
arXiv Detail & Related papers (2026-02-03T08:27:22Z) - BioCLIP 2: Emergent Properties from Scaling Hierarchical Contrastive Learning [51.341003735575335]
We find emergent behaviors in biological vision models via large-scale contrastive vision-language training.<n>We train BioCLIP 2 on TreeOfLife-200M to distinguish different species.<n>We identify emergent properties in the learned embedding space of BioCLIP 2.
arXiv Detail & Related papers (2025-05-29T17:48:20Z) - A novel approach to navigate the taxonomic hierarchy to address the Open-World Scenarios in Medicinal Plant Classification [0.0]
It is observed that existing methods for medicinal plant classification often fail to perform hierarchical classification and accurately identifying unknown species.<n>We propose a novel method, which integrates DenseNet121, Multi-Scale Self-Attention (MSSA) and cascaded classifiers for hierarchical classification.<n>Our proposed model size is almost four times less than the existing state of the art methods making it easily deploy able in real world application.
arXiv Detail & Related papers (2025-02-24T16:20:25Z) - Acoustic identification of individual animals with hierarchical contrastive learning [12.965591289179372]
We frame AIID as a hierarchical multi-label classification task.
We propose the use of hierarchy-aware loss functions to learn robust representations of individual identities.
arXiv Detail & Related papers (2024-09-13T09:37:44Z) - OpenAnimalTracks: A Dataset for Animal Track Recognition [2.3020018305241337]
We introduce OpenAnimalTracks dataset, the first publicly available labeled dataset designed to facilitate the automated classification and detection of animal footprints.
We show the potential of automated footprint identification with representative classifiers and detection models.
We hope our dataset paves the way for automated animal tracking techniques, enhancing our ability to protect and manage biodiversity.
arXiv Detail & Related papers (2024-06-14T00:37:17Z) - Low Cost Machine Vision for Insect Classification [33.7054351451505]
We present an imaging method as part of a multisensor system developed as a low-cost, scalable, open-source system.
The system is evaluated exemplarily on a dataset consisting of 16 insect species of the same as well as different genus, family and order.
It was proved that image cropping of insects is necessary for classification of species with high inter-class similarity.
arXiv Detail & Related papers (2024-04-26T15:43:24Z) - Spatial Implicit Neural Representations for Global-Scale Species Mapping [72.92028508757281]
Given a set of locations where a species has been observed, the goal is to build a model to predict whether the species is present or absent at any location.
Traditional methods struggle to take advantage of emerging large-scale crowdsourced datasets.
We use Spatial Implicit Neural Representations (SINRs) to jointly estimate the geographical range of 47k species simultaneously.
arXiv Detail & Related papers (2023-06-05T03:36:01Z) - Bugs in the Data: How ImageNet Misrepresents Biodiversity [98.98950914663813]
We analyze the 13,450 images from 269 classes that represent wild animals in the ImageNet-1k validation set.
We find that many of the classes are ill-defined or overlapping, and that 12% of the images are incorrectly labeled.
We also find that both the wildlife-related labels and images included in ImageNet-1k present significant geographical and cultural biases.
arXiv Detail & Related papers (2022-08-24T17:55:48Z) - Two-View Fine-grained Classification of Plant Species [66.75915278733197]
We propose a novel method based on a two-view leaf image representation and a hierarchical classification strategy for fine-grained recognition of plant species.
A deep metric based on Siamese convolutional neural networks is used to reduce the dependence on a large number of training samples and make the method scalable to new plant species.
arXiv Detail & Related papers (2020-05-18T21:57:47Z) - Transferring Dense Pose to Proximal Animal Classes [83.84439508978126]
We show that it is possible to transfer the knowledge existing in dense pose recognition for humans, as well as in more general object detectors and segmenters, to the problem of dense pose recognition in other classes.
We do this by establishing a DensePose model for the new animal which is also geometrically aligned to humans.
We also introduce two benchmark datasets labelled in the manner of DensePose for the class chimpanzee and use them to evaluate our approach.
arXiv Detail & Related papers (2020-02-28T21:43:53Z) - Automatic image-based identification and biomass estimation of
invertebrates [70.08255822611812]
Time-consuming sorting and identification of taxa pose strong limitations on how many insect samples can be processed.
We propose to replace the standard manual approach of human expert-based sorting and identification with an automatic image-based technology.
We use state-of-the-art Resnet-50 and InceptionV3 CNNs for the classification task.
arXiv Detail & Related papers (2020-02-05T21:38:57Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.