Feathers dataset for Fine-Grained Visual Categorization
- URL: http://arxiv.org/abs/2004.08606v1
- Date: Sat, 18 Apr 2020 12:40:43 GMT
- Title: Feathers dataset for Fine-Grained Visual Categorization
- Authors: Alina Belko, Konstantin Dobratulin and Andrey Kuznetsov
- Abstract summary: FeatherV1 is the first publicly available bird's plumage dataset for machine learning.
It can raise interest for a new task in fine-grained visual recognition domain.
- Score: 0.0
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: This paper introduces a novel dataset FeatherV1, containing 28,272 images of
feathers categorized by 595 bird species. It was created to perform taxonomic
identification of bird species by a single feather, which can be applied in
amateur and professional ornithology. FeatherV1 is the first publicly available
bird's plumage dataset for machine learning, and it can raise interest for a
new task in fine-grained visual recognition domain. The latest version of the
dataset can be downloaded at
https://github.com/feathers-dataset/feathersv1-dataset. We also present
feathers classification task results. We selected several deep learning
architectures (DenseNet based) for categorical crossentropy values comparison
on the provided dataset.
Related papers
- Anno-incomplete Multi-dataset Detection [67.69438032767613]
We propose a novel problem as "-incomplete Multi-dataset Detection"
We develop an end-to-end multi-task learning architecture which can accurately detect all the object categories with multiple partially annotated datasets.
arXiv Detail & Related papers (2024-08-29T03:58:21Z) - AudioProtoPNet: An interpretable deep learning model for bird sound classification [1.6298921134113031]
We present an adaption of the Prototypical Part Network (ProtoPNet) for audio classification that provides inherent interpretability through its model architecture.
Our approach is based on a ConvNeXt backbone architecture for feature extraction and learns patterns for each bird species using spectrograms of the training data.
arXiv Detail & Related papers (2024-04-16T09:37:41Z) - Leveraging Habitat Information for Fine-grained Bird Identification [4.392299539811761]
We are the first to explore integrating habitat information, one of the four major cues for identifying birds by ornithologists, into modern bird classifiers.
We focus on two leading model types: CNNs and ViTs trained on the downstream bird datasets; and original, multi-modal CLIP.
Training CNNs and ViTs with habitat-augmented data results in an improvement of up to +0.83 and +0.23 points on NABirds and CUB-200, respectively.
arXiv Detail & Related papers (2023-12-22T16:23:22Z) - BirdSAT: Cross-View Contrastive Masked Autoencoders for Bird Species
Classification and Mapping [22.30038765017189]
We propose a metadata-aware self-supervised learning(SSL) framework useful for fine-grained classification and ecological mapping of bird species around the world.
Our framework unifies two SSL strategies: Contrastive Learning(CL) and Masked Image Modeling(MIM), while also enriching the embedding space with metadata available with ground-level imagery of birds.
We demonstrate that our models learn fine-grained and geographically conditioned features of birds, by evaluating on two downstream tasks: fine-grained visual classification(FGVC) and cross-modal retrieval.
arXiv Detail & Related papers (2023-10-29T22:08:00Z) - Transfer Learning with Semi-Supervised Dataset Annotation for Birdcall
Classification [0.0]
We present working notes on transfer learning with semi-supervised dataset annotation for the BirdCLEF 2023 competition.
Our approach utilizes existing off-the-shelf models, BirdNET and MixIT, to address representation and labeling challenges in the competition.
arXiv Detail & Related papers (2023-06-29T07:56:27Z) - Thinking Like an Annotator: Generation of Dataset Labeling Instructions [59.603239753484345]
We introduce a new task, Labeling Instruction Generation, to address missing publicly available labeling instructions.
We take a reasonably annotated dataset and: 1) generate a set of examples that are visually representative of each category in the dataset; 2) provide a text label that corresponds to each of the examples.
This framework acts as a proxy to human annotators that can help to both generate a final labeling instruction set and evaluate its quality.
arXiv Detail & Related papers (2023-06-24T18:32:48Z) - Minimalist Data Wrangling with Python [4.429175633425273]
Data Wrangling with Python is envisaged as a student's first introduction to data science.
It provides a high-level overview as well as discussing key concepts in detail.
arXiv Detail & Related papers (2022-11-09T01:24:39Z) - Multi-dataset Pretraining: A Unified Model for Semantic Segmentation [97.61605021985062]
We propose a unified framework, termed as Multi-Dataset Pretraining, to take full advantage of the fragmented annotations of different datasets.
This is achieved by first pretraining the network via the proposed pixel-to-prototype contrastive loss over multiple datasets.
In order to better model the relationship among images and classes from different datasets, we extend the pixel level embeddings via cross dataset mixing.
arXiv Detail & Related papers (2021-06-08T06:13:11Z) - Simple multi-dataset detection [83.9604523643406]
We present a simple method for training a unified detector on multiple large-scale datasets.
We show how to automatically integrate dataset-specific outputs into a common semantic taxonomy.
Our approach does not require manual taxonomy reconciliation.
arXiv Detail & Related papers (2021-02-25T18:55:58Z) - Dataset Bias in Few-shot Image Recognition [57.25445414402398]
We first investigate the impact of transferable capabilities learned from base categories.
Second, we investigate performance differences on different datasets from dataset structures and different few-shot learning methods.
arXiv Detail & Related papers (2020-08-18T14:46:23Z) - Cross-dataset Training for Class Increasing Object Detection [52.34737978720484]
We present a conceptually simple, flexible and general framework for cross-dataset training in object detection.
By cross-dataset training, existing datasets can be utilized to detect the merged object classes with a single model.
While using cross-dataset training, we only need to label the new classes on the new dataset.
arXiv Detail & Related papers (2020-01-14T04:40:47Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.