Towards ML Methods for Biodiversity: A Novel Wild Bee Dataset and
Evaluations of XAI Methods for ML-Assisted Rare Species Annotations
- URL: http://arxiv.org/abs/2206.07497v1
- Date: Wed, 15 Jun 2022 12:48:05 GMT
- Title: Towards ML Methods for Biodiversity: A Novel Wild Bee Dataset and
Evaluations of XAI Methods for ML-Assisted Rare Species Annotations
- Authors: Teodor Chiaburu, Felix Biessmann and Frank Hausser
- Abstract summary: Insects are a crucial part of our ecosystem. Sadly, in the past few decades, their numbers have worryingly decreased.
In an attempt to gain a better understanding of this process and monitor the insects populations, Deep Learning may offer viable solutions.
This paper presents a dataset of thoroughly annotated images of wild bees sampled from the iNaturalist database.
A ResNet model trained on the wild bee dataset achieving classification scores comparable to similar state-of-the-art models trained on other fine-grained datasets.
- Score: 3.947933139348889
- License: http://creativecommons.org/licenses/by-nc-sa/4.0/
- Abstract: Insects are a crucial part of our ecosystem. Sadly, in the past few decades,
their numbers have worryingly decreased. In an attempt to gain a better
understanding of this process and monitor the insects populations, Deep
Learning may offer viable solutions. However, given the breadth of their
taxonomy and the typical hurdles of fine grained analysis, such as high
intraclass variability compared to low interclass variability, insect
classification remains a challenging task. There are few benchmark datasets,
which impedes rapid development of better AI models. The annotation of rare
species training data, however, requires expert knowledge. Explainable
Artificial Intelligence (XAI) could assist biologists in these annotation
tasks, but choosing the optimal XAI method is difficult. Our contribution to
these research challenges is threefold: 1) a dataset of thoroughly annotated
images of wild bees sampled from the iNaturalist database, 2) a ResNet model
trained on the wild bee dataset achieving classification scores comparable to
similar state-of-the-art models trained on other fine-grained datasets and 3)
an investigation of XAI methods to support biologists in annotation tasks.
Related papers
- Explainable few-shot learning workflow for detecting invasive and exotic tree species [42.30413964219434]
This research presents an explainable few-shot learning workflow for detecting invasive and exotic tree species in the Atlantic Forest of Brazil.
By integrating a Siamese network with explainable AI (XAI), the workflow enables the classification of tree species with minimal labeled data.
Results demonstrate the effectiveness of the proposed workflow in identifying new tree species, even in data-scarce conditions.
arXiv Detail & Related papers (2024-11-01T15:45:19Z) - Explainable AI for Comparative Analysis of Intrusion Detection Models [20.683181384051395]
This research analyzes various machine learning models to the tasks of binary and multi-class classification for intrusion detection from network traffic.
We trained all models to the accuracy of 90% on the UNSW-NB15 dataset.
We also discover that Random Forest provides the best performance in terms of accuracy, time efficiency and robustness.
arXiv Detail & Related papers (2024-06-14T03:11:01Z) - Deep Neural Network Identification of Limnonectes Species and New Class
Detection Using Image Data [5.943822554753426]
Deep neural networks can successfully automate the classification of an image into a known species group for which it has been trained.
We demonstrate that the algorithm can successfully classify an image into a new class if the image does not belong to the existing classes.
arXiv Detail & Related papers (2023-11-15T02:57:59Z) - A Step Towards Worldwide Biodiversity Assessment: The BIOSCAN-1M Insect
Dataset [18.211840156134784]
This paper presents a curated million-image dataset, primarily to train computer-vision models capable of providing image-based taxonomic assessment.
The dataset also presents compelling characteristics, the study of which would be of interest to the broader machine learning community.
arXiv Detail & Related papers (2023-07-19T20:54:08Z) - CellTranspose: Few-shot Domain Adaptation for Cellular Instance
Segmentation [4.38301148531795]
We address the problem of designing an approach that requires minimal amounts of new annotated data as well as training time.
We do so by designing specialized contrastive losses that leverage the few annotated samples very efficiently.
arXiv Detail & Related papers (2022-12-28T23:00:50Z) - Gone Fishing: Neural Active Learning with Fisher Embeddings [55.08537975896764]
There is an increasing need for active learning algorithms that are compatible with deep neural networks.
This article introduces BAIT, a practical representation of tractable, and high-performing active learning algorithm for neural networks.
arXiv Detail & Related papers (2021-06-17T17:26:31Z) - Low-Regret Active learning [64.36270166907788]
We develop an online learning algorithm for identifying unlabeled data points that are most informative for training.
At the core of our work is an efficient algorithm for sleeping experts that is tailored to achieve low regret on predictable (easy) instances.
arXiv Detail & Related papers (2021-04-06T22:53:45Z) - Deep Low-Shot Learning for Biological Image Classification and
Visualization from Limited Training Samples [52.549928980694695]
In situ hybridization (ISH) gene expression pattern images from the same developmental stage are compared.
labeling training data with precise stages is very time-consuming even for biologists.
We propose a deep two-step low-shot learning framework to accurately classify ISH images using limited training images.
arXiv Detail & Related papers (2020-10-20T06:06:06Z) - Select-ProtoNet: Learning to Select for Few-Shot Disease Subtype
Prediction [55.94378672172967]
We focus on few-shot disease subtype prediction problem, identifying subgroups of similar patients.
We introduce meta learning techniques to develop a new model, which can extract the common experience or knowledge from interrelated clinical tasks.
Our new model is built upon a carefully designed meta-learner, called Prototypical Network, that is a simple yet effective meta learning machine for few-shot image classification.
arXiv Detail & Related papers (2020-09-02T02:50:30Z) - Deep Mining External Imperfect Data for Chest X-ray Disease Screening [57.40329813850719]
We argue that incorporating an external CXR dataset leads to imperfect training data, which raises the challenges.
We formulate the multi-label disease classification problem as weighted independent binary tasks according to the categories.
Our framework simultaneously models and tackles the domain and label discrepancies, enabling superior knowledge mining ability.
arXiv Detail & Related papers (2020-06-06T06:48:40Z) - Rectified Meta-Learning from Noisy Labels for Robust Image-based Plant
Disease Diagnosis [64.82680813427054]
Plant diseases serve as one of main threats to food security and crop production.
One popular approach is to transform this problem as a leaf image classification task, which can be addressed by the powerful convolutional neural networks (CNNs)
We propose a novel framework that incorporates rectified meta-learning module into common CNN paradigm to train a noise-robust deep network without using extra supervision information.
arXiv Detail & Related papers (2020-03-17T09:51:30Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.