Visual Microfossil Identificationvia Deep Metric Learning
- URL: http://arxiv.org/abs/2112.09490v1
- Date: Fri, 17 Dec 2021 13:00:37 GMT
- Title: Visual Microfossil Identificationvia Deep Metric Learning
- Authors: Tayfun Karaderi, Tilo Burghardt, Allison Y. Hsiang, Jacob Ramaer,
Daniela N. Schmidt
- Abstract summary: We apply metric learning to classifying planktic foraminifer shells on microscopic images.
We produce the first scientific visualisation of the phenotypic planktic foraminifer space.
We show that metric learning out-performs all published CNN-based state-of-the-art benchmarks in this domain.
- Score: 1.3199511198128897
- License: http://creativecommons.org/licenses/by-nc-nd/4.0/
- Abstract: We apply deep metric learning for the first time to the prob-lem of
classifying planktic foraminifer shells on microscopic images. This species
recognition task is an important information source and scientific pillar for
reconstructing past climates. All foraminifer CNN recognition pipelines in the
literature produce black-box classifiers that lack visualisation options for
human experts and cannot be applied to open set problems. Here, we benchmark
metric learning against these pipelines, produce the first scientific
visualisation of the phenotypic planktic foraminifer morphology space, and
demonstrate that metric learning can be used to cluster species unseen during
training. We show that metric learning out-performs all published CNN-based
state-of-the-art benchmarks in this domain. We evaluate our approach on the
34,640 expert-annotated images of the Endless Forams public library of 35
modern planktic foraminifera species. Our results on this data show leading 92%
accuracy (at 0.84 F1-score) in reproducing expert labels on withheld test data,
and 66.5% accuracy (at 0.70 F1-score) when clustering species never encountered
in training. We conclude that metric learning is highly effective for this
domain and serves as an important tool towards expert-in-the-loop automation of
microfossil identification. Key code, network weights, and data splits are
published with this paper for full reproducibility.
Related papers
- High-Throughput Phenotyping using Computer Vision and Machine Learning [0.0]
We used a dataset provided by Oak Ridge National Laboratory with 1,672 images of Populus Trichocarpa with white labels displaying treatment.
Optical character recognition (OCR) was used to read these labels on the plants.
Machine learning models were used to predict treatment based on those classifications, and analyzed encoded EXIF tags were used for the purpose of finding leaf size and correlations between phenotypes.
arXiv Detail & Related papers (2024-07-08T19:46:31Z) - WhaleNet: a Novel Deep Learning Architecture for Marine Mammals Vocalizations on Watkins Marine Mammal Sound Database [49.1574468325115]
We introduce textbfWhaleNet (Wavelet Highly Adaptive Learning Ensemble Network), a sophisticated deep ensemble architecture for the classification of marine mammal vocalizations.
We achieve an improvement in classification accuracy by $8-10%$ over existing architectures, corresponding to a classification accuracy of $97.61%$.
arXiv Detail & Related papers (2024-02-20T11:36:23Z) - A Saliency-based Clustering Framework for Identifying Aberrant
Predictions [49.1574468325115]
We introduce the concept of aberrant predictions, emphasizing that the nature of classification errors is as critical as their frequency.
We propose a novel, efficient training methodology aimed at both reducing the misclassification rate and discerning aberrant predictions.
We apply this methodology to the less-explored domain of veterinary radiology, where the stakes are high but have not been as extensively studied compared to human medicine.
arXiv Detail & Related papers (2023-11-11T01:53:59Z) - Spatial Implicit Neural Representations for Global-Scale Species Mapping [72.92028508757281]
Given a set of locations where a species has been observed, the goal is to build a model to predict whether the species is present or absent at any location.
Traditional methods struggle to take advantage of emerging large-scale crowdsourced datasets.
We use Spatial Implicit Neural Representations (SINRs) to jointly estimate the geographical range of 47k species simultaneously.
arXiv Detail & Related papers (2023-06-05T03:36:01Z) - Deep Visual-Genetic Biometrics for Taxonomic Classification of Rare
Species [1.9819034119774483]
We propose aligned visual-genetic inference spaces with the aim to implicitly encode cross-domain associations for improved performance.
We experimentally demonstrate the efficacy of the concept via application to microscopic imagery of 30k+ planktic foraminifer shells.
Visual-genetic alignment can significantly benefit visual-only recognition of the rarest species.
arXiv Detail & Related papers (2023-05-11T10:04:27Z) - Tree-Based Learning on Amperometric Time Series Data Demonstrates High
Accuracy for Classification [0.0]
We present a universal method for the classification with respect to diverse amperometric datasets using data-driven approaches in computational science.
We demonstrate a very high prediction accuracy (greater than or equal to 95%)
This is one of the first studies that propose a scheme for machine learning, and in particular, supervised learning on full amperometry time series data.
arXiv Detail & Related papers (2023-02-06T09:44:53Z) - Triple-stream Deep Metric Learning of Great Ape Behavioural Actions [3.8820728151341717]
We propose the first metric learning system for the recognition of great ape behavioural actions.
Our proposed triple stream embedding architecture works on camera trap videos taken directly in the wild.
arXiv Detail & Related papers (2023-01-06T18:36:04Z) - White Matter Tracts are Point Clouds: Neuropsychological Score
Prediction and Critical Region Localization via Geometric Deep Learning [68.5548609642999]
We propose a deep-learning-based framework for neuropsychological score prediction using white matter tract data.
We represent the arcuate fasciculus (AF) as a point cloud with microstructure measurements at each point.
We improve prediction performance with the proposed Paired-Siamese Loss that utilizes information about differences between continuous neuropsychological scores.
arXiv Detail & Related papers (2022-07-06T02:03:28Z) - Combining Feature and Instance Attribution to Detect Artifacts [62.63504976810927]
We propose methods to facilitate identification of training data artifacts.
We show that this proposed training-feature attribution approach can be used to uncover artifacts in training data.
We execute a small user study to evaluate whether these methods are useful to NLP researchers in practice.
arXiv Detail & Related papers (2021-07-01T09:26:13Z) - Automatic identification of fossils and abiotic grains during carbonate
microfacies analysis using deep convolutional neural networks [1.520387509697271]
Petrographic analysis based on microfacies identification in thin sections is widely used in sedimentary environment interpretation and paleoecological reconstruction.
Distinguishing the morphological and microstructural diversity of skeletal fragments requires extensive prior knowledge of fossil morphotypes in microfacies.
arXiv Detail & Related papers (2020-09-24T00:58:48Z) - Omni-supervised Facial Expression Recognition via Distilled Data [120.11782405714234]
We propose omni-supervised learning to exploit reliable samples in a large amount of unlabeled data for network training.
We experimentally verify that the new dataset can significantly improve the ability of the learned FER model.
To tackle this, we propose to apply a dataset distillation strategy to compress the created dataset into several informative class-wise images.
arXiv Detail & Related papers (2020-05-18T09:36:51Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.