Related papers: Visual Microfossil Identificationvia Deep Metric Learning

Visual Microfossil Identificationvia Deep Metric Learning

URL: http://arxiv.org/abs/2112.09490v1
Date: Fri, 17 Dec 2021 13:00:37 GMT
Title: Visual Microfossil Identificationvia Deep Metric Learning
Authors: Tayfun Karaderi, Tilo Burghardt, Allison Y. Hsiang, Jacob Ramaer, Daniela N. Schmidt
Abstract summary: We apply metric learning to classifying planktic foraminifer shells on microscopic images. We produce the first scientific visualisation of the phenotypic planktic foraminifer space. We show that metric learning out-performs all published CNN-based state-of-the-art benchmarks in this domain.
Score: 1.3199511198128897
License: http://creativecommons.org/licenses/by-nc-nd/4.0/
Abstract: We apply deep metric learning for the first time to the prob-lem of classifying planktic foraminifer shells on microscopic images. This species recognition task is an important information source and scientific pillar for reconstructing past climates. All foraminifer CNN recognition pipelines in the literature produce black-box classifiers that lack visualisation options for human experts and cannot be applied to open set problems. Here, we benchmark metric learning against these pipelines, produce the first scientific visualisation of the phenotypic planktic foraminifer morphology space, and demonstrate that metric learning can be used to cluster species unseen during training. We show that metric learning out-performs all published CNN-based state-of-the-art benchmarks in this domain. We evaluate our approach on the 34,640 expert-annotated images of the Endless Forams public library of 35 modern planktic foraminifera species. Our results on this data show leading 92% accuracy (at 0.84 F1-score) in reproducing expert labels on withheld test data, and 66.5% accuracy (at 0.70 F1-score) when clustering species never encountered in training. We conclude that metric learning is highly effective for this domain and serves as an important tool towards expert-in-the-loop automation of microfossil identification. Key code, network weights, and data splits are published with this paper for full reproducibility.

Related papers

DExNet: Combining Observations of Domain Adapted Critics for Leaf Disease Classification with Limited Data [1.124958340749622]
This work proposes a few-shot learning framework, Domain-adapted Expert Network (DExNet), for plant disease classification.<n>It starts with extracting the feature embeddings as 'observations' from nine 'critics' that are state-of-the-art pre-trained CNN-based architectures.<n>The proposed pipeline is evaluated on the 10 classes of tomato leaf images from the PlantVillage dataset.
arXiv Detail & Related papers (2025-06-22T21:15:54Z)
High-Throughput Phenotyping using Computer Vision and Machine Learning [0.0]
We used a dataset provided by Oak Ridge National Laboratory with 1,672 images of Populus Trichocarpa with white labels displaying treatment. Optical character recognition (OCR) was used to read these labels on the plants. Machine learning models were used to predict treatment based on those classifications, and analyzed encoded EXIF tags were used for the purpose of finding leaf size and correlations between phenotypes.
arXiv Detail & Related papers (2024-07-08T19:46:31Z)
WhaleNet: a Novel Deep Learning Architecture for Marine Mammals Vocalizations on Watkins Marine Mammal Sound Database [49.1574468325115]
We introduce textbfWhaleNet (Wavelet Highly Adaptive Learning Ensemble Network), a sophisticated deep ensemble architecture for the classification of marine mammal vocalizations. We achieve an improvement in classification accuracy by $8-10%$ over existing architectures, corresponding to a classification accuracy of $97.61%$.
arXiv Detail & Related papers (2024-02-20T11:36:23Z)
A Saliency-based Clustering Framework for Identifying Aberrant Predictions [49.1574468325115]
We introduce the concept of aberrant predictions, emphasizing that the nature of classification errors is as critical as their frequency. We propose a novel, efficient training methodology aimed at both reducing the misclassification rate and discerning aberrant predictions. We apply this methodology to the less-explored domain of veterinary radiology, where the stakes are high but have not been as extensively studied compared to human medicine.
arXiv Detail & Related papers (2023-11-11T01:53:59Z)
Spatial Implicit Neural Representations for Global-Scale Species Mapping [72.92028508757281]
Given a set of locations where a species has been observed, the goal is to build a model to predict whether the species is present or absent at any location. Traditional methods struggle to take advantage of emerging large-scale crowdsourced datasets. We use Spatial Implicit Neural Representations (SINRs) to jointly estimate the geographical range of 47k species simultaneously.
arXiv Detail & Related papers (2023-06-05T03:36:01Z)
Deep Visual-Genetic Biometrics for Taxonomic Classification of Rare Species [1.9819034119774483]
We propose aligned visual-genetic inference spaces with the aim to implicitly encode cross-domain associations for improved performance. We experimentally demonstrate the efficacy of the concept via application to microscopic imagery of 30k+ planktic foraminifer shells. Visual-genetic alignment can significantly benefit visual-only recognition of the rarest species.
arXiv Detail & Related papers (2023-05-11T10:04:27Z)
Tree-Based Learning on Amperometric Time Series Data Demonstrates High Accuracy for Classification [0.0]
We present a universal method for the classification with respect to diverse amperometric datasets using data-driven approaches in computational science. We demonstrate a very high prediction accuracy (greater than or equal to 95%) This is one of the first studies that propose a scheme for machine learning, and in particular, supervised learning on full amperometry time series data.
arXiv Detail & Related papers (2023-02-06T09:44:53Z)
Triple-stream Deep Metric Learning of Great Ape Behavioural Actions [3.8820728151341717]
We propose the first metric learning system for the recognition of great ape behavioural actions. Our proposed triple stream embedding architecture works on camera trap videos taken directly in the wild.
arXiv Detail & Related papers (2023-01-06T18:36:04Z)
White Matter Tracts are Point Clouds: Neuropsychological Score Prediction and Critical Region Localization via Geometric Deep Learning [68.5548609642999]
We propose a deep-learning-based framework for neuropsychological score prediction using white matter tract data. We represent the arcuate fasciculus (AF) as a point cloud with microstructure measurements at each point. We improve prediction performance with the proposed Paired-Siamese Loss that utilizes information about differences between continuous neuropsychological scores.
arXiv Detail & Related papers (2022-07-06T02:03:28Z)
Combining Feature and Instance Attribution to Detect Artifacts [62.63504976810927]
We propose methods to facilitate identification of training data artifacts. We show that this proposed training-feature attribution approach can be used to uncover artifacts in training data. We execute a small user study to evaluate whether these methods are useful to NLP researchers in practice.
arXiv Detail & Related papers (2021-07-01T09:26:13Z)
Automatic identification of fossils and abiotic grains during carbonate microfacies analysis using deep convolutional neural networks [1.520387509697271]
Petrographic analysis based on microfacies identification in thin sections is widely used in sedimentary environment interpretation and paleoecological reconstruction. Distinguishing the morphological and microstructural diversity of skeletal fragments requires extensive prior knowledge of fossil morphotypes in microfacies.
arXiv Detail & Related papers (2020-09-24T00:58:48Z)
Omni-supervised Facial Expression Recognition via Distilled Data [120.11782405714234]
We propose omni-supervised learning to exploit reliable samples in a large amount of unlabeled data for network training. We experimentally verify that the new dataset can significantly improve the ability of the learned FER model. To tackle this, we propose to apply a dataset distillation strategy to compress the created dataset into several informative class-wise images.
arXiv Detail & Related papers (2020-05-18T09:36:51Z)

This list is automatically generated from the titles and abstracts of the papers in this site.