Image-based Automated Species Identification: Can Virtual Data
Augmentation Overcome Problems of Insufficient Sampling?
- URL: http://arxiv.org/abs/2010.09009v1
- Date: Sun, 18 Oct 2020 15:44:45 GMT
- Title: Image-based Automated Species Identification: Can Virtual Data
Augmentation Overcome Problems of Insufficient Sampling?
- Authors: Morris Klasen, Dirk Ahrens, Jonas Eberle, and Volker Steinhage
- Abstract summary: We present a two-level data augmentation approach to automated visual species identification.
The first level of data augmentation applies classic approaches of data augmentation and generation of faked images.
The second level of data augmentation employs synthetic additional sampling in feature space by an oversampling algorithm in vector space.
- Score: 0.0
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Automated species identification and delimitation is challenging,
particularly in rare and thus often scarcely sampled species, which do not
allow sufficient discrimination of infraspecific versus interspecific
variation. Typical problems arising from either low or exaggerated
interspecific morphological differentiation are best met by automated methods
of machine learning that learn efficient and effective species identification
from training samples. However, limited infraspecific sampling remains a key
challenge also in machine learning. 1In this study, we assessed whether a
two-level data augmentation approach may help to overcome the problem of scarce
training data in automated visual species identification. The first level of
visual data augmentation applies classic approaches of data augmentation and
generation of faked images using a GAN approach. Descriptive feature vectors
are derived from bottleneck features of a VGG-16 convolutional neural network
(CNN) that are then stepwise reduced in dimensionality using Global Average
Pooling and PCA to prevent overfitting. The second level of data augmentation
employs synthetic additional sampling in feature space by an oversampling
algorithm in vector space (SMOTE). Applied on two challenging datasets of
scarab beetles (Coleoptera), our augmentation approach outperformed a
non-augmented deep learning baseline approach as well as a traditional 2D
morphometric approach (Procrustes analysis).
Related papers
- Open-Set Deepfake Detection: A Parameter-Efficient Adaptation Method with Forgery Style Mixture [58.60915132222421]
We introduce an approach that is both general and parameter-efficient for face forgery detection.
We design a forgery-style mixture formulation that augments the diversity of forgery source domains.
We show that the designed model achieves state-of-the-art generalizability with significantly reduced trainable parameters.
arXiv Detail & Related papers (2024-08-23T01:53:36Z) - Leveraging Latent Diffusion Models for Training-Free In-Distribution Data Augmentation for Surface Defect Detection [9.784793380119806]
We introduce DIAG, a training-free Diffusion-based In-distribution Anomaly Generation pipeline for data augmentation.
Unlike conventional image generation techniques, we implement a human-in-the-loop pipeline, where domain experts provide multimodal guidance to the model.
We demonstrate the efficacy and versatility of DIAG with respect to state-of-the-art data augmentation approaches on the challenging KSDD2 dataset.
arXiv Detail & Related papers (2024-07-04T14:28:52Z) - MAPL: Memory Augmentation and Pseudo-Labeling for Semi-Supervised Anomaly Detection [0.0]
A new meth-odology for detecting surface defects in in-dustrial settings is introduced, referred to as Memory Augmentation and Pseudo-Labeling(MAPL)
The methodology first in-troduces an anomaly simulation strategy, which significantly improves the model's ability to recognize rare or unknown anom-aly types.
An end-to-end learning framework is employed by MAPL to identify the abnormal regions directly from the input data.
arXiv Detail & Related papers (2024-05-10T02:26:35Z) - Self-supervised Feature Adaptation for 3D Industrial Anomaly Detection [59.41026558455904]
We focus on multi-modal anomaly detection. Specifically, we investigate early multi-modal approaches that attempted to utilize models pre-trained on large-scale visual datasets.
We propose a Local-to-global Self-supervised Feature Adaptation (LSFA) method to finetune the adaptors and learn task-oriented representation toward anomaly detection.
arXiv Detail & Related papers (2024-01-06T07:30:41Z) - Domain Adaptive Synapse Detection with Weak Point Annotations [63.97144211520869]
We present AdaSyn, a framework for domain adaptive synapse detection with weak point annotations.
In the WASPSYN challenge at I SBI 2023, our method ranks the 1st place.
arXiv Detail & Related papers (2023-08-31T05:05:53Z) - Cluster-level pseudo-labelling for source-free cross-domain facial
expression recognition [94.56304526014875]
We propose the first Source-Free Unsupervised Domain Adaptation (SFUDA) method for Facial Expression Recognition (FER)
Our method exploits self-supervised pretraining to learn good feature representations from the target data.
We validate the effectiveness of our method in four adaptation setups, proving that it consistently outperforms existing SFUDA methods when applied to FER.
arXiv Detail & Related papers (2022-10-11T08:24:50Z) - Decision Forest Based EMG Signal Classification with Low Volume Dataset
Augmented with Random Variance Gaussian Noise [51.76329821186873]
We produce a model that can classify six different hand gestures with a limited number of samples that generalizes well to a wider audience.
We appeal to a set of more elementary methods such as the use of random bounds on a signal, but desire to show the power these methods can carry in an online setting.
arXiv Detail & Related papers (2022-06-29T23:22:18Z) - Deep Feature Screening: Feature Selection for Ultra High-Dimensional
Data via Deep Neural Networks [4.212520096619388]
We propose a novel two-step nonparametric approach called Deep Feature Screening (DeepFS)
DeepFS can identify significant features with high precision for ultra high-dimensional, low-sample-size data.
The superiority of DeepFS is demonstrated via extensive simulation studies and real data analyses.
arXiv Detail & Related papers (2022-04-04T17:51:49Z) - Multiscale Convolutional Transformer with Center Mask Pretraining for
Hyperspectral Image Classificationtion [14.33259265286265]
We propose a noval multi-scale convolutional embedding module for hyperspectral images (HSI) to realize effective extraction of spatial-spectral information.
Similar to Mask autoencoder, but our pre-training method only masks the corresponding token of the central pixel in the encoder, and inputs the remaining token into the decoder to reconstruct the spectral information of the central pixel.
arXiv Detail & Related papers (2022-03-09T14:42:26Z) - Weakly Supervised Change Detection Using Guided Anisotropic Difusion [97.43170678509478]
We propose original ideas that help us to leverage such datasets in the context of change detection.
First, we propose the guided anisotropic diffusion (GAD) algorithm, which improves semantic segmentation results.
We then show its potential in two weakly-supervised learning strategies tailored for change detection.
arXiv Detail & Related papers (2021-12-31T10:03:47Z) - Exploiting Invariance in Training Deep Neural Networks [4.169130102668252]
Inspired by two basic mechanisms in animal visual systems, we introduce a feature transform technique that imposes invariance properties in the training of deep neural networks.
The resulting algorithm requires less parameter tuning, trains well with an initial learning rate 1.0, and easily generalizes to different tasks.
Tested on ImageNet, MS COCO, and Cityscapes datasets, our proposed technique requires fewer iterations to train, surpasses all baselines by a large margin, seamlessly works on both small and large batch size training, and applies to different computer vision tasks of image classification, object detection, and semantic segmentation.
arXiv Detail & Related papers (2021-03-30T19:18:31Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.