Classification of animal sounds in a hyperdiverse rainforest using
Convolutional Neural Networks
- URL: http://arxiv.org/abs/2111.14971v1
- Date: Mon, 29 Nov 2021 21:34:57 GMT
- Title: Classification of animal sounds in a hyperdiverse rainforest using
Convolutional Neural Networks
- Authors: Yuren Sun, Tatiana Midori Maeda, Claudia Solis-Lemus, Daniel
Pimentel-Alarcon, Zuzana Burivalova
- Abstract summary: Automated species detection from passively recorded soundscapes via machine-learning approaches is a promising technique.
We use soundscapes from a tropical forest in Borneo and a Convolutional Neural Network model (CNN) created with transfer learning.
Our results suggest that transfer learning and data augmentation can make the use of CNNs to classify species' vocalizations feasible even for small soundscape-based projects with many rare species.
- Score: 0.0
- License: http://creativecommons.org/licenses/by-nc-nd/4.0/
- Abstract: To protect tropical forest biodiversity, we need to be able to detect it
reliably, cheaply, and at scale. Automated species detection from passively
recorded soundscapes via machine-learning approaches is a promising technique
towards this goal, but it is constrained by the necessity of large training
data sets. Using soundscapes from a tropical forest in Borneo and a
Convolutional Neural Network model (CNN) created with transfer learning, we
investigate i) the minimum viable training data set size for accurate
prediction of call types ('sonotypes'), and ii) the extent to which data
augmentation can overcome the issue of small training data sets. We found that
even relatively high sample sizes (> 80 per call type) lead to mediocre
accuracy, which however improves significantly with data augmentation,
including at extremely small sample sizes, regardless of taxonomic group or
call characteristics. Our results suggest that transfer learning and data
augmentation can make the use of CNNs to classify species' vocalizations
feasible even for small soundscape-based projects with many rare species. Our
open-source method has the potential to enable conservation initiatives become
more evidence-based by using soundscape data in the adaptive management of
biodiversity.
Related papers
- Explainable few-shot learning workflow for detecting invasive and exotic tree species [42.30413964219434]
This research presents an explainable few-shot learning workflow for detecting invasive and exotic tree species in the Atlantic Forest of Brazil.
By integrating a Siamese network with explainable AI (XAI), the workflow enables the classification of tree species with minimal labeled data.
Results demonstrate the effectiveness of the proposed workflow in identifying new tree species, even in data-scarce conditions.
arXiv Detail & Related papers (2024-11-01T15:45:19Z) - Deep learning-based ecological analysis of camera trap images is impacted by training data quality and size [11.153016596465593]
We analyse camera trap data from an African savannah and an Asian sub-tropical dry forest.
We compare key ecological metrics derived from expert-generated species identifications with those generated from deep neural networks.
Our results show that while model architecture has minimal impact, large amounts of noise and reduced dataset size significantly affect these metrics.
arXiv Detail & Related papers (2024-08-26T15:26:27Z) - TinyChirp: Bird Song Recognition Using TinyML Models on Low-power Wireless Acoustic Sensors [1.0790796076947324]
Monitoring biodiversity at scale is challenging.
detecting and identifying species in fine grained requires highly accurate machine learning (ML) methods.
deploying these models to low power devices requires novel compression techniques and model architectures.
arXiv Detail & Related papers (2024-07-31T08:57:42Z) - Just How Flexible are Neural Networks in Practice? [89.80474583606242]
It is widely believed that a neural network can fit a training set containing at least as many samples as it has parameters.
In practice, however, we only find solutions via our training procedure, including the gradient and regularizers, limiting flexibility.
arXiv Detail & Related papers (2024-06-17T12:24:45Z) - Transferable Models for Bioacoustics with Human Language Supervision [0.0]
BioLingual is a new model for bioacoustics based on contrastive language-audio pretraining.
It can identify over a thousand species' calls across taxa, complete bioacoustic tasks zero-shot, and retrieve animal vocalization recordings from natural text queries.
arXiv Detail & Related papers (2023-08-09T14:22:18Z) - Decision Forest Based EMG Signal Classification with Low Volume Dataset
Augmented with Random Variance Gaussian Noise [51.76329821186873]
We produce a model that can classify six different hand gestures with a limited number of samples that generalizes well to a wider audience.
We appeal to a set of more elementary methods such as the use of random bounds on a signal, but desire to show the power these methods can carry in an online setting.
arXiv Detail & Related papers (2022-06-29T23:22:18Z) - Ensembles of Vision Transformers as a New Paradigm for Automated
Classification in Ecology [0.0]
We show that ensembles of Data-efficient image Transformers (DeiTs) significantly outperform the previous state of the art (SOTA)
On all the data sets we test, we achieve a new SOTA, with a reduction of the error with respect to the previous SOTA ranging from 18.48% to 87.50%.
arXiv Detail & Related papers (2022-03-03T14:16:22Z) - Zoo-Tuning: Adaptive Transfer from a Zoo of Models [82.9120546160422]
Zoo-Tuning learns to adaptively transfer the parameters of pretrained models to the target task.
We evaluate our approach on a variety of tasks, including reinforcement learning, image classification, and facial landmark detection.
arXiv Detail & Related papers (2021-06-29T14:09:45Z) - Towards an Automatic Analysis of CHO-K1 Suspension Growth in
Microfluidic Single-cell Cultivation [63.94623495501023]
We propose a novel Machine Learning architecture, which allows us to infuse a neural deep network with human-powered abstraction on the level of data.
Specifically, we train a generative model simultaneously on natural and synthetic data, so that it learns a shared representation, from which a target variable, such as the cell count, can be reliably estimated.
arXiv Detail & Related papers (2020-10-20T08:36:51Z) - Select-ProtoNet: Learning to Select for Few-Shot Disease Subtype
Prediction [55.94378672172967]
We focus on few-shot disease subtype prediction problem, identifying subgroups of similar patients.
We introduce meta learning techniques to develop a new model, which can extract the common experience or knowledge from interrelated clinical tasks.
Our new model is built upon a carefully designed meta-learner, called Prototypical Network, that is a simple yet effective meta learning machine for few-shot image classification.
arXiv Detail & Related papers (2020-09-02T02:50:30Z) - Diversity inducing Information Bottleneck in Model Ensembles [73.80615604822435]
In this paper, we target the problem of generating effective ensembles of neural networks by encouraging diversity in prediction.
We explicitly optimize a diversity inducing adversarial loss for learning latent variables and thereby obtain diversity in the output predictions necessary for modeling multi-modal data.
Compared to the most competitive baselines, we show significant improvements in classification accuracy, under a shift in the data distribution.
arXiv Detail & Related papers (2020-03-10T03:10:41Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.