Towards small and accurate convolutional neural networks for acoustic
biodiversity monitoring
- URL: http://arxiv.org/abs/2312.03666v1
- Date: Wed, 6 Dec 2023 18:34:01 GMT
- Title: Towards small and accurate convolutional neural networks for acoustic
biodiversity monitoring
- Authors: Serge Zaugg, Mike van der Schaar, Florence Erbs, Antonio Sanchez, Joan
V. Castell, Emiliano Ramallo, Michel Andr\'e
- Abstract summary: CNNs are fast at inference time and achieve good classification performance.
Recordings from a rainforest ecosystem were used.
RF duration was a major driver of classification performance.
- Score: 0.0
- License: http://creativecommons.org/licenses/by-sa/4.0/
- Abstract: Automated classification of animal sounds is a prerequisite for large-scale
monitoring of biodiversity. Convolutional Neural Networks (CNNs) are among the
most promising algorithms but they are slow, often achieve poor classification
in the field and typically require large training data sets. Our objective was
to design CNNs that are fast at inference time and achieve good classification
performance while learning from moderate-sized data. Recordings from a
rainforest ecosystem were used. Start and end-point of sounds from 20 bird
species were manually annotated. Spectrograms from 10 second segments were used
as CNN input. We designed simple CNNs with a frequency unwrapping layer
(SIMP-FU models) such that any output unit was connected to all spectrogram
frequencies but only to a sub-region of time, the Receptive Field (RF). Our
models allowed experimentation with different RF durations. Models either used
the time-indexed labels that encode start and end-point of sounds or simpler
segment-level labels. Models learning from time-indexed labels performed
considerably better than their segment-level counterparts. Best classification
performances was achieved for models with intermediate RF duration of 1.5
seconds. The best SIMP-FU models achieved AUCs over 0.95 in 18 of 20 classes on
the test set. On compact low-cost hardware the best SIMP-FU models evaluated up
to seven times faster than real-time data acquisition. RF duration was a major
driver of classification performance. The optimum of 1.5 s was in the same
range as the duration of the sounds. Our models achieved good classification
performance while learning from moderate-sized training data. This is explained
by the usage of time-indexed labels during training and adequately sized RF.
Results confirm the feasibility of deploying small CNNs with good
classification performance on compact low-cost devices.
Related papers
- Convolutional Fourier Analysis Network (CFAN): A Unified Time-Frequency Approach for ECG Classification [0.0]
Machine learning has transformed the classification of biomedical signals such as electrocardiograms (ECGs)
We evaluated three ECG classification tasks: (1) arrhythmia classification, (2) identity recognition, and (3) apnea detection.
We developed the Convolutional Fourier Analysis Network (CFAN), which integrates FAN with CNN.
arXiv Detail & Related papers (2025-02-01T17:30:26Z) - How neural networks learn to classify chaotic time series [77.34726150561087]
We study the inner workings of neural networks trained to classify regular-versus-chaotic time series.
We find that the relation between input periodicity and activation periodicity is key for the performance of LKCNN models.
arXiv Detail & Related papers (2023-06-04T08:53:27Z) - Convolutional Neural Networks for the classification of glitches in
gravitational-wave data streams [52.77024349608834]
We classify transient noise signals (i.e.glitches) and gravitational waves in data from the Advanced LIGO detectors.
We use models with a supervised learning approach, both trained from scratch using the Gravity Spy dataset.
We also explore a self-supervised approach, pre-training models with automatically generated pseudo-labels.
arXiv Detail & Related papers (2023-03-24T11:12:37Z) - Do We Really Need a Learnable Classifier at the End of Deep Neural
Network? [118.18554882199676]
We study the potential of learning a neural network for classification with the classifier randomly as an ETF and fixed during training.
Our experimental results show that our method is able to achieve similar performances on image classification for balanced datasets.
arXiv Detail & Related papers (2022-03-17T04:34:28Z) - Animal Behavior Classification via Accelerometry Data and Recurrent
Neural Networks [11.099308746733028]
We study the classification of animal behavior using accelerometry data through various recurrent neural network (RNN) models.
We evaluate the classification performance and complexity of the considered models.
We also include two state-of-the-art convolutional neural network (CNN)-based time-series classification models in the evaluations.
arXiv Detail & Related papers (2021-11-24T23:28:25Z) - A robust approach for deep neural networks in presence of label noise:
relabelling and filtering instances during training [14.244244290954084]
We propose a robust training strategy against label noise, called RAFNI, that can be used with any CNN.
RAFNI consists of three mechanisms: two mechanisms that filter instances and one mechanism that relabels instances.
We evaluated our algorithm using different data sets of several sizes and characteristics.
arXiv Detail & Related papers (2021-09-08T16:11:31Z) - Time-Frequency Localization Using Deep Convolutional Maxout Neural
Network in Persian Speech Recognition [0.0]
Time-frequency flexibility in some mammals' auditory neurons system improves recognition performance.
This paper proposes a CNN-based structure for time-frequency localization of audio signal information in the ASR acoustic model.
The average recognition score of TFCMNN models is about 1.6% higher than the average of conventional models.
arXiv Detail & Related papers (2021-08-09T05:46:58Z) - ANNETTE: Accurate Neural Network Execution Time Estimation with Stacked
Models [56.21470608621633]
We propose a time estimation framework to decouple the architectural search from the target hardware.
The proposed methodology extracts a set of models from micro- kernel and multi-layer benchmarks and generates a stacked model for mapping and network execution time estimation.
We compare estimation accuracy and fidelity of the generated mixed models, statistical models with the roofline model, and a refined roofline model for evaluation.
arXiv Detail & Related papers (2021-05-07T11:39:05Z) - A Two-Stage Approach to Device-Robust Acoustic Scene Classification [63.98724740606457]
Two-stage system based on fully convolutional neural networks (CNNs) is proposed to improve device robustness.
Our results show that the proposed ASC system attains a state-of-the-art accuracy on the development set.
Neural saliency analysis with class activation mapping gives new insights on the patterns learnt by our models.
arXiv Detail & Related papers (2020-11-03T03:27:18Z) - On the Performance of Convolutional Neural Networks under High and Low
Frequency Information [13.778851745408133]
We study the performance of CNN models over the high and low frequency information of the images.
We propose the filtering based data augmentation during training.
A satisfactory performance improvement has been observed in terms of robustness and low frequency generalization.
arXiv Detail & Related papers (2020-10-30T17:54:45Z) - Deep learning for gravitational-wave data analysis: A resampling
white-box approach [62.997667081978825]
We apply Convolutional Neural Networks (CNNs) to detect gravitational wave (GW) signals of compact binary coalescences, using single-interferometer data from LIGO detectors.
CNNs were quite precise to detect noise but not sensitive enough to recall GW signals, meaning that CNNs are better for noise reduction than generation of GW triggers.
arXiv Detail & Related papers (2020-09-09T03:28:57Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.