A Multi-view CNN-based Acoustic Classification System for Automatic
Animal Species Identification
- URL: http://arxiv.org/abs/2002.09821v1
- Date: Sun, 23 Feb 2020 03:51:08 GMT
- Title: A Multi-view CNN-based Acoustic Classification System for Automatic
Animal Species Identification
- Authors: Weitao Xu, Xiang Zhang, Lina Yao, Wanli Xue, Bo Wei
- Abstract summary: We propose a deep learning based acoustic classification framework for Wireless Acoustic Sensor Network (WASN)
The proposed framework is based on cloud architecture which relaxes the computational burden on the wireless sensor node.
To improve the recognition accuracy, we design a multi-view Convolution Neural Network (CNN) to extract the short-, middle-, and long-term dependencies in parallel.
- Score: 42.119250432849505
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Automatic identification of animal species by their vocalization is an
important and challenging task. Although many kinds of audio monitoring system
have been proposed in the literature, they suffer from several disadvantages
such as non-trivial feature selection, accuracy degradation because of
environmental noise or intensive local computation. In this paper, we propose a
deep learning based acoustic classification framework for Wireless Acoustic
Sensor Network (WASN). The proposed framework is based on cloud architecture
which relaxes the computational burden on the wireless sensor node. To improve
the recognition accuracy, we design a multi-view Convolution Neural Network
(CNN) to extract the short-, middle-, and long-term dependencies in parallel.
The evaluation on two real datasets shows that the proposed architecture can
achieve high accuracy and outperforms traditional classification systems
significantly when the environmental noise dominate the audio signal (low SNR).
Moreover, we implement and deploy the proposed system on a testbed and analyse
the system performance in real-world environments. Both simulation and
real-world evaluation demonstrate the accuracy and robustness of the proposed
acoustic classification system in distinguishing species of animals.
Related papers
- Advanced Framework for Animal Sound Classification With Features Optimization [35.2832738406242]
We propose an automated classification framework applicable to general animal sound classification.
Our approach consistently outperforms baseline methods by over 25% in precision, recall, and accuracy.
arXiv Detail & Related papers (2024-07-03T18:33:47Z) - Probing the Information Encoded in Neural-based Acoustic Models of
Automatic Speech Recognition Systems [7.207019635697126]
This article aims to determine which and where information is located in an automatic speech recognition acoustic model (AM)
Experiments are performed on speaker verification, acoustic environment classification, gender classification, tempo-distortion detection systems and speech sentiment/emotion identification.
Analysis showed that neural-based AMs hold heterogeneous information that seems surprisingly uncorrelated with phoneme recognition.
arXiv Detail & Related papers (2024-02-29T18:43:53Z) - Wider or Deeper Neural Network Architecture for Acoustic Scene
Classification with Mismatched Recording Devices [59.86658316440461]
We present a robust and low complexity system for Acoustic Scene Classification (ASC)
We first construct an ASC baseline system in which a novel inception-residual-based network architecture is proposed to deal with the mismatched recording device issue.
To further improve the performance but still satisfy the low complexity model, we apply two techniques: ensemble of multiple spectrograms and channel reduction.
arXiv Detail & Related papers (2022-03-23T10:27:41Z) - Animal Behavior Classification via Deep Learning on Embedded Systems [10.160218445628836]
We develop an end-to-end deep-neural-network-based algorithm for classifying animal behavior using accelerometry data.
We implement the proposed algorithm on the embedded system of the collar tag's AIoT device to perform in-situ classification of animal behavior.
arXiv Detail & Related papers (2021-11-24T06:26:15Z) - PILOT: Introducing Transformers for Probabilistic Sound Event
Localization [107.78964411642401]
This paper introduces a novel transformer-based sound event localization framework, where temporal dependencies in the received multi-channel audio signals are captured via self-attention mechanisms.
The framework is evaluated on three publicly available multi-source sound event localization datasets and compared against state-of-the-art methods in terms of localization error and event detection accuracy.
arXiv Detail & Related papers (2021-06-07T18:29:19Z) - Discriminative Singular Spectrum Classifier with Applications on
Bioacoustic Signal Recognition [67.4171845020675]
We present a bioacoustic signal classifier equipped with a discriminative mechanism to extract useful features for analysis and classification efficiently.
Unlike current bioacoustic recognition methods, which are task-oriented, the proposed model relies on transforming the input signals into vector subspaces.
The validity of the proposed method is verified using three challenging bioacoustic datasets containing anuran, bee, and mosquito species.
arXiv Detail & Related papers (2021-03-18T11:01:21Z) - Capturing scattered discriminative information using a deep architecture
in acoustic scene classification [49.86640645460706]
In this study, we investigate various methods to capture discriminative information and simultaneously mitigate the overfitting problem.
We adopt a max feature map method to replace conventional non-linear activations in a deep neural network.
Two data augment methods and two deep architecture modules are further explored to reduce overfitting and sustain the system's discriminative power.
arXiv Detail & Related papers (2020-07-09T08:32:06Z) - Deep Speaker Embeddings for Far-Field Speaker Recognition on Short
Utterances [53.063441357826484]
Speaker recognition systems based on deep speaker embeddings have achieved significant performance in controlled conditions.
Speaker verification on short utterances in uncontrolled noisy environment conditions is one of the most challenging and highly demanded tasks.
This paper presents approaches aimed to achieve two goals: a) improve the quality of far-field speaker verification systems in the presence of environmental noise, reverberation and b) reduce the system qualitydegradation for short utterances.
arXiv Detail & Related papers (2020-02-14T13:34:33Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.