Western Mediterranean wetlands bird species classification: evaluating
small-footprint deep learning approaches on a new annotated dataset
- URL: http://arxiv.org/abs/2207.05393v1
- Date: Tue, 12 Jul 2022 08:48:12 GMT
- Title: Western Mediterranean wetlands bird species classification: evaluating
small-footprint deep learning approaches on a new annotated dataset
- Authors: Juan G\'omez-G\'omez, Ester Vida\~na-Vila, Xavier Sevillano
- Abstract summary: This work presents a critical comparative analysis between two heavy and large footprint deep neural networks (VGG16 and ResNet50) and a lightweight alternative, MobileNetV2.
Our experimental results reveal that MobileNetV2 achieves an average F1-score less than a 5% lower than ResNet50.
We have created and made public the Western Mediterranean Wetland Birds dataset, consisting of 201.6 minutes and 5,795 audio excerpts of 20 endemic bird species of the Aiguamolls de l'Emporda Natural Park.
- Score: 0.0
- License: http://creativecommons.org/licenses/by-nc-nd/4.0/
- Abstract: The deployment of an expert system running over a wireless acoustic sensors
network made up of bioacoustic monitoring devices that recognise bird species
from their sounds would enable the automation of many tasks of ecological
value, including the analysis of bird population composition or the detection
of endangered species in areas of environmental interest. Endowing these
devices with accurate audio classification capabilities is possible thanks to
the latest advances in artificial intelligence, among which deep learning
techniques excel. However, a key issue to make bioacoustic devices affordable
is the use of small footprint deep neural networks that can be embedded in
resource and battery constrained hardware platforms. For this reason, this work
presents a critical comparative analysis between two heavy and large footprint
deep neural networks (VGG16 and ResNet50) and a lightweight alternative,
MobileNetV2. Our experimental results reveal that MobileNetV2 achieves an
average F1-score less than a 5\% lower than ResNet50 (0.789 vs. 0.834),
performing better than VGG16 with a footprint size nearly 40 times smaller.
Moreover, to compare the models, we have created and made public the Western
Mediterranean Wetland Birds dataset, consisting of 201.6 minutes and 5,795
audio excerpts of 20 endemic bird species of the Aiguamolls de l'Empord\`a
Natural Park.
Related papers
- Investigating Target Class Influence on Neural Network Compressibility for Energy-Autonomous Avian Monitoring [36.85680419418593]
We propose running machine learning models on inexpensive microcontroller units (MCUs) directly in the field.<n>We trained and compressed models for various numbers of target classes to assess the detection of multiple bird species.<n>We also provide benchmarking results for different hardware platforms and evaluate the feasibility of deploying energy-autonomous devices.
arXiv Detail & Related papers (2026-02-19T16:24:33Z) - Autonomous AI Bird Feeder for Backyard Biodiversity Monitoring [0.0]
This paper presents a low cost, on premise system for autonomous backyard bird monitoring in Belgian urban gardens.<n>A motion triggered IP camera uploads short clips via FTP to a local server, where frames are sampled and birds are localized with Detectron2.<n> cropped regions are then classified by an EfficientNet-B3 model fine tuned on a 40-species Belgian subset derived from a larger Kaggle corpus.
arXiv Detail & Related papers (2025-08-13T00:27:22Z) - Evaluating Deep Learning Models for African Wildlife Image Classification: From DenseNet to Vision Transformers [3.4801331938495705]
Wildlife populations in Africa face severe threats, with vertebrate numbers declining by over 65% in the past five decades.<n>In response, image classification using deep learning has emerged as a promising tool for biodiversity monitoring and conservation.<n>This paper presents a comparative study of deep learning models for automatically classifying African wildlife images.
arXiv Detail & Related papers (2025-07-28T22:18:13Z) - The iNaturalist Sounds Dataset [60.157076990024606]
iNatSounds is a collection of 230,000 audio files capturing sounds from over 5,500 species, contributed by more than 27,000 recordists worldwide.<n>The dataset encompasses sounds from birds, mammals, insects, reptiles, and amphibians, with audio and species labels derived from observations submitted to iNaturalist.<n>We envision models trained on this data powering next-generation public engagement applications, and assisting biologists, ecologists, and land use managers in processing large audio collections.
arXiv Detail & Related papers (2025-05-31T02:07:37Z) - ECOSoundSet: a finely annotated dataset for the automated acoustic identification of Orthoptera and Cicadidae in North, Central and temperate Western Europe [51.82780272068934]
We present ECOSoundSet (European Cicadidae and Orthoptera Sound dataSet), a dataset containing 10,653 recordings of 200 orthopteran and 24 cicada species (217 and 26 respective taxa when including subspecies) present in North, Central, and temperate Western Europe.
This dataset could serve as a meaningful complement to recordings already available online for the training of deep learning algorithms for the acoustic classification of orthopterans and cicadas in North, Central, and temperate Western Europe.
arXiv Detail & Related papers (2025-04-29T13:53:33Z) - A Bird Song Detector for improving bird identification through Deep Learning: a case study from DoƱana [2.7924253850013416]
A key challenge in bird species identification is that many recordings lack target species or contain overlapping vocalizations.<n>We developed a multi-stage pipeline for automatic bird vocalization identification in Donana National Park (SW Spain)<n>We first applied a Bird Song Detector to isolate bird vocalizations using spectrogram-based image processing. Then, species were classified using custom models trained at the local scale.
arXiv Detail & Related papers (2025-03-19T13:19:06Z) - TinyChirp: Bird Song Recognition Using TinyML Models on Low-power Wireless Acoustic Sensors [1.0790796076947324]
Monitoring biodiversity at scale is challenging.
detecting and identifying species in fine grained requires highly accurate machine learning (ML) methods.
deploying these models to low power devices requires novel compression techniques and model architectures.
arXiv Detail & Related papers (2024-07-31T08:57:42Z) - AudioProtoPNet: An interpretable deep learning model for bird sound classification [1.49199020343864]
This study introduces AudioProtoPNet, an adaptation of the Prototypical Part Network (ProtoPNet) for multi-label bird sound classification.
It is an inherently interpretable model that uses a ConvNeXt backbone to extract embeddings.
The model was trained on the BirdSet training dataset, which consists of 9,734 bird species and over 6,800 hours of recordings.
arXiv Detail & Related papers (2024-04-16T09:37:41Z) - Exploring Meta Information for Audio-based Zero-shot Bird Classification [113.17261694996051]
This study investigates how meta-information can improve zero-shot audio classification.
We use bird species as an example case study due to the availability of rich and diverse meta-data.
arXiv Detail & Related papers (2023-09-15T13:50:16Z) - Ultra-low Power Deep Learning-based Monocular Relative Localization
Onboard Nano-quadrotors [64.68349896377629]
This work presents a novel autonomous end-to-end system that addresses the monocular relative localization, through deep neural networks (DNNs), of two peer nano-drones.
To cope with the ultra-constrained nano-drone platform, we propose a vertically-integrated framework, including dataset augmentation, quantization, and system optimizations.
Experimental results show that our DNN can precisely localize a 10cm-size target nano-drone by employing only low-resolution monochrome images, up to 2m distance.
arXiv Detail & Related papers (2023-03-03T14:14:08Z) - Deep object detection for waterbird monitoring using aerial imagery [56.1262568293658]
In this work, we present a deep learning pipeline that can be used to precisely detect, count, and monitor waterbirds using aerial imagery collected by a commercial drone.
By utilizing convolutional neural network-based object detectors, we show that we can detect 16 classes of waterbird species that are commonly found in colonial nesting islands along the Texas coast.
arXiv Detail & Related papers (2022-10-10T17:37:56Z) - Classification of animal sounds in a hyperdiverse rainforest using
Convolutional Neural Networks [0.0]
Automated species detection from passively recorded soundscapes via machine-learning approaches is a promising technique.
We use soundscapes from a tropical forest in Borneo and a Convolutional Neural Network model (CNN) created with transfer learning.
Our results suggest that transfer learning and data augmentation can make the use of CNNs to classify species' vocalizations feasible even for small soundscape-based projects with many rare species.
arXiv Detail & Related papers (2021-11-29T21:34:57Z) - A deep neural network for multi-species fish detection using multiple
acoustic cameras [0.0]
We present a novel approach that takes advantage of both CNN (Convolutional Neural Network) and classical CV (Computer Vision) techniques.
The pipeline pre-treats the acoustic images to extract 2 features, in order to localise the signals and improve the detection performances.
The YOLOv3-based model was trained with data of fish from multiple species recorded by the two common acoustic cameras.
arXiv Detail & Related papers (2021-09-22T11:47:24Z) - A Lottery Ticket Hypothesis Framework for Low-Complexity Device-Robust
Neural Acoustic Scene Classification [78.04177357888284]
We propose a novel neural model compression strategy combining data augmentation, knowledge transfer, pruning, and quantization for device-robust acoustic scene classification (ASC)
We report an efficient joint framework for low-complexity multi-device ASC, called Acoustic Lottery.
arXiv Detail & Related papers (2021-07-03T16:25:24Z) - Cetacean Translation Initiative: a roadmap to deciphering the
communication of sperm whales [97.41394631426678]
Recent research showed the promise of machine learning tools for analyzing acoustic communication in nonhuman species.
We outline the key elements required for the collection and processing of massive bioacoustic data of sperm whales.
The technological capabilities developed are likely to yield cross-applications and advancements in broader communities investigating non-human communication and animal behavioral research.
arXiv Detail & Related papers (2021-04-17T18:39:22Z) - OutlierNets: Highly Compact Deep Autoencoder Network Architectures for
On-Device Acoustic Anomaly Detection [77.23388080452987]
Human operators often diagnose industrial machinery via anomalous sounds.
Deep learning-driven anomaly detection methods often require an extensive amount of computational resources which prohibits their deployment in factories.
Here we explore a machine-driven design exploration strategy to create OutlierNets, a family of highly compact deep convolutional autoencoder network architectures.
arXiv Detail & Related papers (2021-03-31T04:09:30Z) - Modelling Animal Biodiversity Using Acoustic Monitoring and Deep
Learning [0.0]
This paper outlines an approach for achieving this using state of the art in machine learning to automatically extract features from time-series audio signals.
The acquired bird songs are processed using mel-frequency cepstrum (MFC) to extract features which are later classified using a multilayer perceptron (MLP)
Our proposed method achieved promising results with 0.74 sensitivity, 0.92 specificity and an accuracy of 0.74.
arXiv Detail & Related papers (2021-03-12T13:50:31Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.