A deep neural network for multi-species fish detection using multiple
acoustic cameras
- URL: http://arxiv.org/abs/2109.10664v1
- Date: Wed, 22 Sep 2021 11:47:24 GMT
- Title: A deep neural network for multi-species fish detection using multiple
acoustic cameras
- Authors: Garcia Fernandez, Guglielmo Fernandez, Fran\c{c}ois Martignac, Marie
Nevoux, Laurent Beaulaton (OFB), Thomas Corpetti (LETG - Rennes)
- Abstract summary: We present a novel approach that takes advantage of both CNN (Convolutional Neural Network) and classical CV (Computer Vision) techniques.
The pipeline pre-treats the acoustic images to extract 2 features, in order to localise the signals and improve the detection performances.
The YOLOv3-based model was trained with data of fish from multiple species recorded by the two common acoustic cameras.
- Score: 0.0
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Underwater acoustic cameras are high potential devices for many applications
in ecology, notably for fisheries management and monitoring. However how to
extract such data into high value information without a time-consuming entire
dataset reading by an operator is still a challenge. Moreover the analysis of
acoustic imaging, due to its low signal-to-noise ratio, is a perfect training
ground for experimenting with new approaches, especially concerning Deep
Learning techniques. We present hereby a novel approach that takes advantage of
both CNN (Convolutional Neural Network) and classical CV (Computer Vision)
techniques, able to detect a generic class ''fish'' in acoustic video streams.
The pipeline pre-treats the acoustic images to extract 2 features, in order to
localise the signals and improve the detection performances. To ensure the
performances from an ecological point of view, we propose also a two-step
validation, one to validate the results of the trainings and one to test the
method on a real-world scenario. The YOLOv3-based model was trained with data
of fish from multiple species recorded by the two common acoustic cameras,
DIDSON and ARIS, including species of high ecological interest, as Atlantic
salmon or European eels. The model we developed provides satisfying results
detecting almost 80% of fish and minimizing the false positive rate, however
the model is much less efficient for eel detections on ARIS videos. The first
CNN pipeline for fish monitoring exploiting video data from two models of
acoustic cameras satisfies most of the required features. Many challenges are
still present, such as the automation of fish species identification through a
multiclass model. 1 However the results point a new solution for dealing with
complex data, such as sonar data, which can also be reapplied in other cases
where the signal-to-noise ratio is a challenge.
Related papers
- TinyChirp: Bird Song Recognition Using TinyML Models on Low-power Wireless Acoustic Sensors [1.0790796076947324]
Monitoring biodiversity at scale is challenging.
detecting and identifying species in fine grained requires highly accurate machine learning (ML) methods.
deploying these models to low power devices requires novel compression techniques and model architectures.
arXiv Detail & Related papers (2024-07-31T08:57:42Z) - Real Acoustic Fields: An Audio-Visual Room Acoustics Dataset and Benchmark [65.79402756995084]
Real Acoustic Fields (RAF) is a new dataset that captures real acoustic room data from multiple modalities.
RAF is the first dataset to provide densely captured room acoustic data.
arXiv Detail & Related papers (2024-03-27T17:59:56Z) - Histogram Layer Time Delay Neural Networks for Passive Sonar
Classification [58.720142291102135]
A novel method combines a time delay neural network and histogram layer to incorporate statistical contexts for improved feature learning and underwater acoustic target classification.
The proposed method outperforms the baseline model, demonstrating the utility in incorporating statistical contexts for passive sonar target recognition.
arXiv Detail & Related papers (2023-07-25T19:47:26Z) - TempNet: Temporal Attention Towards the Detection of Animal Behaviour in
Videos [63.85815474157357]
We propose an efficient computer vision- and deep learning-based method for the detection of biological behaviours in videos.
TempNet uses an encoder bridge and residual blocks to maintain model performance with a two-staged, spatial, then temporal, encoder.
We demonstrate its application to the detection of sablefish (Anoplopoma fimbria) startle events.
arXiv Detail & Related papers (2022-11-17T23:55:12Z) - Decision Forest Based EMG Signal Classification with Low Volume Dataset
Augmented with Random Variance Gaussian Noise [51.76329821186873]
We produce a model that can classify six different hand gestures with a limited number of samples that generalizes well to a wider audience.
We appeal to a set of more elementary methods such as the use of random bounds on a signal, but desire to show the power these methods can carry in an online setting.
arXiv Detail & Related papers (2022-06-29T23:22:18Z) - Neurosymbolic hybrid approach to driver collision warning [64.02492460600905]
There are two main algorithmic approaches to autonomous driving systems.
Deep learning alone has achieved state-of-the-art results in many areas.
But sometimes it can be very difficult to debug if the deep learning model doesn't work.
arXiv Detail & Related papers (2022-03-28T20:29:50Z) - Depth Guided Adaptive Meta-Fusion Network for Few-shot Video Recognition [86.31412529187243]
Few-shot video recognition aims at learning new actions with only very few labeled samples.
We propose a depth guided Adaptive Meta-Fusion Network for few-shot video recognition which is termed as AMeFu-Net.
arXiv Detail & Related papers (2020-10-20T03:06:20Z) - SWIPENET: Object detection in noisy underwater images [41.35601054297707]
We propose a novel Sample-WeIghted hyPEr Network (SWIPENET), and a robust training paradigm named Curriculum Multi-Class Adaboost (CMA) to address these two problems.
The backbone of SWIPENET produces multiple high resolution and semantic-rich Hyper Feature Maps, which significantly improve small object detection.
Inspired by the human education process that drives the learning from easy to hard concepts, we here propose the CMA training paradigm that first trains a clean detector which is free from the influence of noisy data.
arXiv Detail & Related papers (2020-10-19T16:41:20Z) - Image-based Automated Species Identification: Can Virtual Data
Augmentation Overcome Problems of Insufficient Sampling? [0.0]
We present a two-level data augmentation approach to automated visual species identification.
The first level of data augmentation applies classic approaches of data augmentation and generation of faked images.
The second level of data augmentation employs synthetic additional sampling in feature space by an oversampling algorithm in vector space.
arXiv Detail & Related papers (2020-10-18T15:44:45Z) - Temperate Fish Detection and Classification: a Deep Learning based
Approach [6.282069822653608]
We propose a two-step deep learning approach for the detection and classification of temperate fishes without pre-filtering.
The first step is to detect each single fish in an image, independent of species and sex.
In the second step, we adopt a Convolutional Neural Network (CNN) with the Squeeze-and-Excitation (SE) architecture for classifying each fish in the image without pre-filtering.
arXiv Detail & Related papers (2020-05-14T12:40:57Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.