Counting Fish and Dolphins in Sonar Images Using Deep Learning
- URL: http://arxiv.org/abs/2007.12808v1
- Date: Fri, 24 Jul 2020 23:52:03 GMT
- Title: Counting Fish and Dolphins in Sonar Images Using Deep Learning
- Authors: Stefan Schneider and Alex Zhuang
- Abstract summary: Current methods of fish and dolphin abundance estimates are performed by on-site sampling using visual and capture/release strategies.
We propose a novel approach to calculating fish abundance using deep learning for fish and dolphin estimates from sonar images taken from the back of a trolling boat.
- Score: 0.40611352512781856
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Deep learning provides the opportunity to improve upon conflicting reports
considering the relationship between the Amazon river's fish and dolphin
abundance and reduced canopy cover as a result of deforestation. Current
methods of fish and dolphin abundance estimates are performed by on-site
sampling using visual and capture/release strategies. We propose a novel
approach to calculating fish abundance using deep learning for fish and dolphin
estimates from sonar images taken from the back of a trolling boat. We consider
a data set of 143 images ranging from 0-34 fish, and 0-3 dolphins provided by
the Fund Amazonia research group. To overcome the data limitation, we test the
capabilities of data augmentation on an unconventional 15/85 training/testing
split. Using 20 training images, we simulate a gradient of data up to 25,000
images using augmented backgrounds and randomly placed/rotation cropped fish
and dolphin taken from the training set. We then train four multitask network
architectures: DenseNet201, InceptionNetV2, Xception, and MobileNetV2 to
predict fish and dolphin numbers using two function approximation methods:
regression and classification. For regression, Densenet201 performed best for
fish and Xception best for dolphin with mean squared errors of 2.11 and 0.133
respectively. For classification, InceptionResNetV2 performed best for fish and
MobileNetV2 best for dolphins with a mean error of 2.07 and 0.245 respectively.
Considering the 123 testing images, our results show the success of data
simulation for limited sonar data sets. We find DenseNet201 is able to identify
dolphins after approximately 5000 training images, while fish required the full
25,000. Our method can be used to lower costs and expedite the data analysis of
fish and dolphin abundance to real-time along the Amazon river and river
systems worldwide.
Related papers
- FishNet: Deep Neural Networks for Low-Cost Fish Stock Estimation [0.0]
FishNet is an automated computer vision system for both taxonomic classification and fish size estimation.
We use a dataset of 300,000 hand-labeled images containing 1.2M fish of 163 different species.
FishNet achieves a 92% intersection over union on the fish segmentation task, a 89% top-1 classification accuracy on single fish species classification, and a 2.3cm mean absolute error on the fish length estimation task.
arXiv Detail & Related papers (2024-03-16T12:44:08Z) - Improving Underwater Visual Tracking With a Large Scale Dataset and
Image Enhancement [70.2429155741593]
This paper presents a new dataset and general tracker enhancement method for Underwater Visual Object Tracking (UVOT)
It poses distinct challenges; the underwater environment exhibits non-uniform lighting conditions, low visibility, lack of sharpness, low contrast, camouflage, and reflections from suspended particles.
We propose a novel underwater image enhancement algorithm designed specifically to boost tracking quality.
The method has resulted in a significant performance improvement, of up to 5.0% AUC, of state-of-the-art (SOTA) visual trackers.
arXiv Detail & Related papers (2023-08-30T07:41:26Z) - Whale Detection Enhancement through Synthetic Satellite Images [13.842008598751445]
We show that we can achieve a 15% performance boost on whale detection compared to using the real data alone for training.
We open source the code of the simulation platform SeaDroneSim2 and the dataset generated through it.
arXiv Detail & Related papers (2023-08-15T13:35:29Z) - Delving Deeper into Data Scaling in Masked Image Modeling [145.36501330782357]
We conduct an empirical study on the scaling capability of masked image modeling (MIM) methods for visual recognition.
Specifically, we utilize the web-collected Coyo-700M dataset.
Our goal is to investigate how the performance changes on downstream tasks when scaling with different sizes of data and models.
arXiv Detail & Related papers (2023-05-24T15:33:46Z) - Leveraging the Third Dimension in Contrastive Learning [88.17394309208925]
Self-Supervised Learning (SSL) methods operate on unlabeled data to learn robust representations useful for downstream tasks.
These augmentations ignore the fact that biological vision takes place in an immersive three-dimensional, temporally contiguous environment.
We explore two distinct approaches to incorporating depth signals into the SSL framework.
arXiv Detail & Related papers (2023-01-27T15:45:03Z) - TempNet: Temporal Attention Towards the Detection of Animal Behaviour in
Videos [63.85815474157357]
We propose an efficient computer vision- and deep learning-based method for the detection of biological behaviours in videos.
TempNet uses an encoder bridge and residual blocks to maintain model performance with a two-staged, spatial, then temporal, encoder.
We demonstrate its application to the detection of sablefish (Anoplopoma fimbria) startle events.
arXiv Detail & Related papers (2022-11-17T23:55:12Z) - Portuguese Man-of-War Image Classification with Convolutional Neural
Networks [58.720142291102135]
Portuguese man-of-war (PMW) is a gelatinous organism with long tentacles capable of causing severe burns.
This paper reports on the use of convolutional neural networks for recognizing PMW images from the Instagram social media.
arXiv Detail & Related papers (2022-07-04T03:06:45Z) - FishNet: A Unified Embedding for Salmon Recognition [0.37798600249187286]
We propose FishNet, based on a deep learning technique that has been successfully used for identifying humans.
Our experiments show that this architecture learns a useful representation based on images of salmon heads.
FishNet achieves a false positive rate of 1% and a true positive rate of 96%.
arXiv Detail & Related papers (2020-10-20T17:35:01Z) - A Realistic Fish-Habitat Dataset to Evaluate Algorithms for Underwater
Visual Analysis [2.6476746128312194]
We present DeepFish as a benchmark suite with a large-scale dataset to train and test methods for several computer vision tasks.
The dataset consists of approximately 40 thousand images collected underwater from 20 greenhabitats in the marine-environments of tropical Australia.
Our experiments provide an in-depth analysis of the dataset characteristics, and the performance evaluation of several state-of-the-art approaches.
arXiv Detail & Related papers (2020-08-28T12:20:59Z) - Temperate Fish Detection and Classification: a Deep Learning based
Approach [6.282069822653608]
We propose a two-step deep learning approach for the detection and classification of temperate fishes without pre-filtering.
The first step is to detect each single fish in an image, independent of species and sex.
In the second step, we adopt a Convolutional Neural Network (CNN) with the Squeeze-and-Excitation (SE) architecture for classifying each fish in the image without pre-filtering.
arXiv Detail & Related papers (2020-05-14T12:40:57Z) - Improved Residual Networks for Image and Video Recognition [98.10703825716142]
Residual networks (ResNets) represent a powerful type of convolutional neural network (CNN) architecture.
We show consistent improvements in accuracy and learning convergence over the baseline.
Our proposed approach allows us to train extremely deep networks, while the baseline shows severe optimization issues.
arXiv Detail & Related papers (2020-04-10T11:09:50Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.