Related papers: Counting Fish and Dolphins in Sonar Images Using Deep Learning

Counting Fish and Dolphins in Sonar Images Using Deep Learning

URL: http://arxiv.org/abs/2007.12808v1
Date: Fri, 24 Jul 2020 23:52:03 GMT
Title: Counting Fish and Dolphins in Sonar Images Using Deep Learning
Authors: Stefan Schneider and Alex Zhuang
Abstract summary: Current methods of fish and dolphin abundance estimates are performed by on-site sampling using visual and capture/release strategies. We propose a novel approach to calculating fish abundance using deep learning for fish and dolphin estimates from sonar images taken from the back of a trolling boat.
Score: 0.40611352512781856
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Deep learning provides the opportunity to improve upon conflicting reports considering the relationship between the Amazon river's fish and dolphin abundance and reduced canopy cover as a result of deforestation. Current methods of fish and dolphin abundance estimates are performed by on-site sampling using visual and capture/release strategies. We propose a novel approach to calculating fish abundance using deep learning for fish and dolphin estimates from sonar images taken from the back of a trolling boat. We consider a data set of 143 images ranging from 0-34 fish, and 0-3 dolphins provided by the Fund Amazonia research group. To overcome the data limitation, we test the capabilities of data augmentation on an unconventional 15/85 training/testing split. Using 20 training images, we simulate a gradient of data up to 25,000 images using augmented backgrounds and randomly placed/rotation cropped fish and dolphin taken from the training set. We then train four multitask network architectures: DenseNet201, InceptionNetV2, Xception, and MobileNetV2 to predict fish and dolphin numbers using two function approximation methods: regression and classification. For regression, Densenet201 performed best for fish and Xception best for dolphin with mean squared errors of 2.11 and 0.133 respectively. For classification, InceptionResNetV2 performed best for fish and MobileNetV2 best for dolphins with a mean error of 2.07 and 0.245 respectively. Considering the 123 testing images, our results show the success of data simulation for limited sonar data sets. We find DenseNet201 is able to identify dolphins after approximately 5000 training images, while fish required the full 25,000. Our method can be used to lower costs and expedite the data analysis of fish and dolphin abundance to real-time along the Amazon river and river systems worldwide.

Related papers

Counting Fish with Temporal Representations of Sonar Video [15.713015426791221]
We propose an alternative lightweight computer vision method for fish counting based on analyzing echograms. We achieve a count error of 23% on representative data from the Kenai River in Alaska, demonstrating the feasibility of our approach.
arXiv Detail & Related papers (2025-02-07T18:02:28Z)
Dolphin: Moving Towards Closed-loop Auto-research through Thinking, Practice, and Feedback [69.57617563853822]
Dolphin is a framework to enhance the automation level of scientific research. Dolphin first generates novel ideas based on feedback from previous experiments. Dolphin automatically analyzes the results of each idea and feeds the results back to the next round of idea generation.
arXiv Detail & Related papers (2025-01-07T16:31:10Z)
AutoFish: Dataset and Benchmark for Fine-grained Analysis of Fish [19.025566399187547]
The dataset comprises 1,500 images of 454 specimens of visually similar fish placed in various constellations on a white conveyor belt. The data was collected in a controlled environment using an RGB camera. We establish baseline instance segmentation results using two variations of the Mask2Former architecture.
arXiv Detail & Related papers (2025-01-07T13:14:25Z)
FishNet: Deep Neural Networks for Low-Cost Fish Stock Estimation [0.0]
FishNet is an automated computer vision system for both taxonomic classification and fish size estimation. We use a dataset of 300,000 hand-labeled images containing 1.2M fish of 163 different species. FishNet achieves a 92% intersection over union on the fish segmentation task, a 89% top-1 classification accuracy on single fish species classification, and a 2.3cm mean absolute error on the fish length estimation task.
arXiv Detail & Related papers (2024-03-16T12:44:08Z)
Improving Underwater Visual Tracking With a Large Scale Dataset and Image Enhancement [70.2429155741593]
This paper presents a new dataset and general tracker enhancement method for Underwater Visual Object Tracking (UVOT) It poses distinct challenges; the underwater environment exhibits non-uniform lighting conditions, low visibility, lack of sharpness, low contrast, camouflage, and reflections from suspended particles. We propose a novel underwater image enhancement algorithm designed specifically to boost tracking quality. The method has resulted in a significant performance improvement, of up to 5.0% AUC, of state-of-the-art (SOTA) visual trackers.
arXiv Detail & Related papers (2023-08-30T07:41:26Z)
Whale Detection Enhancement through Synthetic Satellite Images [13.842008598751445]
We show that we can achieve a 15% performance boost on whale detection compared to using the real data alone for training. We open source the code of the simulation platform SeaDroneSim2 and the dataset generated through it.
arXiv Detail & Related papers (2023-08-15T13:35:29Z)
Delving Deeper into Data Scaling in Masked Image Modeling [145.36501330782357]
We conduct an empirical study on the scaling capability of masked image modeling (MIM) methods for visual recognition. Specifically, we utilize the web-collected Coyo-700M dataset. Our goal is to investigate how the performance changes on downstream tasks when scaling with different sizes of data and models.
arXiv Detail & Related papers (2023-05-24T15:33:46Z)
Leveraging the Third Dimension in Contrastive Learning [88.17394309208925]
Self-Supervised Learning (SSL) methods operate on unlabeled data to learn robust representations useful for downstream tasks. These augmentations ignore the fact that biological vision takes place in an immersive three-dimensional, temporally contiguous environment. We explore two distinct approaches to incorporating depth signals into the SSL framework.
arXiv Detail & Related papers (2023-01-27T15:45:03Z)
TempNet: Temporal Attention Towards the Detection of Animal Behaviour in Videos [63.85815474157357]
We propose an efficient computer vision- and deep learning-based method for the detection of biological behaviours in videos. TempNet uses an encoder bridge and residual blocks to maintain model performance with a two-staged, spatial, then temporal, encoder. We demonstrate its application to the detection of sablefish (Anoplopoma fimbria) startle events.
arXiv Detail & Related papers (2022-11-17T23:55:12Z)
Portuguese Man-of-War Image Classification with Convolutional Neural Networks [58.720142291102135]
Portuguese man-of-war (PMW) is a gelatinous organism with long tentacles capable of causing severe burns. This paper reports on the use of convolutional neural networks for recognizing PMW images from the Instagram social media.
arXiv Detail & Related papers (2022-07-04T03:06:45Z)
FishNet: A Unified Embedding for Salmon Recognition [0.37798600249187286]
We propose FishNet, based on a deep learning technique that has been successfully used for identifying humans. Our experiments show that this architecture learns a useful representation based on images of salmon heads. FishNet achieves a false positive rate of 1% and a true positive rate of 96%.
arXiv Detail & Related papers (2020-10-20T17:35:01Z)
A Realistic Fish-Habitat Dataset to Evaluate Algorithms for Underwater Visual Analysis [2.6476746128312194]
We present DeepFish as a benchmark suite with a large-scale dataset to train and test methods for several computer vision tasks. The dataset consists of approximately 40 thousand images collected underwater from 20 greenhabitats in the marine-environments of tropical Australia. Our experiments provide an in-depth analysis of the dataset characteristics, and the performance evaluation of several state-of-the-art approaches.
arXiv Detail & Related papers (2020-08-28T12:20:59Z)
Temperate Fish Detection and Classification: a Deep Learning based Approach [6.282069822653608]
We propose a two-step deep learning approach for the detection and classification of temperate fishes without pre-filtering. The first step is to detect each single fish in an image, independent of species and sex. In the second step, we adopt a Convolutional Neural Network (CNN) with the Squeeze-and-Excitation (SE) architecture for classifying each fish in the image without pre-filtering.
arXiv Detail & Related papers (2020-05-14T12:40:57Z)
Improved Residual Networks for Image and Video Recognition [98.10703825716142]
Residual networks (ResNets) represent a powerful type of convolutional neural network (CNN) architecture. We show consistent improvements in accuracy and learning convergence over the baseline. Our proposed approach allows us to train extremely deep networks, while the baseline shows severe optimization issues.
arXiv Detail & Related papers (2020-04-10T11:09:50Z)

This list is automatically generated from the titles and abstracts of the papers in this site.