Pre-trained Models for Sonar Images
- URL: http://arxiv.org/abs/2108.01111v1
- Date: Mon, 2 Aug 2021 18:17:40 GMT
- Title: Pre-trained Models for Sonar Images
- Authors: Matias Valdenegro-Toro and Alan Preciado-Grijalva and Bilal Wehbe
- Abstract summary: We present the Marine Debris Turntable dataset and produce pre-trained neural networks trained on this dataset.
We train Resnet 20, MobileNets, DenseNet121, SqueezeNet, MiniXception, and an Autoencoder, over several input image sizes.
Our results show that on both evaluation datasets (Marine Debris Watertank and a dataset captured with a Gemini 720i sonar) the pre-trained models produce features that allow good classification accuracy with few samples (10-30 per class).
- Score: 1.7403133838762446
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Machine learning and neural networks are now ubiquitous in sonar perception,
but it lags behind the computer vision field due to the lack of data and
pre-trained models specifically for sonar images. In this paper we present the
Marine Debris Turntable dataset and produce pre-trained neural networks trained
on this dataset, meant to fill the gap of missing pre-trained models for sonar
images. We train Resnet 20, MobileNets, DenseNet121, SqueezeNet, MiniXception,
and an Autoencoder, over several input image sizes, from 32 x 32 to 96 x 96, on
the Marine Debris turntable dataset. We evaluate these models using transfer
learning for low-shot classification in the Marine Debris Watertank and another
dataset captured using a Gemini 720i sonar. Our results show that in both
datasets the pre-trained models produce good features that allow good
classification accuracy with low samples (10-30 samples per class). The Gemini
dataset validates that the features transfer to other kinds of sonar sensors.
We expect that the community benefits from the public release of our
pre-trained models and the turntable dataset.
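The low-shot evaluation described in the abstract (a few labeled samples per class, classified from features of a frozen pre-trained model) can be illustrated with a minimal sketch. Everything here is an assumption for illustration: the features are synthetic stand-ins for encoder outputs, the function names are hypothetical, and a nearest-centroid classifier is swapped in as a simple choice; the abstract does not state which classifier the authors use.

```python
import numpy as np


def sample_few_shot(labels, shots_per_class, rng):
    """Pick `shots_per_class` random sample indices for every class."""
    picked = []
    for cls in np.unique(labels):
        candidates = np.flatnonzero(labels == cls)
        picked.extend(rng.choice(candidates, size=shots_per_class, replace=False))
    return np.array(picked)


def fit_centroids(features, labels):
    """Compute one mean feature vector (centroid) per class."""
    classes = np.unique(labels)
    centroids = np.stack([features[labels == c].mean(axis=0) for c in classes])
    return classes, centroids


def predict(features, classes, centroids):
    """Assign each sample to the class of its nearest centroid."""
    # Squared Euclidean distance from every sample to every centroid.
    dists = ((features[:, None, :] - centroids[None, :, :]) ** 2).sum(axis=2)
    return classes[dists.argmin(axis=1)]


# Synthetic stand-in for features from a frozen pre-trained encoder (assumption).
rng = np.random.default_rng(0)
num_classes, per_class, dim = 5, 100, 32
class_means = rng.normal(scale=5.0, size=(num_classes, dim))
labels = np.repeat(np.arange(num_classes), per_class)
features = class_means[labels] + rng.normal(size=(labels.size, dim))

# Low-shot split: 10 labeled samples per class for fitting, the rest for testing.
train_idx = sample_few_shot(labels, shots_per_class=10, rng=rng)
test_mask = np.ones(labels.size, dtype=bool)
test_mask[train_idx] = False

classes, centroids = fit_centroids(features[train_idx], labels[train_idx])
predictions = predict(features[test_mask], classes, centroids)
accuracy = (predictions == labels[test_mask]).mean()
print(f"accuracy with 10 samples per class: {accuracy:.3f}")
```

With real data, `features` would come from running sonar images through one of the released pre-trained models, and `shots_per_class` could be varied over the 10-30 range mentioned in the abstract.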
Related papers
- Comparison of self-supervised in-domain and supervised out-domain transfer learning for bird species recognition [0.19183348587701113]
Transferring the weights of a pre-trained model to assist another task has become a crucial part of modern deep learning.
Our experiments will demonstrate the usefulness of in-domain models and datasets for bird species recognition.
arXiv Detail & Related papers (2024-04-26T08:47:28Z)
- Learning Defect Prediction from Unrealistic Data [57.53586547895278]
Pretrained models of code have become popular choices for code understanding and generation tasks.
Such models tend to be large and require commensurate volumes of training data.
It has become popular to train models with far larger but less realistic datasets, such as functions with artificially injected bugs.
Models trained on such data tend to only perform well on similar data, while underperforming on real world programs.
arXiv Detail & Related papers (2023-11-02T01:51:43Z)
- Convolutional Neural Networks for the classification of glitches in gravitational-wave data streams [52.77024349608834]
We classify transient noise signals (i.e., glitches) and gravitational waves in data from the Advanced LIGO detectors.
We use models with a supervised learning approach, both trained from scratch using the Gravity Spy dataset.
We also explore a self-supervised approach, pre-training models with automatically generated pseudo-labels.
arXiv Detail & Related papers (2023-03-24T11:12:37Z)
- Core Risk Minimization using Salient ImageNet [53.616101711801484]
We introduce the Salient ImageNet dataset with more than 1 million soft masks localizing core and spurious features for all 1000 ImageNet classes.
Using this dataset, we first evaluate the reliance of several ImageNet pretrained models (42 total) on spurious features.
Next, we introduce a new learning paradigm called Core Risk Minimization (CoRM) whose objective ensures that the model predicts a class using its core features.
arXiv Detail & Related papers (2022-03-28T01:53:34Z)
- Hand Pose Classification Based on Neural Networks [0.0]
This work demonstrates transfer-learning-based classification of whether a scene contains one hand, two hands, or no hand.
The simplest pre-trained neural network from the Keras library is used to train the network on 704 images of hand gestures.
arXiv Detail & Related papers (2021-08-10T09:14:51Z)
- Self-supervised Audiovisual Representation Learning for Remote Sensing Data [96.23611272637943]
We propose a self-supervised approach for pre-training deep neural networks in remote sensing.
By exploiting the correspondence between geo-tagged audio recordings and remote sensing imagery, this is done in a completely label-free manner.
We show that our approach outperforms existing pre-training strategies for remote sensing imagery.
arXiv Detail & Related papers (2021-08-02T07:50:50Z)
- The Lottery Tickets Hypothesis for Supervised and Self-supervised Pre-training in Computer Vision Models [115.49214555402567]
Pre-trained weights often boost a wide range of downstream tasks including classification, detection, and segmentation.
Recent studies suggest that pre-training benefits from gigantic model capacity.
In this paper, we examine supervised and self-supervised pre-trained models through the lens of the lottery ticket hypothesis (LTH).
arXiv Detail & Related papers (2020-12-12T21:53:55Z)
- Application of Facial Recognition using Convolutional Neural Networks for Entry Access Control [0.0]
The paper focuses on solving the supervised classification problem of taking images of people as input and classifying the person in the image as one of the authors or not.
Two approaches are proposed: (1) building and training a neural network called WoodNet from scratch and (2) leveraging transfer learning by utilizing a network pre-trained on the ImageNet database.
The results are two models classifying the individuals in the dataset with high accuracy, achieving over 99% accuracy on held-out test data.
arXiv Detail & Related papers (2020-11-23T07:55:24Z)
- Rethinking CNN Models for Audio Classification [20.182928938110923]
ImageNet-Pretrained standard deep CNN models can be used as strong baseline networks for audio classification.
We systematically study how much of pretrained weights is useful for learning spectrograms.
We show that, for a given standard model, using pretrained weights is better than using randomly initialized weights.
arXiv Detail & Related papers (2020-07-22T01:31:44Z)
- Omni-sourced Webly-supervised Learning for Video Recognition [74.3637061856504]
We introduce OmniSource, a framework for leveraging web data to train video recognition models.
Experiments show that by utilizing data from multiple sources and formats, OmniSource is more data-efficient in training.
arXiv Detail & Related papers (2020-03-29T14:47:31Z)
This list is automatically generated from the titles and abstracts of the papers in this site.