Learning Visual Representation of Underwater Acoustic Imagery Using
Transformer-Based Style Transfer Method
- URL: http://arxiv.org/abs/2211.05396v1
- Date: Thu, 10 Nov 2022 07:54:46 GMT
- Title: Learning Visual Representation of Underwater Acoustic Imagery Using
Transformer-Based Style Transfer Method
- Authors: Xiaoteng Zhou, Changli Yu, Shihao Yuan, Xin Yuan, Hangchi Yu and
Citong Luo
- Abstract summary: This letter proposes a framework for learning the visual representation of underwater acoustic imageries.
It could replace the low-level texture features of optical images with the visual features of underwater acoustic imageries.
The proposed framework could fully use the rich optical image dataset to generate a pseudo-acoustic image dataset.
- Score: 4.885034271315195
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Underwater automatic target recognition (UATR) has been a challenging
research topic in ocean engineering. Although deep learning brings
opportunities for target recognition on land and in the air, underwater target
recognition techniques based on deep learning have lagged due to sensor
performance and the size of trainable data. This letter proposed a framework
for learning the visual representation of underwater acoustic imageries, which
takes a transformer-based style transfer model as the main body. It could
replace the low-level texture features of optical images with the visual
features of underwater acoustic imageries while preserving their raw high-level
semantic content. The proposed framework could fully use the rich optical image
dataset to generate a pseudo-acoustic image dataset and use it as the initial
sample to train the underwater acoustic target recognition model. The
experiments select the dual-frequency identification sonar (DIDSON) as the
underwater acoustic data source and also take fish, the most common marine
creature, as the research subject. Experimental results show that the proposed
method could generate high-quality and high-fidelity pseudo-acoustic samples,
achieve the purpose of acoustic data enhancement and provide support for the
underwater acoustic-optical images domain transfer research.
Related papers
- Real Acoustic Fields: An Audio-Visual Room Acoustics Dataset and Benchmark [65.79402756995084]
Real Acoustic Fields (RAF) is a new dataset that captures real acoustic room data from multiple modalities.
RAF is the first dataset to provide densely captured room acoustic data.
arXiv Detail & Related papers (2024-03-27T17:59:56Z) - Learning Heavily-Degraded Prior for Underwater Object Detection [59.5084433933765]
This paper seeks transferable prior knowledge from detector-friendly images.
It is based on statistical observations that, the heavily degraded regions of detector-friendly (DFUI) and underwater images have evident feature distribution gaps.
Our method with higher speeds and less parameters still performs better than transformer-based detectors.
arXiv Detail & Related papers (2023-08-24T12:32:46Z) - Histogram Layer Time Delay Neural Networks for Passive Sonar
Classification [58.720142291102135]
A novel method combines a time delay neural network and histogram layer to incorporate statistical contexts for improved feature learning and underwater acoustic target classification.
The proposed method outperforms the baseline model, demonstrating the utility in incorporating statistical contexts for passive sonar target recognition.
arXiv Detail & Related papers (2023-07-25T19:47:26Z) - PUGAN: Physical Model-Guided Underwater Image Enhancement Using GAN with
Dual-Discriminators [120.06891448820447]
How to obtain clear and visually pleasant images has become a common concern of people.
The task of underwater image enhancement (UIE) has also emerged as the times require.
In this paper, we propose a physical model-guided GAN model for UIE, referred to as PUGAN.
Our PUGAN outperforms state-of-the-art methods in both qualitative and quantitative metrics.
arXiv Detail & Related papers (2023-06-15T07:41:12Z) - A Dataset with Multibeam Forward-Looking Sonar for Underwater Object
Detection [0.0]
Multibeam forward-looking sonar (MFLS) plays an important role in underwater detection.
There are several challenges to the research on underwater object detection with MFLS.
We present a novel dataset, consisting of over 9000 MFLS images captured using Tritech Gemini 1200ik sonar.
arXiv Detail & Related papers (2022-12-01T08:26:03Z) - Semantic-aware Texture-Structure Feature Collaboration for Underwater
Image Enhancement [58.075720488942125]
Underwater image enhancement has become an attractive topic as a significant technology in marine engineering and aquatic robotics.
We develop an efficient and compact enhancement network in collaboration with a high-level semantic-aware pretrained model.
We also apply the proposed algorithm to the underwater salient object detection task to reveal the favorable semantic-aware ability for high-level vision tasks.
arXiv Detail & Related papers (2022-11-19T07:50:34Z) - Synthetic Sonar Image Simulation with Various Seabed Conditions for
Automatic Target Recognition [1.179296191012968]
We propose a novel method to generate underwater object imagery that is acoustically compliant with that generated by side-scan sonar using the Unreal Engine.
We describe the process to develop, tune, and generate imagery to provide representative images for use in training automated target recognition (ATR) and machine learning algorithms.
arXiv Detail & Related papers (2022-10-19T03:08:02Z) - Domain Adaptation for Underwater Image Enhancement via Content and Style
Separation [7.077978580799124]
Underwater image suffer from color cast, low contrast and hazy effect due to light absorption, refraction and scattering.
Recent learning-based methods demonstrate astonishing performance on underwater image enhancement.
We propose a domain adaptation framework for underwater image enhancement via content and style separation.
arXiv Detail & Related papers (2022-02-17T09:30:29Z) - Perceptual underwater image enhancement with deep learning and physical
priors [35.37760003463292]
We propose two perceptual enhancement models, each of which uses a deep enhancement model with a detection perceptor.
Due to the lack of training data, a hybrid underwater image synthesis model, which fuses physical priors and data-driven cues, is proposed to synthesize training data.
Experimental results show the superiority of our proposed method over several state-of-the-art methods on both real-world and synthetic underwater datasets.
arXiv Detail & Related papers (2020-08-21T22:11:34Z) - Cross-Task Transfer for Geotagged Audiovisual Aerial Scene Recognition [61.54648991466747]
We explore an audiovisual aerial scene recognition task using both images and sounds as input.
We show the benefit of exploiting the audio information for the aerial scene recognition.
arXiv Detail & Related papers (2020-05-18T04:14:16Z) - Domain Adaptive Adversarial Learning Based on Physics Model Feedback for
Underwater Image Enhancement [10.143025577499039]
We propose a new robust adversarial learning framework via physics model based feedback control and domain adaptation mechanism for enhancing underwater images.
A new method for simulating underwater-like training dataset from RGB-D data by underwater image formation model is proposed.
Final enhanced results on synthetic and real underwater images demonstrate the superiority of the proposed method.
arXiv Detail & Related papers (2020-02-20T07:50:00Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.