Improving Generalization of Synthetically Trained Sonar Image
Descriptors for Underwater Place Recognition
- URL: http://arxiv.org/abs/2308.01058v2
- Date: Sun, 24 Sep 2023 17:22:24 GMT
- Title: Improving Generalization of Synthetically Trained Sonar Image
Descriptors for Underwater Place Recognition
- Authors: Ivano Donadi, Emilio Olivastri, Daniel Fusaro, Wanmeng Li, Daniele
Evangelista, and Alberto Pretto
- Abstract summary: Autonomous navigation in underwater environments presents challenges due to factors such as light absorption and water turbidity.
Traditional computer vision algorithms are less effective when applied to sonar-generated acoustic images.
We propose a novel compact deep sonar descriptor pipeline that can generalize to real scenarios while being trained exclusively on synthetic data.
- Score: 1.8911961520222997
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Autonomous navigation in underwater environments presents challenges due to
factors such as light absorption and water turbidity, limiting the
effectiveness of optical sensors. Sonar systems are commonly used for
perception in underwater operations as they are unaffected by these
limitations. Traditional computer vision algorithms are less effective when
applied to sonar-generated acoustic images, while convolutional neural networks
(CNNs) typically require large amounts of labeled training data that are often
unavailable or difficult to acquire. To this end, we propose a novel compact
deep sonar descriptor pipeline that can generalize to real scenarios while
being trained exclusively on synthetic data. Our architecture is based on a
ResNet18 back-end and a properly parameterized random Gaussian projection
layer, whereas input sonar data is enhanced with standard ad-hoc
normalization/prefiltering techniques. A customized synthetic data generation
procedure is also presented. The proposed method has been evaluated extensively
using both synthetic and publicly available real data, demonstrating its
effectiveness compared to state-of-the-art methods.
Related papers
- A Novel Score-CAM based Denoiser for Spectrographic Signature Extraction without Ground Truth [0.0]
This paper develops a novel Score-CAM based denoiser to extract an object's signature from noisy spectrographic data.
In particular, this paper proposes a novel generative adversarial network architecture for learning and producing spectrographic training data.
arXiv Detail & Related papers (2024-10-28T21:40:46Z) - Syn2Real Domain Generalization for Underwater Mine-like Object Detection Using Side-Scan Sonar [1.7851018240619703]
This paper proposes a Syn2Real (Synthetic to Real) domain generalization approach using diffusion models to address this challenge.
We demonstrate that synthetic data generated with noise by DDPM and DDIM models, even if not perfectly realistic, can effectively augment real-world samples for training.
arXiv Detail & Related papers (2024-10-16T18:42:08Z) - Real Acoustic Fields: An Audio-Visual Room Acoustics Dataset and Benchmark [65.79402756995084]
Real Acoustic Fields (RAF) is a new dataset that captures real acoustic room data from multiple modalities.
RAF is the first dataset to provide densely captured room acoustic data.
arXiv Detail & Related papers (2024-03-27T17:59:56Z) - RFTrans: Leveraging Refractive Flow of Transparent Objects for Surface
Normal Estimation and Manipulation [50.10282876199739]
This paper introduces RFTrans, an RGB-D-based method for surface normal estimation and manipulation of transparent objects.
It integrates the RFNet, which predicts refractive flow, object mask, and boundaries, followed by the F2Net, which estimates surface normal from the refractive flow.
A real-world robot grasping task witnesses an 83% success rate, proving that refractive flow can help enable direct sim-to-real transfer.
arXiv Detail & Related papers (2023-11-21T07:19:47Z) - SONIC: Sonar Image Correspondence using Pose Supervised Learning for Imaging Sonars [21.36164161799472]
We introduce SONIC, a pose-supervised network designed to yield robust feature correspondence capable of withstanding viewpoint variations.
Our method demonstrates significantly better performance in generating correspondences for sonar images.
arXiv Detail & Related papers (2023-10-23T15:21:46Z) - Histogram Layer Time Delay Neural Networks for Passive Sonar
Classification [58.720142291102135]
A novel method combines a time delay neural network and histogram layer to incorporate statistical contexts for improved feature learning and underwater acoustic target classification.
The proposed method outperforms the baseline model, demonstrating the utility in incorporating statistical contexts for passive sonar target recognition.
arXiv Detail & Related papers (2023-07-25T19:47:26Z) - Underwater Acoustic Target Recognition based on Smoothness-inducing Regularization and Spectrogram-based Data Augmentation [21.327653766608805]
Insufficient data can hinder the ability of recognition systems to support complex modeling.
We propose two strategies to enhance the generalization ability of models in the case of limited data.
arXiv Detail & Related papers (2023-06-12T08:26:47Z) - NAF: Neural Attenuation Fields for Sparse-View CBCT Reconstruction [79.13750275141139]
This paper proposes a novel and fast self-supervised solution for sparse-view CBCT reconstruction.
The desired attenuation coefficients are represented as a continuous function of 3D spatial coordinates, parameterized by a fully-connected deep neural network.
A learning-based encoder entailing hash coding is adopted to help the network capture high-frequency details.
arXiv Detail & Related papers (2022-09-29T04:06:00Z) - Single Underwater Image Enhancement Using an Analysis-Synthesis Network [21.866940227491146]
Most deep models for underwater image enhancement resort to training on synthetic datasets based on underwater image formation models.
A new underwater synthetic dataset is first established, in which a revised ambient light synthesis equation is embedded.
A unified framework, named ANA-SYN, can effectively enhance underwater images under collaborations of priors and data information.
arXiv Detail & Related papers (2021-08-20T06:29:12Z) - Learning Topology from Synthetic Data for Unsupervised Depth Completion [66.26787962258346]
We present a method for inferring dense depth maps from images and sparse depth measurements.
We learn the association of sparse point clouds with dense natural shapes, and using the image as evidence to validate the predicted depth map.
arXiv Detail & Related papers (2021-06-06T00:21:12Z) - Syn2Real Transfer Learning for Image Deraining using Gaussian Processes [92.15895515035795]
CNN-based methods for image deraining have achieved excellent performance in terms of reconstruction error as well as visual quality.
Due to challenges in obtaining real world fully-labeled image deraining datasets, existing methods are trained only on synthetically generated data.
We propose a Gaussian Process-based semi-supervised learning framework which enables the network in learning to derain using synthetic dataset.
arXiv Detail & Related papers (2020-06-10T00:33:18Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.