Underwater Object Classification and Detection: first results and open
challenges
- URL: http://arxiv.org/abs/2201.00977v1
- Date: Tue, 4 Jan 2022 04:54:08 GMT
- Title: Underwater Object Classification and Detection: first results and open
challenges
- Authors: Andre Jesus, Claudio Zito, Claudio Tortorici, Eloy Roura, Giulia De
Masi
- Abstract summary: This work reviews the problem of object detection in underwater environments.
We analyse and quantify the shortcomings of conventional state-of-the-art (SOTA) algorithms.
- Score: 1.1549572298362782
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: This work reviews the problem of object detection in underwater environments.
We analyse and quantify the shortcomings of conventional state-of-the-art
(SOTA) algorithms in the computer vision community when applied to this
challenging environment, as well as providing insights and general guidelines
for future research efforts. First, we assessed if pretraining with the
conventional ImageNet is beneficial when the object detector needs to be
applied to environments that may be characterised by a different feature
distribution. We then investigate whether two-stage detectors yields to better
performance with respect to single-stage detectors, in terms of accuracy,
intersection of union (IoU), floating operation per second (FLOPS), and
inference time. Finally, we assessed the generalisation capability of each
model to a lower quality dataset to simulate performance on a real scenario, in
which harsher conditions ought to be expected. Our experimental results provide
evidence that underwater object detection requires searching for "ad-hoc"
architectures than merely training SOTA architectures on new data, and that
pretraining is not beneficial.
Related papers
- Underwater Object Detection in the Era of Artificial Intelligence: Current, Challenge, and Future [119.88454942558485]
Underwater object detection (UOD) aims to identify and localise objects in underwater images or videos.
In recent years, artificial intelligence (AI) based methods, especially deep learning methods, have shown promising performance in UOD.
arXiv Detail & Related papers (2024-10-08T00:25:33Z) - Small Object Detection via Coarse-to-fine Proposal Generation and
Imitation Learning [52.06176253457522]
We propose a two-stage framework tailored for small object detection based on the Coarse-to-fine pipeline and Feature Imitation learning.
CFINet achieves state-of-the-art performance on the large-scale small object detection benchmarks, SODA-D and SODA-A.
arXiv Detail & Related papers (2023-08-18T13:13:09Z) - Evaluation of Environmental Conditions on Object Detection using Oriented Bounding Boxes for AR Applications [7.022872089444935]
Scene analysis and object recognition play a crucial role in augmented reality (AR)
New approach is proposed that involves using oriented bounding boxes with a detection and recognition deep network to improve performance and processing time.
Results indicate that the proposed approach tends to produce better Average Precision and greater accuracy for small objects in most of the tested conditions.
arXiv Detail & Related papers (2023-06-29T09:17:58Z) - Boosting R-CNN: Reweighting R-CNN Samples by RPN's Error for Underwater
Object Detection [12.291063755824585]
We propose a two-stage underwater detector named boosting R-CNN, which comprises three key components.
First, a new region proposal network named RetinaRPN is proposed, which provides high-quality proposals.
Second, the probabilistic inference pipeline is introduced to combine the first-stage prior uncertainty and the second-stage classification score.
arXiv Detail & Related papers (2022-06-28T03:29:20Z) - Interactron: Embodied Adaptive Object Detection [18.644357684104662]
We propose Interactron, a method for adaptive object detection in an interactive setting.
Our idea is to continue training during inference and adapt the model at test time without any explicit supervision via interacting with the environment.
arXiv Detail & Related papers (2022-02-01T18:56:14Z) - Unsupervised Scale-consistent Depth Learning from Video [131.3074342883371]
We propose a monocular depth estimator SC-Depth, which requires only unlabelled videos for training.
Thanks to the capability of scale-consistent prediction, we show that our monocular-trained deep networks are readily integrated into the ORB-SLAM2 system.
The proposed hybrid Pseudo-RGBD SLAM shows compelling results in KITTI, and it generalizes well to the KAIST dataset without additional training.
arXiv Detail & Related papers (2021-05-25T02:17:56Z) - Cycle and Semantic Consistent Adversarial Domain Adaptation for Reducing
Simulation-to-Real Domain Shift in LiDAR Bird's Eye View [110.83289076967895]
We present a BEV domain adaptation method based on CycleGAN that uses prior semantic classification in order to preserve the information of small objects of interest during the domain adaptation process.
The quality of the generated BEVs has been evaluated using a state-of-the-art 3D object detection framework at KITTI 3D Object Detection Benchmark.
arXiv Detail & Related papers (2021-04-22T12:47:37Z) - Unsupervised Domain Adaptation for Spatio-Temporal Action Localization [69.12982544509427]
S-temporal action localization is an important problem in computer vision.
We propose an end-to-end unsupervised domain adaptation algorithm.
We show that significant performance gain can be achieved when spatial and temporal features are adapted separately or jointly.
arXiv Detail & Related papers (2020-10-19T04:25:10Z) - Underwater object detection using Invert Multi-Class Adaboost with deep
learning [37.14538666012363]
We propose a novel neural network architecture, namely Sample-WeIghted hyPEr Network (SWIPENet), for small object detection.
We show that the proposed SWIPENet+IMA framework achieves better performance in detection accuracy against several state-of-the-art object detection approaches.
arXiv Detail & Related papers (2020-05-23T15:30:38Z) - One-Shot Object Detection without Fine-Tuning [62.39210447209698]
We introduce a two-stage model consisting of a first stage Matching-FCOS network and a second stage Structure-Aware Relation Module.
We also propose novel training strategies that effectively improve detection performance.
Our method exceeds the state-of-the-art one-shot performance consistently on multiple datasets.
arXiv Detail & Related papers (2020-05-08T01:59:23Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.