A Semi-supervised Object Detection Algorithm for Underwater Imagery
- URL: http://arxiv.org/abs/2306.04834v1
- Date: Wed, 7 Jun 2023 23:40:04 GMT
- Title: A Semi-supervised Object Detection Algorithm for Underwater Imagery
- Authors: Suraj Bijjahalli, Oscar Pizarro, and Stefan B. Williams
- Abstract summary: We propose to treat artificial objects as anomalies and detect them through a semi-supervised framework based on Variational Autoencoders (VAEs)
We develop a method which clusters image data in a learned low-dimensional latent space and extracts images that are likely to contain anomalous features.
We demonstrate that by applying both methods on large image datasets, human operators can be shown candidate anomalous samples with a low false positive rate to identify objects of interest.
- Score: 10.017195276758455
- License: http://creativecommons.org/licenses/by-nc-sa/4.0/
- Abstract: Detection of artificial objects from underwater imagery gathered by
Autonomous Underwater Vehicles (AUVs) is a key requirement for many subsea
applications. Real-world AUV image datasets tend to be very large and
unlabelled. Furthermore, such datasets are typically imbalanced, containing few
instances of objects of interest, particularly when searching for unusual
objects in a scene. It is therefore, difficult to fit models capable of
reliably detecting these objects. Given these factors, we propose to treat
artificial objects as anomalies and detect them through a semi-supervised
framework based on Variational Autoencoders (VAEs). We develop a method which
clusters image data in a learned low-dimensional latent space and extracts
images that are likely to contain anomalous features. We also devise an anomaly
score based on extracting poorly reconstructed regions of an image. We
demonstrate that by applying both methods on large image datasets, human
operators can be shown candidate anomalous samples with a low false positive
rate to identify objects of interest. We apply our approach to real seafloor
imagery gathered by an AUV and evaluate its sensitivity to the dimensionality
of the latent representation used by the VAE. We evaluate the precision-recall
tradeoff and demonstrate that by choosing an appropriate latent dimensionality
and threshold, we are able to achieve an average precision of 0.64 on
unlabelled datasets.
Related papers
- Bayesian Detector Combination for Object Detection with Crowdsourced Annotations [49.43709660948812]
Acquiring fine-grained object detection annotations in unconstrained images is time-consuming, expensive, and prone to noise.
We propose a novel Bayesian Detector Combination (BDC) framework to more effectively train object detectors with noisy crowdsourced annotations.
BDC is model-agnostic, requires no prior knowledge of the annotators' skill level, and seamlessly integrates with existing object detection models.
arXiv Detail & Related papers (2024-07-10T18:00:54Z) - DiAD: A Diffusion-based Framework for Multi-class Anomaly Detection [55.48770333927732]
We propose a Difusion-based Anomaly Detection (DiAD) framework for multi-class anomaly detection.
It consists of a pixel-space autoencoder, a latent-space Semantic-Guided (SG) network with a connection to the stable diffusion's denoising network, and a feature-space pre-trained feature extractor.
Experiments on MVTec-AD and VisA datasets demonstrate the effectiveness of our approach.
arXiv Detail & Related papers (2023-12-11T18:38:28Z) - Innovative Horizons in Aerial Imagery: LSKNet Meets DiffusionDet for
Advanced Object Detection [55.2480439325792]
We present an in-depth evaluation of an object detection model that integrates the LSKNet backbone with the DiffusionDet head.
The proposed model achieves a mean average precision (MAP) of approximately 45.7%, which is a significant improvement.
This advancement underscores the effectiveness of the proposed modifications and sets a new benchmark in aerial image analysis.
arXiv Detail & Related papers (2023-11-21T19:49:13Z) - Learning Heavily-Degraded Prior for Underwater Object Detection [59.5084433933765]
This paper seeks transferable prior knowledge from detector-friendly images.
It is based on statistical observations that, the heavily degraded regions of detector-friendly (DFUI) and underwater images have evident feature distribution gaps.
Our method with higher speeds and less parameters still performs better than transformer-based detectors.
arXiv Detail & Related papers (2023-08-24T12:32:46Z) - A Dataset with Multibeam Forward-Looking Sonar for Underwater Object
Detection [0.0]
Multibeam forward-looking sonar (MFLS) plays an important role in underwater detection.
There are several challenges to the research on underwater object detection with MFLS.
We present a novel dataset, consisting of over 9000 MFLS images captured using Tritech Gemini 1200ik sonar.
arXiv Detail & Related papers (2022-12-01T08:26:03Z) - The Eyecandies Dataset for Unsupervised Multimodal Anomaly Detection and
Localization [1.3124513975412255]
Eyecandies is a novel dataset for unsupervised anomaly detection and localization.
Photo-realistic images of procedurally generated candies are rendered in a controlled environment under multiple lightning conditions.
arXiv Detail & Related papers (2022-10-10T11:19:58Z) - Progressive Domain Adaptation with Contrastive Learning for Object
Detection in the Satellite Imagery [0.0]
State-of-the-art object detection methods largely fail to identify small and dense objects.
We propose a small object detection pipeline that improves the feature extraction process.
We show we can alleviate the degradation of object identification in previously unseen datasets.
arXiv Detail & Related papers (2022-09-06T15:16:35Z) - Knowledge Distillation for Oriented Object Detection on Aerial Images [1.827510863075184]
We present a model compression method for rotated object detection on aerial images by knowledge distillation, namely KD-RNet.
The experimental result on a large-scale aerial object detection dataset (DOTA) demonstrates that the proposed KD-RNet model can achieve improved mean-average precision (mAP) with reduced number of parameters, at the same time, KD-RNet boost the performance on providing high quality detections with higher overlap with groundtruth annotations.
arXiv Detail & Related papers (2022-06-20T14:24:16Z) - A Multi-purpose Real Haze Benchmark with Quantifiable Haze Levels and
Ground Truth [61.90504318229845]
This paper introduces the first paired real image benchmark dataset with hazy and haze-free images, and in-situ haze density measurements.
This dataset was produced in a controlled environment with professional smoke generating machines that covered the entire scene.
A subset of this dataset has been used for the Object Detection in Haze Track of CVPR UG2 2022 challenge.
arXiv Detail & Related papers (2022-06-13T19:14:06Z) - Salient Objects in Clutter [130.63976772770368]
This paper identifies and addresses a serious design bias of existing salient object detection (SOD) datasets.
This design bias has led to a saturation in performance for state-of-the-art SOD models when evaluated on existing datasets.
We propose a new high-quality dataset and update the previous saliency benchmark.
arXiv Detail & Related papers (2021-05-07T03:49:26Z) - Underwater object detection using Invert Multi-Class Adaboost with deep
learning [37.14538666012363]
We propose a novel neural network architecture, namely Sample-WeIghted hyPEr Network (SWIPENet), for small object detection.
We show that the proposed SWIPENet+IMA framework achieves better performance in detection accuracy against several state-of-the-art object detection approaches.
arXiv Detail & Related papers (2020-05-23T15:30:38Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.