Improved Benthic Classification using Resolution Scaling and SymmNet
Unsupervised Domain Adaptation
- URL: http://arxiv.org/abs/2303.10960v1
- Date: Mon, 20 Mar 2023 09:33:47 GMT
- Title: Improved Benthic Classification using Resolution Scaling and SymmNet
Unsupervised Domain Adaptation
- Authors: Heather Doig, Oscar Pizarro and Stefan B. Williams
- Abstract summary: We adapt the SymmNet state-of-the-art Unsupervised Domain Adaptation method with an efficient bilinear pooling layer and image scaling to normalise spatial resolution.
The results show that generic domain adaptation can be enhanced to produce a significant increase in accuracy for images from an AUV survey that differs from the training images.
- Score: 8.35780131268962
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Autonomous Underwater Vehicles (AUVs) conduct regular visual surveys of
marine environments to characterise and monitor the composition and diversity
of the benthos. The use of machine learning classifiers for this task is
limited by the low numbers of annotations available and the many fine-grained
classes involved. In addition to these challenges, there are domain shifts
between image sets acquired during different AUV surveys due to changes in
camera systems, imaging altitude, illumination and water column properties
leading to a drop in classification performance for images from a different
survey where some or all these elements may have changed. This paper proposes a
framework to improve the performance of a benthic morphospecies classifier when
used to classify images from a different survey compared to the training data.
We adapt the SymmNet state-of-the-art Unsupervised Domain Adaptation method
with an efficient bilinear pooling layer and image scaling to normalise spatial
resolution, and show improved classification accuracy. We test our approach on
two datasets with images from AUV surveys with different imaging payloads and
locations. The results show that generic domain adaptation can be enhanced to
produce a significant increase in accuracy for images from an AUV survey that
differs from the training images.
Related papers
- On Vision Transformers for Classification Tasks in Side-Scan Sonar Imagery [0.0]
Side-scan sonar (SSS) imagery presents unique challenges in the classification of man-made objects on the seafloor.
This paper rigorously compares the performance of ViT models alongside commonly used CNN architectures for binary classification tasks in SSS imagery.
ViT-based models exhibit superior classification performance across f1-score, precision, recall, and accuracy metrics.
arXiv Detail & Related papers (2024-09-18T14:36:50Z) - Self-supervised Domain-agnostic Domain Adaptation for Satellite Images [18.151134198549574]
We propose an self-supervised domain-agnostic domain adaptation (SS(DA)2) method to perform domain adaptation without such a domain definition.
We first design a contrastive generative adversarial loss to train a generative network to perform image-to-image translation between any two satellite image patches.
Then, we improve the generalizability of the downstream models by augmenting the training data with different testing spectral characteristics.
arXiv Detail & Related papers (2023-09-20T07:37:23Z) - Fine-grained Recognition with Learnable Semantic Data Augmentation [68.48892326854494]
Fine-grained image recognition is a longstanding computer vision challenge.
We propose diversifying the training data at the feature-level to alleviate the discriminative region loss problem.
Our method significantly improves the generalization performance on several popular classification networks.
arXiv Detail & Related papers (2023-09-01T11:15:50Z) - Improving Human-Object Interaction Detection via Virtual Image Learning [68.56682347374422]
Human-Object Interaction (HOI) detection aims to understand the interactions between humans and objects.
In this paper, we propose to alleviate the impact of such an unbalanced distribution via Virtual Image Leaning (VIL)
A novel label-to-image approach, Multiple Steps Image Creation (MUSIC), is proposed to create a high-quality dataset that has a consistent distribution with real images.
arXiv Detail & Related papers (2023-08-04T10:28:48Z) - CSP: Self-Supervised Contrastive Spatial Pre-Training for
Geospatial-Visual Representations [90.50864830038202]
We present Contrastive Spatial Pre-Training (CSP), a self-supervised learning framework for geo-tagged images.
We use a dual-encoder to separately encode the images and their corresponding geo-locations, and use contrastive objectives to learn effective location representations from images.
CSP significantly boosts the model performance with 10-34% relative improvement with various labeled training data sampling ratios.
arXiv Detail & Related papers (2023-05-01T23:11:18Z) - Stacking Ensemble Learning in Deep Domain Adaptation for Ophthalmic
Image Classification [61.656149405657246]
Domain adaptation is effective in image classification tasks where obtaining sufficient label data is challenging.
We propose a novel method, named SELDA, for stacking ensemble learning via extending three domain adaptation methods.
The experimental results using Age-Related Eye Disease Study (AREDS) benchmark ophthalmic dataset demonstrate the effectiveness of the proposed model.
arXiv Detail & Related papers (2022-09-27T14:19:00Z) - Exploring Vision Transformers for Fine-grained Classification [0.0]
We propose a multi-stage ViT framework for fine-grained image classification tasks, which localizes the informative image regions without requiring architectural changes.
We demonstrate the value of our approach by experimenting with four popular fine-grained benchmarks: CUB-200-2011, Stanford Cars, Stanford Dogs, and FGVC7 Plant Pathology.
arXiv Detail & Related papers (2021-06-19T23:57:31Z) - DoFE: Domain-oriented Feature Embedding for Generalizable Fundus Image
Segmentation on Unseen Datasets [96.92018649136217]
We present a novel Domain-oriented Feature Embedding (DoFE) framework to improve the generalization ability of CNNs on unseen target domains.
Our DoFE framework dynamically enriches the image features with additional domain prior knowledge learned from multi-source domains.
Our framework generates satisfying segmentation results on unseen datasets and surpasses other domain generalization and network regularization methods.
arXiv Detail & Related papers (2020-10-13T07:28:39Z) - Data Augmentation via Mixed Class Interpolation using Cycle-Consistent
Generative Adversarial Networks Applied to Cross-Domain Imagery [16.870604081967866]
Machine learning driven object detection and classification within non-visible imagery has an important role in many fields.
However, such applications often suffer due to the limited quantity and variety of non-visible spectral domain imagery.
This paper proposes and evaluates a novel data augmentation approach that leverages the more readily available visible-band imagery.
arXiv Detail & Related papers (2020-05-05T18:53:38Z) - Cross-Domain Few-Shot Classification via Learned Feature-Wise
Transformation [109.89213619785676]
Few-shot classification aims to recognize novel categories with only few labeled images in each class.
Existing metric-based few-shot classification algorithms predict categories by comparing the feature embeddings of query images with those from a few labeled images.
While promising performance has been demonstrated, these methods often fail to generalize to unseen domains.
arXiv Detail & Related papers (2020-01-23T18:55:43Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.