BenthIQ: a Transformer-Based Benthic Classification Model for Coral
Restoration
- URL: http://arxiv.org/abs/2311.13661v1
- Date: Wed, 22 Nov 2023 19:25:31 GMT
- Title: BenthIQ: a Transformer-Based Benthic Classification Model for Coral
Restoration
- Authors: Rupa Kurinchi-Vendhan, Drew Gray, Elijah Cole
- Abstract summary: Coral reefs are vital for marine biodiversity, coastal protection, and supporting human livelihoods globally.
Current methods for creating benthic composition maps often compromise between spatial coverage and resolution.
We introduce BenthIQ, a multi-label semantic segmentation network designed for high-precision classification of underwater substrates.
- Score: 4.931399476945033
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Coral reefs are vital for marine biodiversity, coastal protection, and
supporting human livelihoods globally. However, they are increasingly
threatened by mass bleaching events, pollution, and unsustainable practices
with the advent of climate change. Monitoring the health of these ecosystems is
crucial for effective restoration and management. Current methods for creating
benthic composition maps often compromise between spatial coverage and
resolution. In this paper, we introduce BenthIQ, a multi-label semantic
segmentation network designed for high-precision classification of underwater
substrates, including live coral, algae, rock, and sand. Although commonly
deployed CNNs are limited in learning long-range semantic information,
transformer-based models have recently achieved state-of-the-art performance in
vision tasks such as object detection and image classification. We integrate
the hierarchical Swin Transformer as the backbone of a U-shaped encoder-decoder
architecture for local-global semantic feature learning. Using a real-world
case study in French Polynesia, we demonstrate that our approach outperforms
traditional CNN and attention-based models on pixel-wise classification of
shallow reef imagery.
Related papers
- Deep learning for multi-label classification of coral conditions in the
Indo-Pacific via underwater photogrammetry [24.00646413446011]
This study created a dataset representing common coral conditions and associated stressors in the Indo-Pacific.
It assessed existing classification algorithms and proposed a new multi-label method for automatically detecting coral conditions and extracting ecological information.
The proposed method accurately classified coral conditions as healthy, compromised, dead, and rubble.
arXiv Detail & Related papers (2024-03-09T14:42:16Z) - Affine-Consistent Transformer for Multi-Class Cell Nuclei Detection [76.11864242047074]
We propose a novel Affine-Consistent Transformer (AC-Former), which directly yields a sequence of nucleus positions.
We introduce an Adaptive Affine Transformer (AAT) module, which can automatically learn the key spatial transformations to warp original images for local network training.
Experimental results demonstrate that the proposed method significantly outperforms existing state-of-the-art algorithms on various benchmarks.
arXiv Detail & Related papers (2023-10-22T02:27:02Z) - CoinSeg: Contrast Inter- and Intra- Class Representations for
Incremental Segmentation [85.13209973293229]
Class incremental semantic segmentation aims to strike a balance between the model's stability and plasticity.
We propose Contrast inter- and intra-class representations for Incremental (CoinSeg)
arXiv Detail & Related papers (2023-10-10T07:08:49Z) - Scalable Semantic 3D Mapping of Coral Reefs with Deep Learning [4.8902950939676675]
This paper presents a new paradigm for mapping underwater environments from ego-motion video.
We show high-precision 3D semantic mapping at unprecedented scale with significantly reduced required labor costs.
Our approach significantly scales up coral reef monitoring by taking a leap towards fully automatic analysis of video transects.
arXiv Detail & Related papers (2023-09-22T11:35:10Z) - Pengembangan Model untuk Mendeteksi Kerusakan pada Terumbu Karang dengan
Klasifikasi Citra [3.254879465902239]
This study utilizes a specialized dataset consisting of 923 images collected from Flickr using the Flickr API.
The method employed in this research involves the use of machine learning models, particularly convolutional neural networks (CNN)
It was found that a from-scratch ResNet model can outperform pretrained models in terms of precision and accuracy.
arXiv Detail & Related papers (2023-08-08T15:30:08Z) - PUGAN: Physical Model-Guided Underwater Image Enhancement Using GAN with
Dual-Discriminators [120.06891448820447]
How to obtain clear and visually pleasant images has become a common concern of people.
The task of underwater image enhancement (UIE) has also emerged as the times require.
In this paper, we propose a physical model-guided GAN model for UIE, referred to as PUGAN.
Our PUGAN outperforms state-of-the-art methods in both qualitative and quantitative metrics.
arXiv Detail & Related papers (2023-06-15T07:41:12Z) - FisHook -- An Optimized Approach to Marine Specie Classification using
MobileNetV2 [5.565562836494568]
classification and monitoring of marine species can aid in understanding their distribution, population dynamics, and the impact of human activities on them.
Deep-learning algorithms can now efficiently classify marine species, making it easier to monitor and manage marine ecosystems.
arXiv Detail & Related papers (2023-04-04T04:30:25Z) - Combining Photogrammetric Computer Vision and Semantic Segmentation for
Fine-grained Understanding of Coral Reef Growth under Climate Change [6.335630432207172]
Corals are the primary habitat-building life-form on reefs that support a quarter of the species in the ocean.
For the first time, 3D fine-grained semantic modeling and rugosity evaluation of coral reefs have been completed at millimeter (mm) accuracy.
arXiv Detail & Related papers (2022-12-08T08:09:57Z) - SATS: Self-Attention Transfer for Continual Semantic Segmentation [50.51525791240729]
continual semantic segmentation suffers from the same catastrophic forgetting issue as in continual classification learning.
This study proposes to transfer a new type of information relevant to knowledge, i.e. the relationships between elements within each image.
The relationship information can be effectively obtained from the self-attention maps in a Transformer-style segmentation model.
arXiv Detail & Related papers (2022-03-15T06:09:28Z) - Spatially-Adaptive Image Restoration using Distortion-Guided Networks [51.89245800461537]
We present a learning-based solution for restoring images suffering from spatially-varying degradations.
We propose SPAIR, a network design that harnesses distortion-localization information and dynamically adjusts to difficult regions in the image.
arXiv Detail & Related papers (2021-08-19T11:02:25Z) - Learning CNN filters from user-drawn image markers for coconut-tree
image classification [78.42152902652215]
We present a method that needs a minimal set of user-selected images to train the CNN's feature extractor.
The method learns the filters of each convolutional layer from user-drawn markers in image regions that discriminate classes.
It does not rely on optimization based on backpropagation, and we demonstrate its advantages on the binary classification of coconut-tree aerial images.
arXiv Detail & Related papers (2020-08-08T15:50:23Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.