Point Label Aware Superpixels for Multi-species Segmentation of
Underwater Imagery
- URL: http://arxiv.org/abs/2202.13487v1
- Date: Sun, 27 Feb 2022 23:46:43 GMT
- Title: Point Label Aware Superpixels for Multi-species Segmentation of
Underwater Imagery
- Authors: Scarlett Raine, Ross Marchant, Brano Kusy, Frederic Maire, Tobias
Fischer
- Abstract summary: Monitoring coral reefs using underwater vehicles increases the range of marine surveys and availability of historical ecological data.
We propose a point label aware method for propagating labels within superpixel regions to obtain augmented ground truth for training a semantic segmentation model.
Our method outperforms prior methods on the UCSD Mosaics dataset by 3.62% for pixel accuracy and 8.35% for mean IoU for the label propagation task.
- Score: 4.195806160139487
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Monitoring coral reefs using underwater vehicles increases the range of
marine surveys and availability of historical ecological data by collecting
significant quantities of images. Analysis of this imagery can be automated
using a model trained to perform semantic segmentation; however, it is too
costly and time-consuming to densely label images for training supervised
models. In this letter, we leverage photo-quadrat imagery labeled by ecologists
with sparse point labels. We propose a point label aware method for propagating
labels within superpixel regions to obtain augmented ground truth for training
a semantic segmentation model. Our point label aware superpixel method utilizes
the sparse point labels, and clusters pixels using learned features to
accurately generate single-species segments in cluttered, complex coral images.
Our method outperforms prior methods on the UCSD Mosaics dataset by 3.62% for
pixel accuracy and 8.35% for mean IoU for the label propagation task.
Furthermore, our approach reduces the computation time reported for previous
approaches by 76%. We train a DeepLabv3+ architecture and outperform the
state of the art for semantic segmentation by 2.91% for pixel accuracy and
9.65% for mean IoU on the UCSD Mosaics dataset, and by 4.19% for pixel accuracy
and 14.32% for mean IoU on the Eilat dataset.
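
The label propagation idea and the two reported metrics can be illustrated with a minimal sketch. This is not the authors' implementation: the paper clusters pixels using learned features, whereas the code below uses a fixed square grid as a stand-in for superpixels, and the majority-vote rule, the `UNLABELED` marker, and all function names are hypothetical choices made for illustration. Scoring only the pixels that received a label is likewise an assumption.

```python
import numpy as np

UNLABELED = -1  # hypothetical marker for pixels that received no label


def grid_superpixels(height, width, cell):
    """Stand-in segmentation: square cells of side `cell`, one id per cell."""
    rows = np.arange(height) // cell
    cols = np.arange(width) // cell
    n_cols = (width + cell - 1) // cell
    return rows[:, None] * n_cols + cols[None, :]


def propagate_point_labels(segments, points, labels):
    """Assign every pixel of a segment the majority label of the points in it."""
    dense = np.full(segments.shape, UNLABELED, dtype=int)
    seg_at_points = segments[points[:, 0], points[:, 1]]
    for seg_id in np.unique(seg_at_points):
        votes = labels[seg_at_points == seg_id]
        dense[segments == seg_id] = np.bincount(votes).argmax()
    return dense


def pixel_accuracy(pred, truth):
    """Fraction of labeled pixels whose propagated label matches the truth."""
    valid = pred != UNLABELED
    return float((pred[valid] == truth[valid]).mean())


def mean_iou(pred, truth, num_classes):
    """Mean intersection-over-union over classes present among labeled pixels."""
    valid = pred != UNLABELED
    ious = []
    for c in range(num_classes):
        p = (pred == c) & valid
        t = (truth == c) & valid
        union = (p | t).sum()
        if union > 0:
            ious.append((p & t).sum() / union)
    return float(np.mean(ious))


# Toy 8x8 image: class 0 in columns 0-2, class 1 in columns 3-7.
truth = np.tile((np.arange(8) >= 3).astype(int), (8, 1))
segments = grid_superpixels(8, 8, cell=4)

# Three sparse point labels as (row, col) pairs with their class ids.
points = np.array([[1, 1], [1, 6], [6, 1]])
labels = np.array([0, 1, 0])

dense = propagate_point_labels(segments, points, labels)
print("pixel accuracy:", pixel_accuracy(dense, truth))
print("mean IoU:", mean_iou(dense, truth, num_classes=2))
```

Because the grid cells ignore the true class boundary at column 3, some propagated labels are wrong, and the cell containing no point stays unlabeled. That is the kind of error a segmentation that respects both image content and point labels is meant to reduce.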
Related papers
- Semi-Supervised Segmentation via Embedding Matching [0.8896991256227597]
Deep convolutional neural networks are widely used in medical image segmentation but require many labeled images for training.
We propose a novel semi-supervised segmentation method that leverages mostly unlabeled images and a small set of labeled images in training.
The proposed approach achieved a 95th-percentile Hausdorff distance (HD95) of 3.30 and an IoU of 0.929, surpassing the best results of existing methods (HD95 of 4.07 and IoU of 0.927).
arXiv Detail & Related papers (2024-07-05T16:49:21Z)
- Human-in-the-Loop Segmentation of Multi-species Coral Imagery [3.3564744382205127]
Point label propagation is a technique that uses existing images labeled with sparse points to create augmented ground truth data.
We show that recent advances in large foundation models facilitate the creation of augmented ground truth masks.
We present a labeling method based on human-in-the-loop principles, which greatly enhances annotation efficiency.
arXiv Detail & Related papers (2024-04-15T01:47:44Z)
- Learning Semantic Segmentation with Query Points Supervision on Aerial Images [57.09251327650334]
We present a weakly supervised learning algorithm for training semantic segmentation models.
Our proposed approach performs accurate semantic segmentation and improves efficiency by significantly reducing the cost and time required for manual annotation.
arXiv Detail & Related papers (2023-09-11T14:32:04Z)
- CorrMatch: Label Propagation via Correlation Matching for Semi-Supervised Semantic Segmentation [73.89509052503222]
This paper presents a simple but performant semi-supervised semantic segmentation approach, called CorrMatch.
We observe that the correlation maps not only enable clustering pixels of the same category easily but also contain good shape information.
We propose to conduct pixel propagation by modeling the pairwise similarities of pixels to spread the high-confidence pixels and dig out more.
Then, we perform region propagation to enhance the pseudo labels with accurate class-agnostic masks extracted from the correlation maps.
arXiv Detail & Related papers (2023-06-07T10:02:29Z)
- Probabilistic Deep Metric Learning for Hyperspectral Image Classification [91.5747859691553]
This paper proposes a probabilistic deep metric learning framework for hyperspectral image classification.
It aims to predict the category of each pixel for an image captured by hyperspectral sensors.
Our framework can be readily applied to existing hyperspectral image classification methods.
arXiv Detail & Related papers (2022-11-15T17:57:12Z)
- Unsupervised Domain Adaptation with Contrastive Learning for OCT Segmentation [49.59567529191423]
We propose a novel semi-supervised learning framework for segmentation of volumetric images from new unlabeled domains.
We jointly use supervised and contrastive learning, also introducing a contrastive pairing scheme that leverages similarity between nearby slices in 3D.
arXiv Detail & Related papers (2022-03-07T19:02:26Z)
- Aerial Scene Parsing: From Tile-level Scene Classification to Pixel-wise Semantic Labeling [48.30060717413166]
Given an aerial image, aerial scene parsing (ASP) aims to interpret the semantic structure of the image content by assigning a semantic label to every pixel of the image.
We present a large-scale scene classification dataset, termed Million-AID, that contains one million aerial images.
We also report benchmarking experiments using classical convolutional neural networks (CNNs) to achieve pixel-wise semantic labeling.
arXiv Detail & Related papers (2022-01-06T07:40:47Z)
- GeoCLR: Georeference Contrastive Learning for Efficient Seafloor Image Interpretation [8.837172743444249]
This paper describes Georeference Contrastive Learning of visual Representation (GeoCLR) for efficient training of CNNs.
GeoCLR generates a similar image pair from images taken at nearby locations and contrasts it with an image pair taken far apart.
A key advantage of this method is that it is self-supervised and does not require any human input for CNN training.
We demonstrate how the latent representations generated by GeoCLR can be used to efficiently guide human annotation efforts.
arXiv Detail & Related papers (2021-08-13T22:42:34Z)
- Semantic Segmentation with Generative Models: Semi-Supervised Learning and Strong Out-of-Domain Generalization [112.68171734288237]
We propose a novel framework for discriminative pixel-level tasks using a generative model of both images and labels.
We learn a generative adversarial network that captures the joint image-label distribution and is trained efficiently using a large set of unlabeled images.
We demonstrate strong in-domain performance compared to several baselines, and are the first to showcase extreme out-of-domain generalization.
arXiv Detail & Related papers (2021-04-12T21:41:25Z)