Unsupervised Image Semantic Segmentation through Superpixels and Graph
Neural Networks
- URL: http://arxiv.org/abs/2210.11810v1
- Date: Fri, 21 Oct 2022 08:35:18 GMT
- Title: Unsupervised Image Semantic Segmentation through Superpixels and Graph
Neural Networks
- Authors: Moshe Eliasof, Nir Ben Zikri, Eran Treister
- Abstract summary: Unsupervised image segmentation is an important task in many real-world scenarios where labelled data is of scarce availability.
We propose a novel approach that harnesses recent advances in unsupervised learning using a combination of Mutual Information Maximization (MIM), Neural Superpixel and Graph Neural Networks (GNNs) in an end-to-end manner.
- Score: 6.123324869194195
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Unsupervised image segmentation is an important task in many real-world
scenarios where labelled data is of scarce availability. In this paper we
propose a novel approach that harnesses recent advances in unsupervised
learning using a combination of Mutual Information Maximization (MIM), Neural
Superpixel Segmentation and Graph Neural Networks (GNNs) in an end-to-end
manner, an approach that has not been explored yet. We take advantage of the
compact representation of superpixels and combine it with GNNs in order to
learn strong and semantically meaningful representations of images.
Specifically, we show that our GNN based approach allows to model interactions
between distant pixels in the image and serves as a strong prior to existing
CNNs for an improved accuracy. Our experiments reveal both the qualitative and
quantitative advantages of our approach compared to current state-of-the-art
methods over four popular datasets.
Related papers
- Masked Contrastive Graph Representation Learning for Age Estimation [44.96502862249276]
This paper utilizes the property of graph representation learning in dealing with image redundancy information.
We propose a novel Masked Contrastive Graph Representation Learning (MCGRL) method for age estimation.
Experimental results on real-world face image datasets demonstrate the superiority of our proposed method over other state-of-the-art age estimation approaches.
arXiv Detail & Related papers (2023-06-16T15:53:21Z) - Rethinking Unsupervised Neural Superpixel Segmentation [6.123324869194195]
unsupervised learning for superpixel segmentation via CNNs has been studied.
We propose three key elements to improve the efficacy of such networks.
By experimenting with the BSDS500 dataset, we find evidence to the significance of our proposal.
arXiv Detail & Related papers (2022-06-21T09:30:26Z) - In-N-Out Generative Learning for Dense Unsupervised Video Segmentation [89.21483504654282]
In this paper, we focus on the unsupervised Video Object (VOS) task which learns visual correspondence from unlabeled videos.
We propose the In-aNd-Out (INO) generative learning from a purely generative perspective, which captures both high-level and fine-grained semantics.
Our INO outperforms previous state-of-the-art methods by significant margins.
arXiv Detail & Related papers (2022-03-29T07:56:21Z) - Auto-Transfer: Learning to Route Transferrable Representations [77.30427535329571]
We propose a novel adversarial multi-armed bandit approach which automatically learns to route source representations to appropriate target representations.
We see upwards of 5% accuracy improvements compared with the state-of-the-art knowledge transfer methods.
arXiv Detail & Related papers (2022-02-02T13:09:27Z) - Superpixel-based Domain-Knowledge Infusion in Computer Vision [0.7349727826230862]
Superpixels are higher-order perceptual groups of pixels in an image, often carrying much more information than raw pixels.
There is an inherent relational structure to the relationship among different superpixels of an image.
This relational information can convey some form of domain information about the image, e.g. relationship between superpixels representing two eyes in a cat image.
arXiv Detail & Related papers (2021-05-20T01:25:42Z) - Group-Wise Semantic Mining for Weakly Supervised Semantic Segmentation [49.90178055521207]
This work addresses weakly supervised semantic segmentation (WSSS), with the goal of bridging the gap between image-level annotations and pixel-level segmentation.
We formulate WSSS as a novel group-wise learning task that explicitly models semantic dependencies in a group of images to estimate more reliable pseudo ground-truths.
In particular, we devise a graph neural network (GNN) for group-wise semantic mining, wherein input images are represented as graph nodes.
arXiv Detail & Related papers (2020-12-09T12:40:13Z) - Multi-Level Graph Convolutional Network with Automatic Graph Learning
for Hyperspectral Image Classification [63.56018768401328]
We propose a Multi-level Graph Convolutional Network (GCN) with Automatic Graph Learning method (MGCN-AGL) for HSI classification.
By employing attention mechanism to characterize the importance among spatially neighboring regions, the most relevant information can be adaptively incorporated to make decisions.
Our MGCN-AGL encodes the long range dependencies among image regions based on the expressive representations that have been produced at local level.
arXiv Detail & Related papers (2020-09-19T09:26:20Z) - CRNet: Cross-Reference Networks for Few-Shot Segmentation [59.85183776573642]
Few-shot segmentation aims to learn a segmentation model that can be generalized to novel classes with only a few training images.
With a cross-reference mechanism, our network can better find the co-occurrent objects in the two images.
Experiments on the PASCAL VOC 2012 dataset show that our network achieves state-of-the-art performance.
arXiv Detail & Related papers (2020-03-24T04:55:43Z) - Learning Enriched Features for Real Image Restoration and Enhancement [166.17296369600774]
convolutional neural networks (CNNs) have achieved dramatic improvements over conventional approaches for image restoration task.
We present a novel architecture with the collective goals of maintaining spatially-precise high-resolution representations through the entire network.
Our approach learns an enriched set of features that combines contextual information from multiple scales, while simultaneously preserving the high-resolution spatial details.
arXiv Detail & Related papers (2020-03-15T11:04:30Z) - Superpixel Image Classification with Graph Attention Networks [4.714325419968082]
This paper presents a methodology for image classification using Graph Neural Network (GNN) models.
We transform the input images into region adjacency graphs (RAGs), in which regions are superpixels and edges connect neighboring superpixels.
Experiments suggest that Graph Attention Networks (GATs), which combine graph convolutions with self-attention mechanisms, outperforms other GNN models.
arXiv Detail & Related papers (2020-02-13T14:52:32Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.