Rethinking Unsupervised Neural Superpixel Segmentation
- URL: http://arxiv.org/abs/2206.10213v1
- Date: Tue, 21 Jun 2022 09:30:26 GMT
- Title: Rethinking Unsupervised Neural Superpixel Segmentation
- Authors: Moshe Eliasof, Nir Ben Zikri, Eran Treister
- Abstract summary: unsupervised learning for superpixel segmentation via CNNs has been studied.
We propose three key elements to improve the efficacy of such networks.
By experimenting with the BSDS500 dataset, we find evidence to the significance of our proposal.
- Score: 6.123324869194195
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Recently, the concept of unsupervised learning for superpixel segmentation
via CNNs has been studied. Essentially, such methods generate superpixels by
convolutional neural network (CNN) employed on a single image, and such CNNs
are trained without any labels or further information. Thus, such approach
relies on the incorporation of priors, typically by designing an objective
function that guides the solution towards a meaningful superpixel segmentation.
In this paper we propose three key elements to improve the efficacy of such
networks: (i) the similarity of the \emph{soft} superpixelated image compared
to the input image, (ii) the enhancement and consideration of object edges and
boundaries and (iii) a modified architecture based on atrous convolution, which
allow for a wider field of view, functioning as a multi-scale component in our
network. By experimenting with the BSDS500 dataset, we find evidence to the
significance of our proposal, both qualitatively and quantitatively.
Related papers
- Distance Weighted Trans Network for Image Completion [52.318730994423106]
We propose a new architecture that relies on Distance-based Weighted Transformer (DWT) to better understand the relationships between an image's components.
CNNs are used to augment the local texture information of coarse priors.
DWT blocks are used to recover certain coarse textures and coherent visual structures.
arXiv Detail & Related papers (2023-10-11T12:46:11Z) - Single-Image Super-Resolution Reconstruction based on the Differences of
Neighboring Pixels [3.257500143434429]
The deep learning technique was used to increase the performance of single image super-resolution (SISR)
In this paper, we propose the differences of neighboring pixels to regularize the CNN by constructing a graph from the estimated image and the ground-truth image.
The proposed method outperforms the state-of-the-art methods in terms of quantitative and qualitative evaluation of the benchmark datasets.
arXiv Detail & Related papers (2022-12-28T07:30:07Z) - Unsupervised Image Semantic Segmentation through Superpixels and Graph
Neural Networks [6.123324869194195]
Unsupervised image segmentation is an important task in many real-world scenarios where labelled data is of scarce availability.
We propose a novel approach that harnesses recent advances in unsupervised learning using a combination of Mutual Information Maximization (MIM), Neural Superpixel and Graph Neural Networks (GNNs) in an end-to-end manner.
arXiv Detail & Related papers (2022-10-21T08:35:18Z) - Multiscale Convolutional Transformer with Center Mask Pretraining for
Hyperspectral Image Classificationtion [14.33259265286265]
We propose a noval multi-scale convolutional embedding module for hyperspectral images (HSI) to realize effective extraction of spatial-spectral information.
Similar to Mask autoencoder, but our pre-training method only masks the corresponding token of the central pixel in the encoder, and inputs the remaining token into the decoder to reconstruct the spectral information of the central pixel.
arXiv Detail & Related papers (2022-03-09T14:42:26Z) - AF$_2$: Adaptive Focus Framework for Aerial Imagery Segmentation [86.44683367028914]
Aerial imagery segmentation has some unique challenges, the most critical one among which lies in foreground-background imbalance.
We propose Adaptive Focus Framework (AF$), which adopts a hierarchical segmentation procedure and focuses on adaptively utilizing multi-scale representations.
AF$ has significantly improved the accuracy on three widely used aerial benchmarks, as fast as the mainstream method.
arXiv Detail & Related papers (2022-02-18T10:14:45Z) - Superpixel-based Domain-Knowledge Infusion in Computer Vision [0.7349727826230862]
Superpixels are higher-order perceptual groups of pixels in an image, often carrying much more information than raw pixels.
There is an inherent relational structure to the relationship among different superpixels of an image.
This relational information can convey some form of domain information about the image, e.g. relationship between superpixels representing two eyes in a cat image.
arXiv Detail & Related papers (2021-05-20T01:25:42Z) - AINet: Association Implantation for Superpixel Segmentation [82.21559299694555]
We propose a novel textbfAssociation textbfImplantation (AI) module to enable the network to explicitly capture the relations between the pixel and its surrounding grids.
Our method could not only achieve state-of-the-art performance but maintain satisfactory inference efficiency.
arXiv Detail & Related papers (2021-01-26T10:40:13Z) - Probabilistic Graph Attention Network with Conditional Kernels for
Pixel-Wise Prediction [158.88345945211185]
We present a novel approach that advances the state of the art on pixel-level prediction in a fundamental aspect, i.e. structured multi-scale features learning and fusion.
We propose a probabilistic graph attention network structure based on a novel Attention-Gated Conditional Random Fields (AG-CRFs) model for learning and fusing multi-scale representations in a principled manner.
arXiv Detail & Related papers (2021-01-08T04:14:29Z) - PC-RGNN: Point Cloud Completion and Graph Neural Network for 3D Object
Detection [57.49788100647103]
LiDAR-based 3D object detection is an important task for autonomous driving.
Current approaches suffer from sparse and partial point clouds of distant and occluded objects.
In this paper, we propose a novel two-stage approach, namely PC-RGNN, dealing with such challenges by two specific solutions.
arXiv Detail & Related papers (2020-12-18T18:06:43Z) - Learning Enriched Features for Real Image Restoration and Enhancement [166.17296369600774]
convolutional neural networks (CNNs) have achieved dramatic improvements over conventional approaches for image restoration task.
We present a novel architecture with the collective goals of maintaining spatially-precise high-resolution representations through the entire network.
Our approach learns an enriched set of features that combines contextual information from multiple scales, while simultaneously preserving the high-resolution spatial details.
arXiv Detail & Related papers (2020-03-15T11:04:30Z) - Superpixel Segmentation via Convolutional Neural Networks with
Regularized Information Maximization [11.696069523681178]
We propose an unsupervised superpixel segmentation method by optimizing a randomly-d convolutional neural network (CNN) in inference time.
Our method generates superpixels via CNN from a single image without any labels by minimizing a proposed objective function for superpixel segmentation in inference time.
arXiv Detail & Related papers (2020-02-17T04:32:03Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.