FuSS: Fusing Superpixels for Improved Segmentation Consistency
- URL: http://arxiv.org/abs/2206.02714v1
- Date: Mon, 6 Jun 2022 16:14:19 GMT
- Title: FuSS: Fusing Superpixels for Improved Segmentation Consistency
- Authors: Ian Nunes, Matheus B. Pereira, Hugo Oliveira, Jefersson A. Dos Santos
and Marcus Poggi
- Abstract summary: We propose two approaches to improve the semantic consistency of Open Set Semantic.
First, we propose a method called OpenGMM to model the distribution of pixels for each class in a multimodal manner.
The second approach is a post-processing which uses superpixels to enforce highly homogeneous regions to behave equally.
- Score: 2.7771471571972333
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: In this work, we propose two different approaches to improve the semantic
consistency of Open Set Semantic Segmentation. First, we propose a method
called OpenGMM that extends the OpenPCS framework using a Gaussian Mixture of
Models to model the distribution of pixels for each class in a multimodal
manner. The second approach is a post-processing which uses superpixels to
enforce highly homogeneous regions to behave equally, rectifying erroneous
classified pixels within these regions, we also proposed a novel superpixel
method called FuSS. All tests were performed on ISPRS Vaihingen and Potsdam
datasets, and both methods were capable to improve quantitative and qualitative
results for both datasets. Besides that, the post-process with FuSS achieved
state-of-the-art results for both datasets. The official implementation is
available at: \url{https://github.com/iannunes/FuSS}.
Related papers
- A New Dataset and Framework for Real-World Blurred Images Super-Resolution [9.122275433854062]
We develop a new super-resolution dataset specifically tailored for blur images, named the Real-world Blur-kept Super-Resolution (ReBlurSR) dataset.
We propose Perceptual-Blur-adaptive Super-Resolution (PBaSR), which comprises two main modules: the Cross Disentanglement Module (CDM) and the Cross Fusion Module (CFM)
By integrating these two modules, PBaSR achieves commendable performance on both general and blur data without any additional inference and deployment cost.
arXiv Detail & Related papers (2024-07-20T14:07:03Z) - Multiway Point Cloud Mosaicking with Diffusion and Global Optimization [74.3802812773891]
We introduce a novel framework for multiway point cloud mosaicking (named Wednesday)
At the core of our approach is ODIN, a learned pairwise registration algorithm that identifies overlaps and refines attention scores.
Tested on four diverse, large-scale datasets, our method state-of-the-art pairwise and rotation registration results by a large margin on all benchmarks.
arXiv Detail & Related papers (2024-03-30T17:29:13Z) - Learning Invariant Inter-pixel Correlations for Superpixel Generation [12.605604620139497]
Learnable features exhibit constrained discriminative capability, resulting in unsatisfactory pixel grouping performance.
We propose the Content Disentangle Superpixel algorithm to selectively separate the invariant inter-pixel correlations and statistical properties.
The experimental results on four benchmark datasets demonstrate the superiority of our approach to existing state-of-the-art methods.
arXiv Detail & Related papers (2024-02-28T09:46:56Z) - Skeleton-Guided Instance Separation for Fine-Grained Segmentation in
Microscopy [23.848474219551818]
One of the fundamental challenges in microscopy (MS) image analysis is instance segmentation (IS)
We propose a novel one-stage framework named A2B-IS to address this challenge and enhance the accuracy of IS in MS images.
Our method has been thoroughly validated on two large-scale MS datasets.
arXiv Detail & Related papers (2024-01-18T11:14:32Z) - Improving Pixel-based MIM by Reducing Wasted Modeling Capability [77.99468514275185]
We propose a new method that explicitly utilizes low-level features from shallow layers to aid pixel reconstruction.
To the best of our knowledge, we are the first to systematically investigate multi-level feature fusion for isotropic architectures.
Our method yields significant performance gains, such as 1.2% on fine-tuning, 2.8% on linear probing, and 2.6% on semantic segmentation.
arXiv Detail & Related papers (2023-08-01T03:44:56Z) - High-fidelity Pseudo-labels for Boosting Weakly-Supervised Segmentation [17.804090651425955]
Image-level weakly-supervised segmentation (WSSS) reduces the usually vast data annotation cost by surrogate segmentation masks during training.
Our work is based on two techniques for improving CAMs; importance sampling, which is a substitute for GAP, and the feature similarity loss.
We reformulate both techniques based on binomial posteriors of multiple independent binary problems.
This has two benefits; their performance is improved and they become more general, resulting in an add-on method that can boost virtually any WSSS method.
arXiv Detail & Related papers (2023-04-05T17:43:57Z) - FIXED: Frustratingly Easy Domain Generalization with Mixup [53.782029033068675]
Domain generalization (DG) aims to learn a generalizable model from multiple training domains such that it can perform well on unseen target domains.
A popular strategy is to augment training data to benefit generalization through methods such as Mixupcitezhang 2018mixup.
We propose a simple yet effective enhancement for Mixup-based DG, namely domain-invariant Feature mIXup (FIX)
Our approach significantly outperforms nine state-of-the-art related methods, beating the best performing baseline by 6.5% on average in terms of test accuracy.
arXiv Detail & Related papers (2022-11-07T09:38:34Z) - FRIH: Fine-grained Region-aware Image Harmonization [49.420765789360836]
We propose a novel global-local two stages framework for Fine-grained Region-aware Image Harmonization (FRIH)
Our algorithm achieves the best performance on iHarmony4 dataset (PSNR is 38.19 dB) with a lightweight model.
arXiv Detail & Related papers (2022-05-13T04:50:26Z) - Conditional Reconstruction for Open-set Semantic Segmentation [2.7771471571972333]
We propose a novel method called CoReSeg thattackles the issue using class conditional reconstruction of the input images.
It produces better se-mantic consistency in its predictions, resulting in cleanersegmentation maps.
CoRe-Seg outperforms state-of-the-art methods on the Vaihin-gen and Potsdam ISPRS datasets.
arXiv Detail & Related papers (2022-03-02T19:32:29Z) - Layout-to-Image Translation with Double Pooling Generative Adversarial
Networks [76.83075646527521]
We propose a novel Double Pooing GAN (DPGAN) for generating photo-realistic and semantically-consistent results from the input layout.
We also propose a novel Double Pooling Module (DPM), which consists of the Square-shape Pooling Module (SPM) and the Rectangle-shape Pooling Module ( RPM)
arXiv Detail & Related papers (2021-08-29T19:55:14Z) - Similarity-Aware Fusion Network for 3D Semantic Segmentation [87.51314162700315]
We propose a similarity-aware fusion network (SAFNet) to adaptively fuse 2D images and 3D point clouds for 3D semantic segmentation.
We employ a late fusion strategy where we first learn the geometric and contextual similarities between the input and back-projected (from 2D pixels) point clouds.
We show that SAFNet significantly outperforms existing state-of-the-art fusion-based approaches across various data integrity.
arXiv Detail & Related papers (2021-07-04T09:28:18Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.