Probabilistic Semantic Segmentation Refinement by Monte Carlo Region
Growing
- URL: http://arxiv.org/abs/2005.05856v1
- Date: Tue, 12 May 2020 15:23:57 GMT
- Title: Probabilistic Semantic Segmentation Refinement by Monte Carlo Region
Growing
- Authors: Philipe A. Dias and Henry Medeiros
- Abstract summary: We introduce a fully unsupervised post-processing algorithm that exploits Monte Carlo sampling and pixel similarities to propagate high-confidence pixel labels into regions of low-confidence classification.
Experiments using multiple modern semantic segmentation networks and benchmark datasets demonstrate the effectiveness of our approach for the refinement of segmentation predictions at different levels of coarseness.
- Score: 0.7424262881242935
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Semantic segmentation with fine-grained pixel-level accuracy is a fundamental
component of a variety of computer vision applications. However, despite the
large improvements provided by recent advances in the architectures of
convolutional neural networks, segmentations provided by modern
state-of-the-art methods still show limited boundary adherence. We introduce a
fully unsupervised post-processing algorithm that exploits Monte Carlo sampling
and pixel similarities to propagate high-confidence pixel labels into regions
of low-confidence classification. Our algorithm, which we call probabilistic
Region Growing Refinement (pRGR), is based on a rigorous mathematical
foundation in which clusters are modelled as multivariate normally distributed
sets of pixels. Exploiting concepts of Bayesian estimation and variance
reduction techniques, pRGR performs multiple refinement iterations at varied
receptive fields sizes, while updating cluster statistics to adapt to local
image features. Experiments using multiple modern semantic segmentation
networks and benchmark datasets demonstrate the effectiveness of our approach
for the refinement of segmentation predictions at different levels of
coarseness, as well as the suitability of the variance estimates obtained in
the Monte Carlo iterations as uncertainty measures that are highly correlated
with segmentation accuracy.
Related papers
- A Super-pixel-based Approach to the Stable Interpretation of Neural Networks [20.252282961052945]
We propose a novel pixel strategy to boost the stability and generalizability of gradient-based saliency maps.
We show that the grouping of pixels reduces the variance of the saliency map and improves the generalization behavior of the interpretation method.
arXiv Detail & Related papers (2024-12-19T04:17:32Z) - Reducing Semantic Ambiguity In Domain Adaptive Semantic Segmentation Via Probabilistic Prototypical Pixel Contrast [7.092718945468069]
Domain adaptation aims to reduce the model degradation on the target domain caused by the domain shift between the source and target domains.
Probabilistic proto-typical pixel contrast (PPPC) is a universal adaptation framework that models each pixel embedding as a probability.
PPPC not only helps to address ambiguity at the pixel level, yielding discriminative representations but also significant improvements in both synthetic-to-real and day-to-night adaptation tasks.
arXiv Detail & Related papers (2024-09-27T08:25:03Z) - Learning Invariant Inter-pixel Correlations for Superpixel Generation [12.605604620139497]
Learnable features exhibit constrained discriminative capability, resulting in unsatisfactory pixel grouping performance.
We propose the Content Disentangle Superpixel algorithm to selectively separate the invariant inter-pixel correlations and statistical properties.
The experimental results on four benchmark datasets demonstrate the superiority of our approach to existing state-of-the-art methods.
arXiv Detail & Related papers (2024-02-28T09:46:56Z) - Fine-grained Recognition with Learnable Semantic Data Augmentation [68.48892326854494]
Fine-grained image recognition is a longstanding computer vision challenge.
We propose diversifying the training data at the feature-level to alleviate the discriminative region loss problem.
Our method significantly improves the generalization performance on several popular classification networks.
arXiv Detail & Related papers (2023-09-01T11:15:50Z) - Compound Batch Normalization for Long-tailed Image Classification [77.42829178064807]
We propose a compound batch normalization method based on a Gaussian mixture.
It can model the feature space more comprehensively and reduce the dominance of head classes.
The proposed method outperforms existing methods on long-tailed image classification.
arXiv Detail & Related papers (2022-12-02T07:31:39Z) - Real-Time Scene Text Detection with Differentiable Binarization and
Adaptive Scale Fusion [62.269219152425556]
segmentation-based scene text detection methods have drawn extensive attention in the scene text detection field.
We propose a Differentiable Binarization (DB) module that integrates the binarization process into a segmentation network.
An efficient Adaptive Scale Fusion (ASF) module is proposed to improve the scale robustness by fusing features of different scales adaptively.
arXiv Detail & Related papers (2022-02-21T15:30:14Z) - Semantic Distribution-aware Contrastive Adaptation for Semantic
Segmentation [50.621269117524925]
Domain adaptive semantic segmentation refers to making predictions on a certain target domain with only annotations of a specific source domain.
We present a semantic distribution-aware contrastive adaptation algorithm that enables pixel-wise representation alignment.
We evaluate SDCA on multiple benchmarks, achieving considerable improvements over existing algorithms.
arXiv Detail & Related papers (2021-05-11T13:21:25Z) - Distributional Gaussian Process Layers for Outlier Detection in Image
Segmentation [15.086527565572073]
We propose a parameter efficient Bayesian layer for hierarchical convolutional Gaussian Processes.
Our experiments on brain tissue-segmentation show that the resulting architecture approaches the performance of well-established deterministic segmentation algorithms.
Our uncertainty estimates result in out-of-distribution detection that outperforms the capabilities of previous Bayesian networks.
arXiv Detail & Related papers (2021-04-28T13:37:10Z) - PaDiM: a Patch Distribution Modeling Framework for Anomaly Detection and
Localization [64.39761523935613]
We present a new framework for Patch Distribution Modeling, PaDiM, to concurrently detect and localize anomalies in images.
PaDiM makes use of a pretrained convolutional neural network (CNN) for patch embedding.
It also exploits correlations between the different semantic levels of CNN to better localize anomalies.
arXiv Detail & Related papers (2020-11-17T17:29:18Z) - Diversity inducing Information Bottleneck in Model Ensembles [73.80615604822435]
In this paper, we target the problem of generating effective ensembles of neural networks by encouraging diversity in prediction.
We explicitly optimize a diversity inducing adversarial loss for learning latent variables and thereby obtain diversity in the output predictions necessary for modeling multi-modal data.
Compared to the most competitive baselines, we show significant improvements in classification accuracy, under a shift in the data distribution.
arXiv Detail & Related papers (2020-03-10T03:10:41Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.