ProPanDL: A Modular Architecture for Uncertainty-Aware Panoptic
Segmentation
- URL: http://arxiv.org/abs/2304.08645v1
- Date: Mon, 17 Apr 2023 22:31:23 GMT
- Title: ProPanDL: A Modular Architecture for Uncertainty-Aware Panoptic
Segmentation
- Authors: Jacob Deery, Chang Won Lee, Steven Waslander
- Abstract summary: We introduce ProPanDL, a family of networks capable of uncertainty-aware panoptic segmentation.
We implement and evaluate ProPanDL variants capable of estimating both parametric (Variance Network) and parameter-free (SampleNet) distributions.
Our results demonstrate that ProPanDL is capable of estimating well-calibrated and meaningful output distributions while still retaining strong performance on the base panoptic segmentation task.
- Score: 0.0
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: We introduce ProPanDL, a family of networks capable of uncertainty-aware
panoptic segmentation. Unlike existing segmentation methods, ProPanDL is
capable of estimating full probability distributions for both the semantic and
spatial aspects of panoptic segmentation. We implement and evaluate ProPanDL
variants capable of estimating both parametric (Variance Network) and
parameter-free (SampleNet) distributions quantifying pixel-wise spatial
uncertainty. We couple these approaches with two methods (Temperature Scaling
and Evidential Deep Learning) for semantic uncertainty estimation. To evaluate
the uncertainty-aware panoptic segmentation task, we address limitations with
existing approaches by proposing new metrics that enable separate evaluation of
spatial and semantic uncertainty. We additionally propose the use of the energy
score, a proper scoring rule, for more robust evaluation of spatial output
distributions. Using these metrics, we conduct an extensive evaluation of
ProPanDL variants. Our results demonstrate that ProPanDL is capable of
estimating well-calibrated and meaningful output distributions while still
retaining strong performance on the base panoptic segmentation task.
Related papers
- PCF-Lift: Panoptic Lifting by Probabilistic Contrastive Fusion [80.79938369319152]
We design a new pipeline coined PCF-Lift based on our Probabilis-tic Contrastive Fusion (PCF)
Our PCF-lift not only significantly outperforms the state-of-the-art methods on widely used benchmarks including the ScanNet dataset and the Messy Room dataset (4.4% improvement of scene-level PQ)
arXiv Detail & Related papers (2024-10-14T16:06:59Z) - NP-SemiSeg: When Neural Processes meet Semi-Supervised Semantic
Segmentation [87.50830107535533]
Semi-supervised semantic segmentation involves assigning pixel-wise labels to unlabeled images at training time.
Current approaches to semi-supervised semantic segmentation work by predicting pseudo-labels for each pixel from a class-wise probability distribution output by a model.
In this work, we move one step forward by adapting NPs to semi-supervised semantic segmentation, resulting in a new model called NP-SemiSeg.
arXiv Detail & Related papers (2023-08-05T12:42:15Z) - Boosting Semi-Supervised Semantic Segmentation with Probabilistic
Representations [30.672426195148496]
We propose a Probabilistic Representation Contrastive Learning framework to improve representation quality.
We define pixel-wise representations from a new perspective of probability theory.
We also propose to regularize the distribution variance to enhance the reliability of representations.
arXiv Detail & Related papers (2022-10-26T12:47:29Z) - Uncertainty-aware LiDAR Panoptic Segmentation [21.89063036529791]
We introduce a novel approach for solving the task of uncertainty-aware panoptic segmentation using LiDAR point clouds.
Our proposed EvLPSNet network is the first to solve this task efficiently in a sampling-free manner.
We provide several strong baselines combining state-of-the-art panoptic segmentation networks with sampling-free uncertainty estimation techniques.
arXiv Detail & Related papers (2022-10-10T07:54:57Z) - Uncertainty-aware Panoptic Segmentation [21.89063036529791]
We introduce the novel task of uncertainty-aware panoptic segmentation.
It aims to predict per-pixel semantic and instance segmentations, together with per-pixel uncertainty estimates.
We propose the novel top-down Evidential Panoptic Network (EvPSNet) to solve this task.
arXiv Detail & Related papers (2022-06-29T12:07:21Z) - Reinforcement Learning with a Terminator [80.34572413850186]
We learn the parameters of the TerMDP and leverage the structure of the estimation problem to provide state-wise confidence bounds.
We use these to construct a provably-efficient algorithm, which accounts for termination, and bound its regret.
arXiv Detail & Related papers (2022-05-30T18:40:28Z) - Hypernet-Ensemble Learning of Segmentation Probability for Medical Image
Segmentation with Ambiguous Labels [8.841870931360585]
Deep Learning approaches are notoriously overconfident about their prediction with highly polarized label probability.
This is often not desirable for many applications with the inherent label ambiguity even in human annotations.
We propose novel methods to improve the segmentation probability estimation without sacrificing performance in a real-world scenario.
arXiv Detail & Related papers (2021-12-13T14:24:53Z) - Attentional Prototype Inference for Few-Shot Segmentation [128.45753577331422]
We propose attentional prototype inference (API), a probabilistic latent variable framework for few-shot segmentation.
We define a global latent variable to represent the prototype of each object category, which we model as a probabilistic distribution.
We conduct extensive experiments on four benchmarks, where our proposal obtains at least competitive and often better performance than state-of-the-art prototype-based methods.
arXiv Detail & Related papers (2021-05-14T06:58:44Z) - Dive into Ambiguity: Latent Distribution Mining and Pairwise Uncertainty
Estimation for Facial Expression Recognition [59.52434325897716]
We propose a solution, named DMUE, to address the problem of annotation ambiguity from two perspectives.
For the former, an auxiliary multi-branch learning framework is introduced to better mine and describe the latent distribution in the label space.
For the latter, the pairwise relationship of semantic feature between instances are fully exploited to estimate the ambiguity extent in the instance space.
arXiv Detail & Related papers (2021-04-01T03:21:57Z) - Inter-class Discrepancy Alignment for Face Recognition [55.578063356210144]
We propose a unified framework calledInter-class DiscrepancyAlignment(IDA)
IDA-DAO is used to align the similarity scores considering the discrepancy between the images and its neighbors.
IDA-SSE can provide convincing inter-class neighbors by introducing virtual candidate images generated with GAN.
arXiv Detail & Related papers (2021-03-02T08:20:08Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.