Global and Local Features through Gaussian Mixture Models on Image
Semantic Segmentation
- URL: http://arxiv.org/abs/2207.09162v1
- Date: Tue, 19 Jul 2022 10:10:49 GMT
- Title: Global and Local Features through Gaussian Mixture Models on Image
Semantic Segmentation
- Authors: Darwin Saire and Ad\'in Ram\'irez Rivera
- Abstract summary: We propose an internal structure for the feature representations while extracting a global representation that supports the former.
During training, we predict a Gaussian Mixture Model from the data, which, merged with the skip connections and the decoding stage, helps avoid wrong inductive biases.
Our results show that we can improve semantic segmentation by providing both learning representations (global and local) with a clustering behavior and combining them.
- Score: 0.38073142980732994
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: The semantic segmentation task aims at dense classification at the pixel-wise
level. Deep models exhibited progress in tackling this task. However, one
remaining problem with these approaches is the loss of spatial precision, often
produced at the segmented objects' boundaries. Our proposed model addresses
this problem by providing an internal structure for the feature representations
while extracting a global representation that supports the former. To fit the
internal structure, during training, we predict a Gaussian Mixture Model from
the data, which, merged with the skip connections and the decoding stage, helps
avoid wrong inductive biases. Furthermore, our results show that we can improve
semantic segmentation by providing both learning representations (global and
local) with a clustering behavior and combining them. Finally, we present
results demonstrating our advances in Cityscapes and Synthia datasets.
Related papers
- Mesh Denoising Transformer [104.5404564075393]
Mesh denoising is aimed at removing noise from input meshes while preserving their feature structures.
SurfaceFormer is a pioneering Transformer-based mesh denoising framework.
New representation known as Local Surface Descriptor captures local geometric intricacies.
Denoising Transformer module receives the multimodal information and achieves efficient global feature aggregation.
arXiv Detail & Related papers (2024-05-10T15:27:43Z) - Synthetic-to-Real Domain Generalized Semantic Segmentation for 3D Indoor
Point Clouds [69.64240235315864]
This paper introduces the synthetic-to-real domain generalization setting to this task.
The domain gap between synthetic and real-world point cloud data mainly lies in the different layouts and point patterns.
Experiments on the synthetic-to-real benchmark demonstrate that both CINMix and multi-prototypes can narrow the distribution gap.
arXiv Detail & Related papers (2022-12-09T05:07:43Z) - Towards Understanding and Mitigating Dimensional Collapse in Heterogeneous Federated Learning [112.69497636932955]
Federated learning aims to train models across different clients without the sharing of data for privacy considerations.
We study how data heterogeneity affects the representations of the globally aggregated models.
We propose sc FedDecorr, a novel method that can effectively mitigate dimensional collapse in federated learning.
arXiv Detail & Related papers (2022-10-01T09:04:17Z) - Fully Self-Supervised Learning for Semantic Segmentation [46.6602159197283]
We present a fully self-supervised framework for semantic segmentation(FS4).
We propose a bootstrapped training scheme for semantic segmentation, which fully leveraged the global semantic knowledge for self-supervision.
We evaluate our method on the large-scale COCO-Stuff dataset and achieved 7.19 mIoU improvements on both things and stuff objects.
arXiv Detail & Related papers (2022-02-24T09:38:22Z) - Global Aggregation then Local Distribution for Scene Parsing [99.1095068574454]
We show that our approach can be modularized as an end-to-end trainable block and easily plugged into existing semantic segmentation networks.
Our approach allows us to build new state of the art on major semantic segmentation benchmarks including Cityscapes, ADE20K, Pascal Context, Camvid and COCO-stuff.
arXiv Detail & Related papers (2021-07-28T03:46:57Z) - Denoise and Contrast for Category Agnostic Shape Completion [48.66519783934386]
We present a deep learning model that exploits the power of self-supervision to perform 3D point cloud completion.
A denoising pretext task provides the network with the needed local cues, decoupled from the high-level semantics.
contrastive learning maximizes the agreement between variants of the same shape with different missing portions.
arXiv Detail & Related papers (2021-03-30T20:33:24Z) - Boosting Semi-supervised Image Segmentation with Global and Local Mutual
Information Regularization [9.994508738317585]
We present a novel semi-supervised segmentation method that leverages mutual information (MI) on categorical distributions.
We evaluate the method on three challenging publicly-available datasets for medical image segmentation.
arXiv Detail & Related papers (2021-03-08T15:13:25Z) - Group-Wise Semantic Mining for Weakly Supervised Semantic Segmentation [49.90178055521207]
This work addresses weakly supervised semantic segmentation (WSSS), with the goal of bridging the gap between image-level annotations and pixel-level segmentation.
We formulate WSSS as a novel group-wise learning task that explicitly models semantic dependencies in a group of images to estimate more reliable pseudo ground-truths.
In particular, we devise a graph neural network (GNN) for group-wise semantic mining, wherein input images are represented as graph nodes.
arXiv Detail & Related papers (2020-12-09T12:40:13Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.