Training Semantic Segmentation on Heterogeneous Datasets
- URL: http://arxiv.org/abs/2301.07634v1
- Date: Wed, 18 Jan 2023 16:22:40 GMT
- Title: Training Semantic Segmentation on Heterogeneous Datasets
- Authors: Panagiotis Meletis, Gijs Dubbelman
- Abstract summary: We explore semantic segmentation beyond the conventional, single-dataset homogeneous training.
We propose a unified framework, that incorporates heterogeneous datasets in a single-network training pipeline.
Our framework first curates heterogeneous datasets to bring them into a common format and then trains a single-backbone FCN on all of them simultaneously.
- Score: 5.584060970507507
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: We explore semantic segmentation beyond the conventional, single-dataset
homogeneous training and bring forward the problem of Heterogeneous Training of
Semantic Segmentation (HTSS). HTSS involves simultaneous training on multiple
heterogeneous datasets, i.e. datasets with conflicting label spaces and
different (weak) annotation types from the perspective of semantic
segmentation. The HTSS formulation exposes deep networks to a larger and
previously unexplored aggregation of information that can potentially enhance
semantic segmentation in three directions: i) performance: increased
segmentation metrics on seen datasets, ii) generalization: improved
segmentation metrics on unseen datasets, and iii) knowledgeability: increased
number of recognizable semantic concepts. To research these benefits of HTSS,
we propose a unified framework, that incorporates heterogeneous datasets in a
single-network training pipeline following the established FCN standard. Our
framework first curates heterogeneous datasets to bring them into a common
format and then trains a single-backbone FCN on all of them simultaneously. To
achieve this, it transforms weak annotations, which are incompatible with
semantic segmentation, to per-pixel labels, and hierarchizes their label spaces
into a universal taxonomy. The trained HTSS models demonstrate performance and
generalization gains over a wide range of datasets and extend the inference
label space entailing hundreds of semantic classes.
Related papers
- Frequency-based Matcher for Long-tailed Semantic Segmentation [22.199174076366003]
We focus on a relatively under-explored task setting, long-tailed semantic segmentation (LTSS)
We propose a dual-metric evaluation system and construct the LTSS benchmark to demonstrate the performance of semantic segmentation methods and long-tailed solutions.
We also propose a transformer-based algorithm to improve LTSS, frequency-based matcher, which solves the oversuppression problem by one-to-many matching.
arXiv Detail & Related papers (2024-06-06T09:57:56Z) - Navya3DSeg -- Navya 3D Semantic Segmentation Dataset & split generation
for autonomous vehicles [63.20765930558542]
3D semantic data are useful for core perception tasks such as obstacle detection and ego-vehicle localization.
We propose a new dataset, Navya 3D (Navya3DSeg), with a diverse label space corresponding to a large scale production grade operational domain.
It contains 23 labeled sequences and 25 supplementary sequences without labels, designed to explore self-supervised and semi-supervised semantic segmentation benchmarks on point clouds.
arXiv Detail & Related papers (2023-02-16T13:41:19Z) - Open-world Semantic Segmentation via Contrasting and Clustering
Vision-Language Embedding [95.78002228538841]
We propose a new open-world semantic segmentation pipeline that makes the first attempt to learn to segment semantic objects of various open-world categories without any efforts on dense annotations.
Our method can directly segment objects of arbitrary categories, outperforming zero-shot segmentation methods that require data labeling on three benchmark datasets.
arXiv Detail & Related papers (2022-07-18T09:20:04Z) - Automatic universal taxonomies for multi-domain semantic segmentation [1.4364491422470593]
Training semantic segmentation models on multiple datasets has sparked a lot of recent interest in the computer vision community.
established datasets have mutually incompatible labels which disrupt principled inference in the wild.
We address this issue by automatic construction of universal through iterative dataset integration.
arXiv Detail & Related papers (2022-07-18T08:53:17Z) - SemAffiNet: Semantic-Affine Transformation for Point Cloud Segmentation [94.11915008006483]
We propose SemAffiNet for point cloud semantic segmentation.
We conduct extensive experiments on the ScanNetV2 and NYUv2 datasets.
arXiv Detail & Related papers (2022-05-26T17:00:23Z) - Learning Semantic Segmentation from Multiple Datasets with Label Shifts [101.24334184653355]
This paper proposes UniSeg, an effective approach to automatically train models across multiple datasets with differing label spaces.
Specifically, we propose two losses that account for conflicting and co-occurring labels to achieve better generalization performance in unseen domains.
arXiv Detail & Related papers (2022-02-28T18:55:19Z) - Scaling up Multi-domain Semantic Segmentation with Sentence Embeddings [81.09026586111811]
We propose an approach to semantic segmentation that achieves state-of-the-art supervised performance when applied in a zero-shot setting.
This is achieved by replacing each class label with a vector-valued embedding of a short paragraph that describes the class.
The resulting merged semantic segmentation dataset of over 2 Million images enables training a model that achieves performance equal to that of state-of-the-art supervised methods on 7 benchmark datasets.
arXiv Detail & Related papers (2022-02-04T07:19:09Z) - Improving Semi-Supervised and Domain-Adaptive Semantic Segmentation with
Self-Supervised Depth Estimation [94.16816278191477]
We present a framework for semi-adaptive and domain-supervised semantic segmentation.
It is enhanced by self-supervised monocular depth estimation trained only on unlabeled image sequences.
We validate the proposed model on the Cityscapes dataset.
arXiv Detail & Related papers (2021-08-28T01:33:38Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.