HS3: Learning with Proper Task Complexity in Hierarchically Supervised
Semantic Segmentation
- URL: http://arxiv.org/abs/2111.02333v1
- Date: Wed, 3 Nov 2021 16:33:29 GMT
- Title: HS3: Learning with Proper Task Complexity in Hierarchically Supervised
Semantic Segmentation
- Authors: Shubhankar Borse, Hong Cai, Yizhe Zhang, Fatih Porikli
- Abstract summary: We propose Hierarchically Supervised Semantic (HS3), a training scheme that supervises intermediate layers in a segmentation network to learn meaningful representations by varying task complexity.
Our proposed HS3-Fuse framework further improves segmentation predictions and achieves state-of-the-art results on two large segmentation benchmarks: NYUD-v2 and Cityscapes.
- Score: 81.87943324048756
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: While deeply supervised networks are common in recent literature, they
typically impose the same learning objective on all transitional layers despite
their varying representation powers.
In this paper, we propose Hierarchically Supervised Semantic Segmentation
(HS3), a training scheme that supervises intermediate layers in a segmentation
network to learn meaningful representations by varying task complexity. To
enforce a consistent performance vs. complexity trade-off throughout the
network, we derive various sets of class clusters to supervise each
transitional layer of the network. Furthermore, we devise a fusion framework,
HS3-Fuse, to aggregate the hierarchical features generated by these layers,
which can provide rich semantic contexts and further enhance the final
segmentation. Extensive experiments show that our proposed HS3 scheme
considerably outperforms vanilla deep supervision with no added inference cost.
Our proposed HS3-Fuse framework further improves segmentation predictions and
achieves state-of-the-art results on two large segmentation benchmarks: NYUD-v2
and Cityscapes.
Related papers
- S3PT: Scene Semantics and Structure Guided Clustering to Boost Self-Supervised Pre-Training for Autonomous Driving [12.406655155106424]
We propose S3PT a novel scene semantics and structure guided clustering to provide more scene-consistent objectives for self-supervised training.
Our contributions are threefold: First, we incorporate semantic distribution consistent clustering to encourage better representation of rare classes such as motorcycles or animals.
Second, we introduce object diversity consistent spatial clustering, to handle imbalanced and diverse object sizes, ranging from large background areas to small objects such as pedestrians and traffic signs.
Third, we propose a depth-guided spatial clustering to regularize learning based on geometric information of the scene, thus further refining region separation on the feature level.
arXiv Detail & Related papers (2024-10-30T15:00:06Z) - Weakly Supervised Semantic Segmentation by Knowledge Graph Inference [11.056545020611397]
This paper introduces a graph reasoning-based approach to enhance Weakly Supervised Semantic (WSSS)
The aim is to improve WSSS holistically by simultaneously enhancing both the multi-label classification and segmentation network stages.
We have achieved state-of-the-art performance in WSSS on the PASCAL VOC 2012 and MS-COCO datasets.
arXiv Detail & Related papers (2023-09-25T11:50:19Z) - SemAffiNet: Semantic-Affine Transformation for Point Cloud Segmentation [94.11915008006483]
We propose SemAffiNet for point cloud semantic segmentation.
We conduct extensive experiments on the ScanNetV2 and NYUv2 datasets.
arXiv Detail & Related papers (2022-05-26T17:00:23Z) - Deep Hierarchical Semantic Segmentation [76.40565872257709]
hierarchical semantic segmentation (HSS) aims at structured, pixel-wise description of visual observation in terms of a class hierarchy.
HSSN casts HSS as a pixel-wise multi-label classification task, only bringing minimal architecture change to current segmentation models.
With hierarchy-induced margin constraints, HSSN reshapes the pixel embedding space, so as to generate well-structured pixel representations.
arXiv Detail & Related papers (2022-03-27T15:47:44Z) - Towards holistic scene understanding: Semantic segmentation and beyond [2.7920304852537536]
This dissertation addresses visual scene understanding and enhances segmentation performance and generalization, training efficiency of networks, and holistic understanding.
First, we investigate semantic segmentation in the context of street scenes and train semantic segmentation networks on combinations of various datasets.
In Chapter 2 we design a framework of hierarchical classifiers over a single convolutional backbone, and train it end-to-end on a combination of pixel-labeled datasets.
In Chapter 3 we propose a weakly-supervised algorithm for training with bounding box-level and image-level supervision instead of only with per-pixel supervision.
arXiv Detail & Related papers (2022-01-16T19:18:11Z) - Three Ways to Improve Semantic Segmentation with Self-Supervised Depth
Estimation [90.87105131054419]
We present a framework for semi-supervised semantic segmentation, which is enhanced by self-supervised monocular depth estimation from unlabeled image sequences.
We validate the proposed model on the Cityscapes dataset, where all three modules demonstrate significant performance gains.
arXiv Detail & Related papers (2020-12-19T21:18:03Z) - A Holistically-Guided Decoder for Deep Representation Learning with
Applications to Semantic Segmentation and Object Detection [74.88284082187462]
One common strategy is to adopt dilated convolutions in the backbone networks to extract high-resolution feature maps.
We propose one novel holistically-guided decoder which is introduced to obtain the high-resolution semantic-rich feature maps.
arXiv Detail & Related papers (2020-12-18T10:51:49Z) - Cross-Attention in Coupled Unmixing Nets for Unsupervised Hyperspectral
Super-Resolution [79.97180849505294]
We propose a novel coupled unmixing network with a cross-attention mechanism, CUCaNet, to enhance the spatial resolution of HSI.
Experiments are conducted on three widely-used HS-MS datasets in comparison with state-of-the-art HSI-SR models.
arXiv Detail & Related papers (2020-07-10T08:08:20Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.