Contour-Aware Equipotential Learning for Semantic Segmentation
- URL: http://arxiv.org/abs/2210.00223v1
- Date: Sat, 1 Oct 2022 08:45:44 GMT
- Title: Contour-Aware Equipotential Learning for Semantic Segmentation
- Authors: Xu Yin, Dongbo Min, Yuchi Huo and Sung-Eui Yoon
- Abstract summary: We present the equipotential learning (EPL) method to learn and infer semantic boundaries.
This paper is the first attempt to address the boundary segmentation problem with field regression and contour learning.
The proposed EPL module can benefit the off-the-shelf fully convolutional network models when recognizing semantic boundary areas.
- Score: 35.09077032446148
- License: http://creativecommons.org/licenses/by-nc-nd/4.0/
- Abstract: With increasing demands for high-quality semantic segmentation in the
industry, hard-distinguishing semantic boundaries have posed a significant
threat to existing solutions. Inspired by real-life experience, i.e., combining
varied observations contributes to higher visual recognition confidence, we
present the equipotential learning (EPL) method. This novel module transfers
the predicted/ground-truth semantic labels to a self-defined potential domain
to learn and infer decision boundaries along customized directions. The
conversion to the potential domain is implemented via a lightweight
differentiable anisotropic convolution without incurring any parameter
overhead. Besides, the designed two loss functions, the point loss and the
equipotential line loss implement anisotropic field regression and
category-level contour learning, respectively, enhancing prediction
consistencies in the inter/intra-class boundary areas. More importantly, EPL is
agnostic to network architectures, and thus it can be plugged into most
existing segmentation models. This paper is the first attempt to address the
boundary segmentation problem with field regression and contour learning.
Meaningful performance improvements on Pascal Voc 2012 and Cityscapes
demonstrate that the proposed EPL module can benefit the off-the-shelf fully
convolutional network models when recognizing semantic boundary areas. Besides,
intensive comparisons and analysis show the favorable merits of EPL for
distinguishing semantically-similar and irregular-shaped categories.
Related papers
- Open Set Domain Adaptation with Vision-language models via Gradient-aware Separation [0.6118897979046375]
Open-Set Domain Adaptation (OSDA) confronts the dual challenge of aligning known-class distributions across domains.<n>We propose to harness Contrastive Language-Image Pretraining (CLIP) to address these limitations.
arXiv Detail & Related papers (2025-05-16T12:31:17Z) - SemiDAViL: Semi-supervised Domain Adaptation with Vision-Language Guidance for Semantic Segmentation [9.311853182451289]
We propose a language-guided Semi-supervised Domain Adaptation (SSDA) setting for semantic segmentation.
We harness the semantic generalization capabilities inherent in vision-language models (VLMs) to establish a synergistic framework.
Our approach demonstrates substantial performance improvements over contemporary state-of-the-art (SoTA) methodologies.
arXiv Detail & Related papers (2025-04-08T19:14:34Z) - Prior2Former -- Evidential Modeling of Mask Transformers for Assumption-Free Open-World Panoptic Segmentation [74.55677741919035]
We propose Prior2Former (P2F), the first approach for segmentation vision transformers rooted in evidential learning.<n>P2F extends the mask vision transformer architecture by incorporating a Beta prior for computing model uncertainty in pixel-wise binary mask assignments.<n>Unlike most segmentation models addressing unknown classes, P2F operates without access to OOD data samples or contrastive training on void (i.e., unlabeled) classes.
arXiv Detail & Related papers (2025-04-07T08:53:14Z) - Category-Adaptive Cross-Modal Semantic Refinement and Transfer for Open-Vocabulary Multi-Label Recognition [59.203152078315235]
We propose a novel category-adaptive cross-modal semantic refinement and transfer (C$2$SRT) framework to explore the semantic correlation.
The proposed framework consists of two complementary modules, i.e., intra-category semantic refinement (ISR) module and inter-category semantic transfer (IST) module.
Experiments on OV-MLR benchmarks clearly demonstrate that the proposed C$2$SRT framework outperforms current state-of-the-art algorithms.
arXiv Detail & Related papers (2024-12-09T04:00:18Z) - Physically Feasible Semantic Segmentation [58.17907376475596]
State-of-the-art semantic segmentation models are typically optimized in a data-driven fashion.
Our method, Physically Feasible Semantic (PhyFea), extracts explicit physical constraints that govern spatial class relations.
PhyFea yields significant performance improvements in mIoU over each state-of-the-art network we use.
arXiv Detail & Related papers (2024-08-26T22:39:08Z) - Semantic Connectivity-Driven Pseudo-labeling for Cross-domain
Segmentation [89.41179071022121]
Self-training is a prevailing approach in cross-domain semantic segmentation.
We propose a novel approach called Semantic Connectivity-driven pseudo-labeling.
This approach formulates pseudo-labels at the connectivity level and thus can facilitate learning structured and low-noise semantics.
arXiv Detail & Related papers (2023-12-11T12:29:51Z) - QuadFormer: Quadruple Transformer for Unsupervised Domain Adaptation in
Power Line Segmentation of Aerial Images [12.840195641761323]
We propose a novel framework designed for domain adaptive semantic segmentation.
The hierarchical quadruple transformer combines cross-attention and self-attention mechanisms to adapt transferable context.
We present two datasets - ARPLSyn and ARPLReal - to further advance research in unsupervised domain adaptive powerline segmentation.
arXiv Detail & Related papers (2022-11-29T03:15:27Z) - Distribution Regularized Self-Supervised Learning for Domain Adaptation
of Semantic Segmentation [3.284878354988896]
This paper proposes a pixel-level distribution regularization scheme (DRSL) for self-supervised domain adaptation of semantic segmentation.
In a typical setting, the classification loss forces the semantic segmentation model to greedily learn the representations that capture inter-class variations.
We capture pixel-level intra-class variations through class-aware multi-modal distribution learning.
arXiv Detail & Related papers (2022-06-20T09:52:49Z) - Semi-supervised Domain Adaptive Structure Learning [72.01544419893628]
Semi-supervised domain adaptation (SSDA) is a challenging problem requiring methods to overcome both 1) overfitting towards poorly annotated data and 2) distribution shift across domains.
We introduce an adaptive structure learning method to regularize the cooperation of SSL and DA.
arXiv Detail & Related papers (2021-12-12T06:11:16Z) - HSVA: Hierarchical Semantic-Visual Adaptation for Zero-Shot Learning [74.76431541169342]
Zero-shot learning (ZSL) tackles the unseen class recognition problem, transferring semantic knowledge from seen classes to unseen ones.
We propose a novel hierarchical semantic-visual adaptation (HSVA) framework to align semantic and visual domains.
Experiments on four benchmark datasets demonstrate HSVA achieves superior performance on both conventional and generalized ZSL.
arXiv Detail & Related papers (2021-09-30T14:27:50Z) - Unsupervised Domain Adaptation for Semantic Segmentation via Low-level
Edge Information Transfer [27.64947077788111]
Unsupervised domain adaptation for semantic segmentation aims to make models trained on synthetic data adapt to real images.
Previous feature-level adversarial learning methods only consider adapting models on the high-level semantic features.
We present the first attempt at explicitly using low-level edge information, which has a small inter-domain gap, to guide the transfer of semantic information.
arXiv Detail & Related papers (2021-09-18T11:51:31Z) - InverseForm: A Loss Function for Structured Boundary-Aware Segmentation [80.39674800972182]
We present a novel boundary-aware loss term for semantic segmentation using an inverse-transformation network.
This plug-in loss term complements the cross-entropy loss in capturing boundary transformations.
We analyze the quantitative and qualitative effects of our loss function on three indoor and outdoor segmentation benchmarks.
arXiv Detail & Related papers (2021-04-06T18:52:45Z) - Active Boundary Loss for Semantic Segmentation [58.72057610093194]
This paper proposes a novel active boundary loss for semantic segmentation.
It can progressively encourage the alignment between predicted boundaries and ground-truth boundaries during end-to-end training.
Experimental results show that training with the active boundary loss can effectively improve the boundary F-score and mean Intersection-over-Union.
arXiv Detail & Related papers (2021-02-04T15:47:54Z) - Unsupervised Domain Adaptation in Semantic Segmentation via Orthogonal
and Clustered Embeddings [25.137859989323537]
We propose an effective Unsupervised Domain Adaptation (UDA) strategy, based on a feature clustering method.
We introduce two novel learning objectives to enhance the discriminative clustering performance.
arXiv Detail & Related papers (2020-11-25T10:06:22Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.