Progressive Glass Segmentation
- URL: http://arxiv.org/abs/2209.02280v1
- Date: Tue, 6 Sep 2022 08:11:17 GMT
- Title: Progressive Glass Segmentation
- Authors: Letian Yu, Haiyang Mei, Wen Dong, Ziqi Wei, Li Zhu, Yuxin Wang, Xin
Yang
- Abstract summary: Glass does not have its own visual appearances but only transmit/reflect the appearances of its surroundings.
Existing methods typically explore and combine useful cues from different levels of features in the deep network.
We develop a Discriminability Enhancement (DE) module which enables level-specific features to be a more discriminative representation.
- Score: 26.438341615170614
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Glass is very common in the real world. Influenced by the uncertainty about
the glass region and the varying complex scenes behind the glass, the existence
of glass poses severe challenges to many computer vision tasks, making glass
segmentation as an important computer vision task. Glass does not have its own
visual appearances but only transmit/reflect the appearances of its
surroundings, making it fundamentally different from other common objects. To
address such a challenging task, existing methods typically explore and combine
useful cues from different levels of features in the deep network. As there
exists a characteristic gap between level-different features, i.e., deep layer
features embed more high-level semantics and are better at locating the target
objects while shallow layer features have larger spatial sizes and keep richer
and more detailed low-level information, fusing these features naively thus
would lead to a sub-optimal solution. In this paper, we approach the effective
features fusion towards accurate glass segmentation in two steps. First, we
attempt to bridge the characteristic gap between different levels of features
by developing a Discriminability Enhancement (DE) module which enables
level-specific features to be a more discriminative representation, alleviating
the features incompatibility for fusion. Second, we design a
Focus-and-Exploration Based Fusion (FEBF) module to richly excavate useful
information in the fusion process by highlighting the common and exploring the
difference between level-different features.
Related papers
- Instruction-Guided Fusion of Multi-Layer Visual Features in Large Vision-Language Models [50.98559225639266]
We investigate the contributions of visual features from different encoder layers using 18 benchmarks spanning 6 task categories.
Our findings reveal that multilayer features provide complementary strengths with varying task dependencies, and uniform fusion leads to suboptimal performance.
We propose the instruction-guided vision aggregator, a module that dynamically integrates multi-layer visual features based on textual instructions.
arXiv Detail & Related papers (2024-12-26T05:41:31Z) - Glass Segmentation with Multi Scales and Primary Prediction Guiding [2.66512000865131]
Glass-like objects can be seen everywhere in our daily life which are hard for existing methods to segment them.
We propose MGNet, which consists of a FineRescaling and Merging module (FRM) to improve the ability to extract semantics.
We supervise the model with a novel loss function with the uncertainty-aware loss to produce high-confidence segmentation maps.
arXiv Detail & Related papers (2024-02-13T16:14:32Z) - RFENet: Towards Reciprocal Feature Evolution for Glass Segmentation [44.45402982171703]
We introduce a Reciprocal Feature Evolution Network (RFENet) for effective glass-like object segmentation.
RFENet achieves state-of-the-art performance on three popular public datasets.
arXiv Detail & Related papers (2023-07-12T11:45:22Z) - Progressive Multi-scale Consistent Network for Multi-class Fundus Lesion
Segmentation [28.58972084293778]
We propose a progressive multi-scale consistent network (PMCNet) that integrates the proposed progressive feature fusion (PFF) block and dynamic attention block (DAB)
PFF block progressively integrates multi-scale features from adjacent encoding layers, facilitating feature learning of each layer by aggregating fine-grained details and high-level semantics.
DAB is designed to dynamically learn the attentive cues from the fused features at different scales, thus aiming to smooth the essential conflicts existing in multi-scale features.
arXiv Detail & Related papers (2022-05-31T12:10:01Z) - AF$_2$: Adaptive Focus Framework for Aerial Imagery Segmentation [86.44683367028914]
Aerial imagery segmentation has some unique challenges, the most critical one among which lies in foreground-background imbalance.
We propose Adaptive Focus Framework (AF$), which adopts a hierarchical segmentation procedure and focuses on adaptively utilizing multi-scale representations.
AF$ has significantly improved the accuracy on three widely used aerial benchmarks, as fast as the mainstream method.
arXiv Detail & Related papers (2022-02-18T10:14:45Z) - Perception-and-Regulation Network for Salient Object Detection [8.026227647732792]
We propose a novel global attention unit that adaptively regulates the feature fusion process by explicitly modeling interdependencies between features.
The perception part uses the structure of fully-connected layers in classification networks to learn the size and shape of objects.
An imitating eye observation module (IEO) is further employed for improving the global perception ability of the network.
arXiv Detail & Related papers (2021-07-27T02:38:40Z) - Enhanced Boundary Learning for Glass-like Object Segmentation [55.45473926510806]
This paper aims to solve the glass-like object segmentation problem via enhanced boundary learning.
In particular, we first propose a novel refined differential module for generating finer boundary cues.
An edge-aware point-based graph convolution network module is proposed to model the global shape representation along the boundary.
arXiv Detail & Related papers (2021-03-29T16:18:57Z) - Salient Object Detection via Integrity Learning [104.13483971954233]
Integrity is the concept of highlighting all parts that belong to a certain salient object.
To facilitate integrity learning for salient object detection, we design a novel Integrity Cognition Network (ICON)
ICON explores three important components to learn strong integrity features.
arXiv Detail & Related papers (2021-01-19T14:53:12Z) - DFNet: Discriminative feature extraction and integration network for
salient object detection [6.959742268104327]
We focus on two aspects of challenges in saliency detection using Convolutional Neural Networks.
Firstly, since salient objects appear in various sizes, using single-scale convolution would not capture the right size.
Secondly, using multi-level features helps the model use both local and global context.
arXiv Detail & Related papers (2020-04-03T13:56:41Z) - Global Context-Aware Progressive Aggregation Network for Salient Object
Detection [117.943116761278]
We propose a novel network named GCPANet to integrate low-level appearance features, high-level semantic features, and global context features.
We show that the proposed approach outperforms the state-of-the-art methods both quantitatively and qualitatively.
arXiv Detail & Related papers (2020-03-02T04:26:10Z) - Cross-layer Feature Pyramid Network for Salient Object Detection [102.20031050972429]
We propose a novel Cross-layer Feature Pyramid Network to improve the progressive fusion in salient object detection.
The distributed features per layer own both semantics and salient details from all other layers simultaneously, and suffer reduced loss of important information.
arXiv Detail & Related papers (2020-02-25T14:06:27Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.