Glass Segmentation with Multi Scales and Primary Prediction Guiding
- URL: http://arxiv.org/abs/2402.08571v2
- Date: Wed, 14 Feb 2024 05:23:29 GMT
- Title: Glass Segmentation with Multi Scales and Primary Prediction Guiding
- Authors: Zhiyu Xu and Qingliang Chen
- Abstract summary: Glass-like objects can be seen everywhere in our daily life which are hard for existing methods to segment them.
We propose MGNet, which consists of a FineRescaling and Merging module (FRM) to improve the ability to extract semantics.
We supervise the model with a novel loss function with the uncertainty-aware loss to produce high-confidence segmentation maps.
- Score: 2.66512000865131
- License: http://creativecommons.org/licenses/by-nc-sa/4.0/
- Abstract: Glass-like objects can be seen everywhere in our daily life which are very
hard for existing methods to segment them. The properties of transparencies
pose great challenges of detecting them from the chaotic background and the
vague separation boundaries further impede the acquisition of their exact
contours. Moving machines which ignore glasses have great risks of crashing
into transparent barriers or difficulties in analysing objects reflected in the
mirror, thus it is of substantial significance to accurately locate glass-like
objects and completely figure out their contours. In this paper, inspired by
the scale integration strategy and the refinement method, we proposed a
brand-new network, named as MGNet, which consists of a Fine-Rescaling and
Merging module (FRM) to improve the ability to extract spatially relationship
and a Primary Prediction Guiding module (PPG) to better mine the leftover
semantics from the fused features. Moreover, we supervise the model with a
novel loss function with the uncertainty-aware loss to produce high-confidence
segmentation maps. Unlike the existing glass segmentation models that must be
trained on different settings with respect to varied datasets, our model are
trained under consistent settings and has achieved superior performance on
three popular public datasets. Code is available at
Related papers
- Fourier Boundary Features Network with Wider Catchers for Glass Segmentation [12.465008923418406]
We propose a new method for constraining the segmentation of reflection surface and penetrating glass.
The proposed method yields better segmentation performance compared with the state-of-the-art (SOTA) methods in glass image segmentation.
arXiv Detail & Related papers (2024-05-15T15:52:27Z) - ZoomNeXt: A Unified Collaborative Pyramid Network for Camouflaged Object Detection [70.11264880907652]
Recent object (COD) attempts to segment objects visually blended into their surroundings, which is extremely complex and difficult in real-world scenarios.
We propose an effective unified collaborative pyramid network that mimics human behavior when observing vague images and camouflaged zooming in and out.
Our framework consistently outperforms existing state-of-the-art methods in image and video COD benchmarks.
arXiv Detail & Related papers (2023-10-31T06:11:23Z) - RFENet: Towards Reciprocal Feature Evolution for Glass Segmentation [44.45402982171703]
We introduce a Reciprocal Feature Evolution Network (RFENet) for effective glass-like object segmentation.
RFENet achieves state-of-the-art performance on three popular public datasets.
arXiv Detail & Related papers (2023-07-12T11:45:22Z) - A bioinspired three-stage model for camouflaged object detection [8.11866601771984]
We propose a three-stage model that enables coarse-to-fine segmentation in a single iteration.
Our model employs three decoders to sequentially process subsampled features, cropped features, and high-resolution original features.
Our network surpasses state-of-the-art CNN-based counterparts without unnecessary complexities.
arXiv Detail & Related papers (2023-05-22T02:01:48Z) - Rethinking Range View Representation for LiDAR Segmentation [66.73116059734788]
"Many-to-one" mapping, semantic incoherence, and shape deformation are possible impediments against effective learning from range view projections.
We present RangeFormer, a full-cycle framework comprising novel designs across network architecture, data augmentation, and post-processing.
We show that, for the first time, a range view method is able to surpass the point, voxel, and multi-view fusion counterparts in the competing LiDAR semantic and panoptic segmentation benchmarks.
arXiv Detail & Related papers (2023-03-09T16:13:27Z) - High-resolution Iterative Feedback Network for Camouflaged Object
Detection [128.893782016078]
Spotting camouflaged objects that are visually assimilated into the background is tricky for object detection algorithms.
We aim to extract the high-resolution texture details to avoid the detail degradation that causes blurred vision in edges and boundaries.
We introduce a novel HitNet to refine the low-resolution representations by high-resolution features in an iterative feedback manner.
arXiv Detail & Related papers (2022-03-22T11:20:21Z) - Zoom In and Out: A Mixed-scale Triplet Network for Camouflaged Object
Detection [0.0]
We propose a mixed-scale triplet network, bf ZoomNet, which mimics the behavior of humans when observing vague images.
Specifically, our ZoomNet employs the zoom strategy to learn the discriminative mixed-scale semantics by the designed scale integration unit and hierarchical mixed-scale unit.
Our proposed highly task-friendly model consistently surpasses the existing 23 state-of-the-art methods on four public datasets.
arXiv Detail & Related papers (2022-03-05T09:13:52Z) - Salient Objects in Clutter [130.63976772770368]
This paper identifies and addresses a serious design bias of existing salient object detection (SOD) datasets.
This design bias has led to a saturation in performance for state-of-the-art SOD models when evaluated on existing datasets.
We propose a new high-quality dataset and update the previous saliency benchmark.
arXiv Detail & Related papers (2021-05-07T03:49:26Z) - Enhanced Boundary Learning for Glass-like Object Segmentation [55.45473926510806]
This paper aims to solve the glass-like object segmentation problem via enhanced boundary learning.
In particular, we first propose a novel refined differential module for generating finer boundary cues.
An edge-aware point-based graph convolution network module is proposed to model the global shape representation along the boundary.
arXiv Detail & Related papers (2021-03-29T16:18:57Z) - Progressive Self-Guided Loss for Salient Object Detection [102.35488902433896]
We present a progressive self-guided loss function to facilitate deep learning-based salient object detection in images.
Our framework takes advantage of adaptively aggregated multi-scale features to locate and detect salient objects effectively.
arXiv Detail & Related papers (2021-01-07T07:33:38Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.