Enhanced Boundary Learning for Glass-like Object Segmentation
- URL: http://arxiv.org/abs/2103.15734v1
- Date: Mon, 29 Mar 2021 16:18:57 GMT
- Title: Enhanced Boundary Learning for Glass-like Object Segmentation
- Authors: Hao He, Xiangtai Li, Guangliang Cheng, Jianping Shi, Yunhai Tong,
Gaofeng Meng, V\'eronique Prinet, Lubin Weng
- Abstract summary: This paper aims to solve the glass-like object segmentation problem via enhanced boundary learning.
In particular, we first propose a novel refined differential module for generating finer boundary cues.
An edge-aware point-based graph convolution network module is proposed to model the global shape representation along the boundary.
- Score: 55.45473926510806
- License: http://creativecommons.org/licenses/by-sa/4.0/
- Abstract: Glass-like objects such as windows, bottles, and mirrors exist widely in the
real world. Sensing these objects has many applications, including robot
navigation and grasping. However, this task is very challenging due to the
arbitrary scenes behind glass-like objects. This paper aims to solve the
glass-like object segmentation problem via enhanced boundary learning. In
particular, we first propose a novel refined differential module for generating
finer boundary cues. Then an edge-aware point-based graph convolution network
module is proposed to model the global shape representation along the boundary.
Both modules are lightweight and effective, which can be embedded into various
segmentation models. Moreover, we use these two modules to design a decoder to
get accurate segmentation results, especially on the boundary. Extensive
experiments on three recent glass-like object segmentation datasets, including
Trans10k, MSD, and GDD, show that our approach establishes new state-of-the-art
performances. We also offer the generality and superiority of our approach
compared with recent methods on three general segmentation datasets, including
Cityscapes, BDD, and COCO Stuff. Code and models will be available at
(\url{https://github.com/hehao13/EBLNet})
Related papers
- Glass Segmentation with Multi Scales and Primary Prediction Guiding [2.66512000865131]
Glass-like objects can be seen everywhere in our daily life which are hard for existing methods to segment them.
We propose MGNet, which consists of a FineRescaling and Merging module (FRM) to improve the ability to extract semantics.
We supervise the model with a novel loss function with the uncertainty-aware loss to produce high-confidence segmentation maps.
arXiv Detail & Related papers (2024-02-13T16:14:32Z) - RFENet: Towards Reciprocal Feature Evolution for Glass Segmentation [44.45402982171703]
We introduce a Reciprocal Feature Evolution Network (RFENet) for effective glass-like object segmentation.
RFENet achieves state-of-the-art performance on three popular public datasets.
arXiv Detail & Related papers (2023-07-12T11:45:22Z) - Contrastive Lift: 3D Object Instance Segmentation by Slow-Fast
Contrastive Fusion [110.84357383258818]
We propose a novel approach to lift 2D segments to 3D and fuse them by means of a neural field representation.
The core of our approach is a slow-fast clustering objective function, which is scalable and well-suited for scenes with a large number of objects.
Our approach outperforms the state-of-the-art on challenging scenes from the ScanNet, Hypersim, and Replica datasets.
arXiv Detail & Related papers (2023-06-07T17:57:45Z) - A bioinspired three-stage model for camouflaged object detection [8.11866601771984]
We propose a three-stage model that enables coarse-to-fine segmentation in a single iteration.
Our model employs three decoders to sequentially process subsampled features, cropped features, and high-resolution original features.
Our network surpasses state-of-the-art CNN-based counterparts without unnecessary complexities.
arXiv Detail & Related papers (2023-05-22T02:01:48Z) - SupeRGB-D: Zero-shot Instance Segmentation in Cluttered Indoor
Environments [67.34330257205525]
In this work, we explore zero-shot instance segmentation (ZSIS) from RGB-D data to identify unseen objects in a semantic category-agnostic manner.
We present a method that uses annotated objects to learn the objectness'' of pixels and generalize to unseen object categories in cluttered indoor environments.
arXiv Detail & Related papers (2022-12-22T17:59:48Z) - GlassNet: Label Decoupling-based Three-stream Neural Network for Robust
Image Glass Detection [1.1825946875790057]
We exploit label decoupling to decompose the labeled ground-truth (GT) map into an interior-diffusion map and a boundary-diffusion map.
The GT map in collaboration with the two newly generated maps breaks the imbalanced distribution of the object boundary, leading to improved glass detection quality.
We develop an attention-based boundary-aware feature Mosaic module to integrate multi-modal information.
arXiv Detail & Related papers (2021-08-25T08:33:49Z) - Learning to Associate Every Segment for Video Panoptic Segmentation [123.03617367709303]
We learn coarse segment-level matching and fine pixel-level matching together.
We show that our per-frame computation model can achieve new state-of-the-art results on Cityscapes-VPS and VIPER datasets.
arXiv Detail & Related papers (2021-06-17T13:06:24Z) - BoundarySqueeze: Image Segmentation as Boundary Squeezing [104.43159799559464]
We propose a novel method for fine-grained high-quality image segmentation of both objects and scenes.
Inspired by dilation and erosion from morphological image processing techniques, we treat the pixel level segmentation problems as squeezing object boundary.
Our method yields large gains on COCO, Cityscapes, for both instance and semantic segmentation and outperforms previous state-of-the-art PointRend in both accuracy and speed under the same setting.
arXiv Detail & Related papers (2021-05-25T04:58:51Z) - Improving Semantic Segmentation via Decoupled Body and Edge Supervision [89.57847958016981]
Existing semantic segmentation approaches either aim to improve the object's inner consistency by modeling the global context, or refine objects detail along their boundaries by multi-scale feature fusion.
In this paper, a new paradigm for semantic segmentation is proposed.
Our insight is that appealing performance of semantic segmentation requires textitexplicitly modeling the object textitbody and textitedge, which correspond to the high and low frequency of the image.
We show that the proposed framework with various baselines or backbone networks leads to better object inner consistency and object boundaries.
arXiv Detail & Related papers (2020-07-20T12:11:22Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.