Meticulous Object Segmentation
- URL: http://arxiv.org/abs/2012.07181v1
- Date: Sun, 13 Dec 2020 23:38:40 GMT
- Title: Meticulous Object Segmentation
- Authors: Chenglin Yang, Yilin Wang, Jianming Zhang, He Zhang, Zhe Lin, Alan
Yuille
- Abstract summary: We propose and study a task named Meticulous Object segmentation (MOS)
MeticulousNet leverages a dedicated decoder to capture the object boundary details.
We provide empirical evidence showing that MeticulousNet can reveal pixel-accurate segmentation boundaries.
- Score: 37.48446050876045
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Compared with common image segmentation tasks targeted at low-resolution
images, higher resolution detailed image segmentation receives much less
attention. In this paper, we propose and study a task named Meticulous Object
Segmentation (MOS), which is focused on segmenting well-defined foreground
objects with elaborate shapes in high resolution images (e.g. 2k - 4k). To this
end, we propose the MeticulousNet which leverages a dedicated decoder to
capture the object boundary details. Specifically, we design a Hierarchical
Point-wise Refining (HierPR) block to better delineate object boundaries, and
reformulate the decoding process as a recursive coarse to fine refinement of
the object mask. To evaluate segmentation quality near object boundaries, we
propose the Meticulosity Quality (MQ) score considering both the mask coverage
and boundary precision. In addition, we collect a MOS benchmark dataset
including 600 high quality images with complex objects. We provide
comprehensive empirical evidence showing that MeticulousNet can reveal
pixel-accurate segmentation boundaries and is superior to state-of-the-art
methods for high resolution object segmentation tasks.
Related papers
- 1st Place Solution for MOSE Track in CVPR 2024 PVUW Workshop: Complex Video Object Segmentation [72.54357831350762]
We propose a semantic embedding video object segmentation model and use the salient features of objects as query representations.
We trained our model on a large-scale video object segmentation dataset.
Our model achieves first place (textbf84.45%) in the test set of Complex Video Object Challenge.
arXiv Detail & Related papers (2024-06-07T03:13:46Z) - Referring Camouflaged Object Detection [97.90911862979355]
Ref-COD aims to segment specified camouflaged objects based on a small set of referring images with salient target objects.
We first assemble a large-scale dataset, called R2C7K, which consists of 7K images covering 64 object categories in real-world scenarios.
arXiv Detail & Related papers (2023-06-13T04:15:37Z) - High-Quality Entity Segmentation [110.55724145851725]
CropFormer is designed to tackle the intractability of instance-level segmentation on high-resolution images.
It improves mask prediction by fusing high-res image crops that provide more fine-grained image details and the full image.
With CropFormer, we achieve a significant AP gain of $1.9$ on the challenging entity segmentation task.
arXiv Detail & Related papers (2022-11-10T18:58:22Z) - BoundarySqueeze: Image Segmentation as Boundary Squeezing [104.43159799559464]
We propose a novel method for fine-grained high-quality image segmentation of both objects and scenes.
Inspired by dilation and erosion from morphological image processing techniques, we treat the pixel level segmentation problems as squeezing object boundary.
Our method yields large gains on COCO, Cityscapes, for both instance and semantic segmentation and outperforms previous state-of-the-art PointRend in both accuracy and speed under the same setting.
arXiv Detail & Related papers (2021-05-25T04:58:51Z) - Superpixel-based Refinement for Object Proposal Generation [3.1981440103815717]
We introduce a new superpixel-based refinement approach on top of the state-of-the-art object proposal system AttentionMask.
Our experiments show an improvement of up to 26.4% in terms of average recall compared to original AttentionMask.
arXiv Detail & Related papers (2021-01-12T16:06:48Z) - AinnoSeg: Panoramic Segmentation with High Perfomance [4.867465475957119]
Current panoramic segmentation algorithms are more concerned with context semantics, but the details of image are not processed enough.
Aiming to address these issues, this paper presents some useful tricks.
All these operations named AinnoSeg, AinnoSeg can achieve state-of-art performance on the well-known dataset ADE20K.
arXiv Detail & Related papers (2020-07-21T04:16:46Z) - Improving Semantic Segmentation via Decoupled Body and Edge Supervision [89.57847958016981]
Existing semantic segmentation approaches either aim to improve the object's inner consistency by modeling the global context, or refine objects detail along their boundaries by multi-scale feature fusion.
In this paper, a new paradigm for semantic segmentation is proposed.
Our insight is that appealing performance of semantic segmentation requires textitexplicitly modeling the object textitbody and textitedge, which correspond to the high and low frequency of the image.
We show that the proposed framework with various baselines or backbone networks leads to better object inner consistency and object boundaries.
arXiv Detail & Related papers (2020-07-20T12:11:22Z) - A Multi-Level Approach to Waste Object Segmentation [10.20384144853726]
We address the problem of localizing waste objects from a color image and an optional depth image.
Our method integrates the intensity and depth information at multiple levels of spatial granularity.
We create a new RGBD waste object segmentation, MJU-Waste, that is made public to facilitate future research in this area.
arXiv Detail & Related papers (2020-07-08T16:49:25Z) - Revisiting Sequence-to-Sequence Video Object Segmentation with
Multi-Task Loss and Skip-Memory [4.343892430915579]
Video Object (VOS) is an active research area of the visual domain.
Current approaches lose objects in longer sequences, especially when the object is small or briefly occluded.
We build upon a sequence-to-sequence approach that employs an encoder-decoder architecture together with a memory module for exploiting the sequential data.
arXiv Detail & Related papers (2020-04-25T15:38:09Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.