Mask Encoding for Single Shot Instance Segmentation
- URL: http://arxiv.org/abs/2003.11712v2
- Date: Wed, 6 May 2020 12:44:26 GMT
- Title: Mask Encoding for Single Shot Instance Segmentation
- Authors: Rufeng Zhang, Zhi Tian, Chunhua Shen, Mingyu You, Youliang Yan
- Abstract summary: We propose a simple singleshot instance segmentation framework, termed mask encoding based instance segmentation (MEInst)
Instead of predicting the two-dimensional mask directly, MEInst distills it into a compact and fixed-dimensional representation vector.
We show that the much simpler and flexible one-stage instance segmentation method, can also achieve competitive performance.
- Score: 97.99956029224622
- License: http://creativecommons.org/licenses/by-nc-sa/4.0/
- Abstract: To date, instance segmentation is dominated by twostage methods, as pioneered
by Mask R-CNN. In contrast, one-stage alternatives cannot compete with Mask
R-CNN in mask AP, mainly due to the difficulty of compactly representing masks,
making the design of one-stage methods very challenging. In this work, we
propose a simple singleshot instance segmentation framework, termed mask
encoding based instance segmentation (MEInst). Instead of predicting the
two-dimensional mask directly, MEInst distills it into a compact and
fixed-dimensional representation vector, which allows the instance segmentation
task to be incorporated into one-stage bounding-box detectors and results in a
simple yet efficient instance segmentation framework. The proposed one-stage
MEInst achieves 36.4% in mask AP with single-model (ResNeXt-101-FPN backbone)
and single-scale testing on the MS-COCO benchmark. We show that the much
simpler and flexible one-stage instance segmentation method, can also achieve
competitive performance. This framework can be easily adapted for other
instance-level recognition tasks. Code is available at:
https://git.io/AdelaiDet
Related papers
- SOLO: A Simple Framework for Instance Segmentation [84.00519148562606]
"instance categories" assigns categories to each pixel within an instance according to the instance's location.
"SOLO" is a simple, direct, and fast framework for instance segmentation with strong performance.
Our approach achieves state-of-the-art results for instance segmentation in terms of both speed and accuracy.
arXiv Detail & Related papers (2021-06-30T09:56:54Z) - SipMask: Spatial Information Preservation for Fast Image and Video
Instance Segmentation [149.242230059447]
We propose a fast single-stage instance segmentation method called SipMask.
It preserves instance-specific spatial information by separating mask prediction of an instance to different sub-regions of a detected bounding-box.
In terms of real-time capabilities, SipMask outperforms YOLACT with an absolute gain of 3.0% (mask AP) under similar settings.
arXiv Detail & Related papers (2020-07-29T12:21:00Z) - CenterMask: single shot instance segmentation with point representation [16.464056972736838]
We propose a single-shot instance segmentation method, which is simple, fast and accurate.
The proposed CenterMask achieves 34.5 mask AP with a speed of 12.3 fps, using a single-model with single-scale training/testing.
Our method can be easily embedded to other one-stage object detectors such as FCOS and performs well.
arXiv Detail & Related papers (2020-04-09T09:35:15Z) - SOLOv2: Dynamic and Fast Instance Segmentation [102.15325936477362]
We build a simple, direct, and fast instance segmentation framework with strong performance.
We take one step further by dynamically learning the mask head of the object segmenter.
We demonstrate a simple direct instance segmentation system, outperforming a few state-of-the-art methods in both speed and accuracy.
arXiv Detail & Related papers (2020-03-23T09:44:21Z) - PointINS: Point-based Instance Segmentation [117.38579097923052]
Mask representation in instance segmentation with Point-of-Interest (PoI) features is challenging because learning a high-dimensional mask feature for each instance requires a heavy computing burden.
We propose an instance-aware convolution, which decomposes this mask representation learning task into two tractable modules.
Along with instance-aware convolution, we propose PointINS, a simple and practical instance segmentation approach.
arXiv Detail & Related papers (2020-03-13T08:24:58Z) - BlendMask: Top-Down Meets Bottom-Up for Instance Segmentation [103.74690082121079]
In this work, we achieve improved mask prediction by effectively combining instance-level information with semantic information with lower-level fine-granularity.
Our main contribution is a blender module which draws inspiration from both top-down and bottom-up instance segmentation approaches.
BlendMask can effectively predict dense per-pixel position-sensitive instance features with very few channels, and learn attention maps for each instance with merely one convolution layer.
arXiv Detail & Related papers (2020-01-02T03:30:17Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.