BoxInst: High-Performance Instance Segmentation with Box Annotations
- URL: http://arxiv.org/abs/2012.02310v1
- Date: Thu, 3 Dec 2020 22:27:55 GMT
- Title: BoxInst: High-Performance Instance Segmentation with Box Annotations
- Authors: Zhi Tian, Chunhua Shen, Xinlong Wang, Hao Chen
- Abstract summary: We present a high-performance method that can achieve mask-level instance segmentation with only bounding-box annotations for training.
Our core idea is to exploit the loss of learning masks in instance segmentation, with no modification to the segmentation network itself.
- Score: 102.10713189544947
- License: http://creativecommons.org/licenses/by-nc-nd/4.0/
- Abstract: We present a high-performance method that can achieve mask-level instance
segmentation with only bounding-box annotations for training. While this
setting has been studied in the literature, here we show significantly stronger
performance with a simple design (e.g., dramatically improving previous best
reported mask AP of 21.1% in Hsu et al. (2019) to 31.6% on the COCO dataset).
Our core idea is to redesign the loss of learning masks in instance
segmentation, with no modification to the segmentation network itself. The new
loss functions can supervise the mask training without relying on mask
annotations. This is made possible with two loss terms, namely, 1) a surrogate
term that minimizes the discrepancy between the projections of the ground-truth
box and the predicted mask; 2) a pairwise loss that can exploit the prior that
proximal pixels with similar colors are very likely to have the same category
label. Experiments demonstrate that the redesigned mask loss can yield
surprisingly high-quality instance masks with only box annotations. For
example, without using any mask annotations, with a ResNet-101 backbone and 3x
training schedule, we achieve 33.2% mask AP on COCO test-dev split (vs. 39.1%
of the fully supervised counterpart). Our excellent experiment results on COCO
and Pascal VOC indicate that our method dramatically narrows the performance
gap between weakly and fully supervised instance segmentation.
Code is available at: https://git.io/AdelaiDet
Related papers
- Mask Transfiner for High-Quality Instance Segmentation [95.74244714914052]
We present Mask Transfiner for high-quality and efficient instance segmentation.
Our approach only processes detected error-prone tree nodes and self-corrects their errors in parallel.
Our code and trained models will be available at http://vis.xyz/pub/transfiner.
arXiv Detail & Related papers (2021-11-26T18:58:22Z) - Mask is All You Need: Rethinking Mask R-CNN for Dense and
Arbitrary-Shaped Scene Text Detection [11.390163890611246]
Mask R-CNN is widely adopted as a strong baseline for arbitrary-shaped scene text detection and spotting.
There may exist multiple instances in one proposal, which makes it difficult for the mask head to distinguish different instances and degrades the performance.
We propose instance-aware mask learning in which the mask head learns to predict the shape of the whole instance rather than classify each pixel to text or non-text.
arXiv Detail & Related papers (2021-09-08T04:32:29Z) - Real-time Instance Segmentation with Discriminative Orientation Maps [0.16311150636417257]
We propose a real-time instance segmentation framework termed OrienMask.
A mask head is added to predict some discriminative orientation maps.
All instances that match with the same anchor size share a common orientation map.
arXiv Detail & Related papers (2021-06-23T07:27:35Z) - Mask Encoding for Single Shot Instance Segmentation [97.99956029224622]
We propose a simple singleshot instance segmentation framework, termed mask encoding based instance segmentation (MEInst)
Instead of predicting the two-dimensional mask directly, MEInst distills it into a compact and fixed-dimensional representation vector.
We show that the much simpler and flexible one-stage instance segmentation method, can also achieve competitive performance.
arXiv Detail & Related papers (2020-03-26T02:51:17Z) - SOLOv2: Dynamic and Fast Instance Segmentation [102.15325936477362]
We build a simple, direct, and fast instance segmentation framework with strong performance.
We take one step further by dynamically learning the mask head of the object segmenter.
We demonstrate a simple direct instance segmentation system, outperforming a few state-of-the-art methods in both speed and accuracy.
arXiv Detail & Related papers (2020-03-23T09:44:21Z) - PointINS: Point-based Instance Segmentation [117.38579097923052]
Mask representation in instance segmentation with Point-of-Interest (PoI) features is challenging because learning a high-dimensional mask feature for each instance requires a heavy computing burden.
We propose an instance-aware convolution, which decomposes this mask representation learning task into two tractable modules.
Along with instance-aware convolution, we propose PointINS, a simple and practical instance segmentation approach.
arXiv Detail & Related papers (2020-03-13T08:24:58Z) - BlendMask: Top-Down Meets Bottom-Up for Instance Segmentation [103.74690082121079]
In this work, we achieve improved mask prediction by effectively combining instance-level information with semantic information with lower-level fine-granularity.
Our main contribution is a blender module which draws inspiration from both top-down and bottom-up instance segmentation approaches.
BlendMask can effectively predict dense per-pixel position-sensitive instance features with very few channels, and learn attention maps for each instance with merely one convolution layer.
arXiv Detail & Related papers (2020-01-02T03:30:17Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.