Deep Variational Instance Segmentation
- URL: http://arxiv.org/abs/2007.11576v2
- Date: Sat, 24 Oct 2020 06:17:44 GMT
- Title: Deep Variational Instance Segmentation
- Authors: Jialin Yuan, Chao Chen, Li Fuxin
- Abstract summary: State-of-the-art algorithms often employ two separate stages, the first one generating object proposals and the second one recognizing and refining the boundaries.
We propose a novel algorithm that directly utilizes a fully convolutional network (FCN) to predict instance labels.
- Score: 7.334808870313923
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Instance Segmentation, which seeks to obtain both class and instance labels
for each pixel in the input image, is a challenging task in computer vision.
State-of-the-art algorithms often employ two separate stages, the first one
generating object proposals and the second one recognizing and refining the
boundaries. Further, proposals are usually based on detectors such as faster
R-CNN which search for boxes in the entire image exhaustively. In this paper,
we propose a novel algorithm that directly utilizes a fully convolutional
network (FCN) to predict instance labels. Specifically, we propose a
variational relaxation of instance segmentation as minimizing an optimization
functional for a piecewise-constant segmentation problem, which can be used to
train an FCN end-to-end. It extends the classical Mumford-Shah variational
segmentation problem to be able to handle permutation-invariant labels in the
ground truth of instance segmentation. Experiments on PASCAL VOC 2012, Semantic
Boundaries dataset(SBD), and the MSCOCO 2017 dataset show that the proposed
approach efficiently tackle the instance segmentation task. The source code and
trained models will be released with the paper.
Related papers
- Complete Instances Mining for Weakly Supervised Instance Segmentation [6.177842623752537]
We propose a novel approach for weakly supervised instance segmentation (WSIS) using only image-level labels.
We use MaskIoU heads to predict the integrity scores of proposals and a Complete Instances Mining (CIM) strategy to explicitly model the redundant segmentation problem.
Our approach allows the network to become aware of multiple instances and complete instances, and we further improve its robustness through the incorporation of an Anti-noise strategy.
arXiv Detail & Related papers (2024-02-12T13:16:47Z) - Unsupervised Universal Image Segmentation [59.0383635597103]
We propose an Unsupervised Universal model (U2Seg) adept at performing various image segmentation tasks.
U2Seg generates pseudo semantic labels for these segmentation tasks via leveraging self-supervised models.
We then self-train the model on these pseudo semantic labels, yielding substantial performance gains.
arXiv Detail & Related papers (2023-12-28T18:59:04Z) - Synthetic Instance Segmentation from Semantic Image Segmentation Masks [15.477053085267404]
We propose a novel paradigm called Synthetic Instance (SISeg)
SISeg instance segmentation results by leveraging image masks generated by existing semantic segmentation models.
In other words, the proposed model does not need extra manpower or higher computational expenses.
arXiv Detail & Related papers (2023-08-02T05:13:02Z) - Collaborative Propagation on Multiple Instance Graphs for 3D Instance
Segmentation with Single-point Supervision [63.429704654271475]
We propose a novel weakly supervised method RWSeg that only requires labeling one object with one point.
With these sparse weak labels, we introduce a unified framework with two branches to propagate semantic and instance information.
Specifically, we propose a Cross-graph Competing Random Walks (CRW) algorithm that encourages competition among different instance graphs.
arXiv Detail & Related papers (2022-08-10T02:14:39Z) - PointInst3D: Segmenting 3D Instances by Points [136.7261709896713]
We propose a fully-convolutional 3D point cloud instance segmentation method that works in a per-point prediction fashion.
We find the key to its success is assigning a suitable target to each sampled point.
Our approach achieves promising results on both ScanNet and S3DIS benchmarks.
arXiv Detail & Related papers (2022-04-25T02:41:46Z) - Beyond the Prototype: Divide-and-conquer Proxies for Few-shot
Segmentation [63.910211095033596]
Few-shot segmentation aims to segment unseen-class objects given only a handful of densely labeled samples.
We propose a simple yet versatile framework in the spirit of divide-and-conquer.
Our proposed approach, named divide-and-conquer proxies (DCP), allows for the development of appropriate and reliable information.
arXiv Detail & Related papers (2022-04-21T06:21:14Z) - SOLO: A Simple Framework for Instance Segmentation [84.00519148562606]
"instance categories" assigns categories to each pixel within an instance according to the instance's location.
"SOLO" is a simple, direct, and fast framework for instance segmentation with strong performance.
Our approach achieves state-of-the-art results for instance segmentation in terms of both speed and accuracy.
arXiv Detail & Related papers (2021-06-30T09:56:54Z) - Towards unsupervised phone and word segmentation using self-supervised
vector-quantized neural networks [36.23526776206862]
We investigate segmenting and clustering speech into low-bitrate phone-like sequences without supervision.
We specifically constrain pretrained self-supervised vector-quantized (VQ) neural networks so that blocks of contiguous feature vectors are assigned to the same code.
arXiv Detail & Related papers (2020-12-14T14:17:33Z) - 1st Place Solutions for OpenImage2019 -- Object Detection and Instance
Segmentation [116.25081559037872]
This article introduces the solutions of the two champion teams, MMfruit' for the detection track and MMfruitSeg' for the segmentation track, in OpenImage Challenge 2019.
It is commonly known that for an object detector, the shared feature at the end of the backbone is not appropriate for both classification and regression.
We propose the Decoupling Head (DH) to disentangle the object classification and regression via the self-learned optimal feature extraction.
arXiv Detail & Related papers (2020-03-17T06:45:07Z) - Towards Bounding-Box Free Panoptic Segmentation [16.4548904544277]
We introduce a new Bounding-Box Free Network (BBFNet) for panoptic segmentation.
BBFNet predicts coarse watershed levels and uses them to detect large instance candidates where boundaries are well defined.
For smaller instances, whose boundaries are less reliable, BBFNet also predicts instance centers by means of Hough voting followed by mean-shift to reliably detect small objects.
arXiv Detail & Related papers (2020-02-18T16:34:01Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.