Contour Loss for Instance Segmentation via k-step Distance
Transformation Image
- URL: http://arxiv.org/abs/2102.10854v1
- Date: Mon, 22 Feb 2021 09:35:35 GMT
- Title: Contour Loss for Instance Segmentation via k-step Distance
Transformation Image
- Authors: Xiaolong Guo, Xiaosong Lan, Kunfeng Wang, Shuxiao Li
- Abstract summary: Instance segmentation aims to locate targets in the image and segment each target area at pixel level.
Mask R-CNN is a classic method of instance segmentation, but its predicted masks are unclear and inaccurate near contours.
We propose a novel loss function, called contour loss, which can assure more accurate instance segmentation.
- Score: 5.02853371403908
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Instance segmentation aims to locate targets in the image and segment each
target area at pixel level, which is one of the most important tasks in
computer vision. Mask R-CNN is a classic method of instance segmentation, but
we find that its predicted masks are unclear and inaccurate near contours. To
cope with this problem, we draw on the idea of contour matching based on
distance transformation image and propose a novel loss function, called contour
loss. Contour loss is designed to specifically optimize the contour parts of
the predicted masks, thus can assure more accurate instance segmentation. In
order to make the proposed contour loss to be jointly trained under modern
neural network frameworks, we design a differentiable k-step distance
transformation image calculation module, which can approximately compute
truncated distance transformation images of the predicted mask and
corresponding ground-truth mask online. The proposed contour loss can be
integrated into existing instance segmentation methods such as Mask R-CNN, and
combined with their original loss functions without modification of the
inference network structures, thus has strong versatility. Experimental results
on COCO show that contour loss is effective, which can further improve instance
segmentation performances.
Related papers
- Contour-weighted loss for class-imbalanced image segmentation [2.183832403223894]
Image segmentation is critically important in almost all medical image analysis for automatic interpretations and processing.
It is often challenging to perform image segmentation due to data imbalance between intra- and inter-class.
We propose a new methodology to address the issue, with a compact yet effective contour-weighted loss function.
arXiv Detail & Related papers (2024-06-07T07:43:52Z) - Rasterized Edge Gradients: Handling Discontinuities Differentiably [25.85191317712521]
We present a novel method for computing gradients at discontinuities for rendering approximations.
Our method elegantly simplifies the traditionally complex problem through a carefully designed approximation strategy.
We showcase our method in human head scene reconstruction, demonstrating handling of camera images and segmentation masks.
arXiv Detail & Related papers (2024-05-03T22:42:00Z) - Variance-insensitive and Target-preserving Mask Refinement for
Interactive Image Segmentation [68.16510297109872]
Point-based interactive image segmentation can ease the burden of mask annotation in applications such as semantic segmentation and image editing.
We introduce a novel method, Variance-Insensitive and Target-Preserving Mask Refinement to enhance segmentation quality with fewer user inputs.
Experiments on GrabCut, Berkeley, SBD, and DAVIS datasets demonstrate our method's state-of-the-art performance in interactive image segmentation.
arXiv Detail & Related papers (2023-12-22T02:31:31Z) - PairingNet: A Learning-based Pair-searching and -matching Network for
Image Fragments [6.694162736590122]
We propose a learning-based image fragment pair-searching and -matching approach to solve the challenging restoration problem.
Our proposed network achieves excellent pair-searching accuracy, reduces matching errors, and significantly reduces computational time.
arXiv Detail & Related papers (2023-12-14T07:43:53Z) - SEMI-PointRend: Improved Semiconductor Wafer Defect Classification and
Segmentation as Rendering [0.31317409221921133]
PointRend is an iterative segmentation algorithm inspired by image rendering in computer graphics.
We show that SEMI-PointRend can outperforms Mask R-CNN by up to 18.8% in terms of segmentation mean average precision.
arXiv Detail & Related papers (2023-02-19T13:12:28Z) - Image Inpainting by End-to-End Cascaded Refinement with Mask Awareness [66.55719330810547]
Inpainting arbitrary missing regions is challenging because learning valid features for various masked regions is nontrivial.
We propose a novel mask-aware inpainting solution that learns multi-scale features for missing regions in the encoding phase.
Our framework is validated both quantitatively and qualitatively via extensive experiments on three public datasets.
arXiv Detail & Related papers (2021-04-28T13:17:47Z) - Image Inpainting with Edge-guided Learnable Bidirectional Attention Maps [85.67745220834718]
We present an edge-guided learnable bidirectional attention map (Edge-LBAM) for improving image inpainting of irregular holes.
Our Edge-LBAM method contains dual procedures,including structure-aware mask-updating guided by predict edges.
Extensive experiments show that our Edge-LBAM is effective in generating coherent image structures and preventing color discrepancy and blurriness.
arXiv Detail & Related papers (2021-04-25T07:25:16Z) - An Empirical Method to Quantify the Peripheral Performance Degradation
in Deep Networks [18.808132632482103]
convolutional neural network (CNN) kernels compound with each convolutional layer.
Deeper and deeper networks combined with stride-based down-sampling means that the propagation of this region can end up covering a non-negligable portion of the image.
Our dataset is constructed by inserting objects into high resolution backgrounds, thereby allowing us to crop sub-images which place target objects at specific locations relative to the image border.
By probing the behaviour of Mask R-CNN across a selection of target locations, we see clear patterns of performance degredation near the image boundary, and in particular in the image corners.
arXiv Detail & Related papers (2020-12-04T18:00:47Z) - Invariant Deep Compressible Covariance Pooling for Aerial Scene
Categorization [80.55951673479237]
We propose a novel invariant deep compressible covariance pooling (IDCCP) to solve nuisance variations in aerial scene categorization.
We conduct extensive experiments on the publicly released aerial scene image data sets and demonstrate the superiority of this method compared with state-of-the-art methods.
arXiv Detail & Related papers (2020-11-11T11:13:07Z) - Category Level Object Pose Estimation via Neural Analysis-by-Synthesis [64.14028598360741]
In this paper we combine a gradient-based fitting procedure with a parametric neural image synthesis module.
The image synthesis network is designed to efficiently span the pose configuration space.
We experimentally show that the method can recover orientation of objects with high accuracy from 2D images alone.
arXiv Detail & Related papers (2020-08-18T20:30:47Z) - Multi-Margin based Decorrelation Learning for Heterogeneous Face
Recognition [90.26023388850771]
This paper presents a deep neural network approach to extract decorrelation representations in a hyperspherical space for cross-domain face images.
The proposed framework can be divided into two components: heterogeneous representation network and decorrelation representation learning.
Experimental results on two challenging heterogeneous face databases show that our approach achieves superior performance on both verification and recognition tasks.
arXiv Detail & Related papers (2020-05-25T07:01:12Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.