CycleSegNet: Object Co-segmentation with Cycle Refinement and Region
Correspondence
- URL: http://arxiv.org/abs/2101.01308v1
- Date: Tue, 5 Jan 2021 01:35:19 GMT
- Title: CycleSegNet: Object Co-segmentation with Cycle Refinement and Region
Correspondence
- Authors: Guankai Li, Chi Zhang, Guosheng Lin
- Abstract summary: CycleSegNet is a novel framework for the co-segmentation task.
Our proposed method significantly outperforms the existing networks and achieves new state-of-the-art performance.
- Score: 44.811987693952965
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Image co-segmentation is an active computer vision task which aims to segment
the common objects in a set of images. Recently, researchers design various
learning-based algorithms to handle the co-segmentation task. The main
difficulty in this task is how to effectively transfer information between
images to infer the common object regions. In this paper, we present
CycleSegNet, a novel framework for the co-segmentation task. Our network design
has two key components: a region correspondence module which is the basic
operation for exchanging information between local image regions, and a cycle
refinement module which utilizes ConvLSTMs to progressively update image
embeddings and exchange information in a cycle manner. Experiment results on
four popular benchmark datasets -- PASCAL VOC dataset, MSRC dataset, Internet
dataset and iCoseg dataset demonstrate that our proposed method significantly
outperforms the existing networks and achieves new state-of-the-art
performance.
Related papers
- Rotated Multi-Scale Interaction Network for Referring Remote Sensing Image Segmentation [63.15257949821558]
Referring Remote Sensing Image (RRSIS) is a new challenge that combines computer vision and natural language processing.
Traditional Referring Image (RIS) approaches have been impeded by the complex spatial scales and orientations found in aerial imagery.
We introduce the Rotated Multi-Scale Interaction Network (RMSIN), an innovative approach designed for the unique demands of RRSIS.
arXiv Detail & Related papers (2023-12-19T08:14:14Z) - COMNet: Co-Occurrent Matching for Weakly Supervised Semantic
Segmentation [13.244183864948848]
We propose a novel Co-Occurrent Matching Network (COMNet), which can promote the quality of the CAMs and enforce the network to pay attention to the entire parts of objects.
Specifically, we perform inter-matching on paired images that contain common classes to enhance the corresponded areas, and construct intra-matching on a single image to propagate the semantic features across the object regions.
The experiments on the Pascal VOC 2012 and MS-COCO datasets show that our network can effectively boost the performance of the baseline model and achieve new state-of-the-art performance.
arXiv Detail & Related papers (2023-09-29T03:55:24Z) - Deep Attention Unet: A Network Model with Global Feature Perception
Ability [12.087640144194246]
This paper proposes a new type of UNet image segmentation algorithm based on channel self attention mechanism and residual connection called.
In my experiment, the new network model improved mIOU by 2.48% compared to traditional UNet on the FoodNet dataset.
arXiv Detail & Related papers (2023-04-21T09:12:29Z) - CRCNet: Few-shot Segmentation with Cross-Reference and Region-Global
Conditional Networks [59.85183776573642]
Few-shot segmentation aims to learn a segmentation model that can be generalized to novel classes with only a few training images.
We propose a Cross-Reference and Local-Global Networks (CRCNet) for few-shot segmentation.
Our network can better find the co-occurrent objects in the two images with a cross-reference mechanism.
arXiv Detail & Related papers (2022-08-23T06:46:18Z) - Few-shot Segmentation with Optimal Transport Matching and Message Flow [50.9853556696858]
It is essential for few-shot semantic segmentation to fully utilize the support information.
We propose a Correspondence Matching Network (CMNet) with an Optimal Transport Matching module.
Experiments on PASCAL VOC 2012, MS COCO, and FSS-1000 datasets show that our network achieves new state-of-the-art few-shot segmentation performance.
arXiv Detail & Related papers (2021-08-19T06:26:11Z) - Interactive Video Object Segmentation Using Global and Local Transfer
Modules [51.93009196085043]
We develop a deep neural network, which consists of the annotation network (A-Net) and the transfer network (T-Net)
Given user scribbles on a frame, A-Net yields a segmentation result based on the encoder-decoder architecture.
We train the entire network in two stages, by emulating user scribbles and employing an auxiliary loss.
arXiv Detail & Related papers (2020-07-16T06:49:07Z) - CRNet: Cross-Reference Networks for Few-Shot Segmentation [59.85183776573642]
Few-shot segmentation aims to learn a segmentation model that can be generalized to novel classes with only a few training images.
With a cross-reference mechanism, our network can better find the co-occurrent objects in the two images.
Experiments on the PASCAL VOC 2012 dataset show that our network achieves state-of-the-art performance.
arXiv Detail & Related papers (2020-03-24T04:55:43Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.