Delving into the Cyclic Mechanism in Semi-supervised Video Object
Segmentation
- URL: http://arxiv.org/abs/2010.12176v1
- Date: Fri, 23 Oct 2020 05:40:53 GMT
- Title: Delving into the Cyclic Mechanism in Semi-supervised Video Object
Segmentation
- Authors: Yuxi Li, Ning Xu, Jinlong Peng, John See, Weiyao Lin
- Abstract summary: A cyclic mechanism is incorporated to the standard semi-supervised process to produce more robust representations.
We introduce a simple gradient correction module, which extends the offline pipeline to an online method.
Finally, we develop cycle effective receptive field (cycle-ERF) based on gradient correction to provide a new perspective into analyzing object-specific regions of interests.
- Score: 37.3336313567187
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: In this paper, we address several inadequacies of current video object
segmentation pipelines. Firstly, a cyclic mechanism is incorporated to the
standard semi-supervised process to produce more robust representations. By
relying on the accurate reference mask in the starting frame, we show that the
error propagation problem can be mitigated. Next, we introduce a simple
gradient correction module, which extends the offline pipeline to an online
method while maintaining the efficiency of the former. Finally we develop cycle
effective receptive field (cycle-ERF) based on gradient correction to provide a
new perspective into analyzing object-specific regions of interests. We conduct
comprehensive experiments on challenging benchmarks of DAVIS17 and Youtube-VOS,
demonstrating that the cyclic mechanism is beneficial to segmentation quality.
Related papers
- Improving Weakly-supervised Video Instance Segmentation by Leveraging Spatio-temporal Consistency [9.115508086522887]
We introduce a weakly-supervised method called Eigen VIS that achieves competitive accuracy compared to other VIS approaches.
This method is based on two key innovations: a Temporal Eigenvalue Loss (TEL) and a clip-level Quality Co-efficient (QCC)
The code is available on https://github.com/farnooshar/EigenVIS.
arXiv Detail & Related papers (2024-08-29T16:05:05Z) - Appearance-Based Refinement for Object-Centric Motion Segmentation [85.2426540999329]
We introduce an appearance-based refinement method that leverages temporal consistency in video streams to correct inaccurate flow-based proposals.
Our approach involves a sequence-level selection mechanism that identifies accurate flow-predicted masks as exemplars.
Its performance is evaluated on multiple video segmentation benchmarks, including DAVIS, YouTube, SegTrackv2, and FBMS-59.
arXiv Detail & Related papers (2023-12-18T18:59:51Z) - Betrayed by Attention: A Simple yet Effective Approach for Self-supervised Video Object Segmentation [76.68301884987348]
We propose a simple yet effective approach for self-supervised video object segmentation (VOS)
Our key insight is that the inherent structural dependencies present in DINO-pretrained Transformers can be leveraged to establish robust-temporal segmentation correspondences in videos.
Our method demonstrates state-of-the-art performance across multiple unsupervised VOS benchmarks and excels in complex real-world multi-object video segmentation tasks.
arXiv Detail & Related papers (2023-11-29T18:47:17Z) - Stable Nonconvex-Nonconcave Training via Linear Interpolation [51.668052890249726]
This paper presents a theoretical analysis of linearahead as a principled method for stabilizing (large-scale) neural network training.
We argue that instabilities in the optimization process are often caused by the nonmonotonicity of the loss landscape and show how linear can help by leveraging the theory of nonexpansive operators.
arXiv Detail & Related papers (2023-10-20T12:45:12Z) - FODVid: Flow-guided Object Discovery in Videos [12.792602427704395]
We focus on building a generalizable solution that avoids overfitting to the individual intricacies.
To solve Video Object (VOS) in an unsupervised setting, we propose a new pipeline (FODVid) based on the idea of guiding segmentation outputs.
arXiv Detail & Related papers (2023-07-10T07:55:42Z) - Exploring the Semi-supervised Video Object Segmentation Problem from a
Cyclic Perspective [36.4057004419079]
In this paper, we place the semi-supervised video object segmentation problem into a cyclic workflow.
We show that a cyclic mechanism incorporated to the standard sequential flow can produce more consistent representations for pixel-wise correspondance.
We also develop cycle effective receptive field (cycle-ERF) based on gradient correction process to provide a new perspective into analyzing object-specific regions of interests.
arXiv Detail & Related papers (2021-11-02T01:50:23Z) - Unsupervised Instance Segmentation in Microscopy Images via Panoptic
Domain Adaptation and Task Re-weighting [86.33696045574692]
We propose a Cycle Consistency Panoptic Domain Adaptive Mask R-CNN (CyC-PDAM) architecture for unsupervised nuclei segmentation in histopathology images.
We first propose a nuclei inpainting mechanism to remove the auxiliary generated objects in the synthesized images.
Secondly, a semantic branch with a domain discriminator is designed to achieve panoptic-level domain adaptation.
arXiv Detail & Related papers (2020-05-05T11:08:26Z) - Self-supervised Equivariant Attention Mechanism for Weakly Supervised
Semantic Segmentation [93.83369981759996]
We propose a self-supervised equivariant attention mechanism (SEAM) to discover additional supervision and narrow the gap.
Our method is based on the observation that equivariance is an implicit constraint in fully supervised semantic segmentation.
We propose consistency regularization on predicted CAMs from various transformed images to provide self-supervision for network learning.
arXiv Detail & Related papers (2020-04-09T14:57:57Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.