WiCo: Win-win Cooperation of Bottom-up and Top-down Referring Image
Segmentation
- URL: http://arxiv.org/abs/2306.10750v1
- Date: Mon, 19 Jun 2023 07:49:29 GMT
- Title: WiCo: Win-win Cooperation of Bottom-up and Top-down Referring Image
Segmentation
- Authors: Zesen Cheng, Peng Jin, Hao Li, Kehan Li, Siheng Li, Xiangyang Ji,
Chang Liu and Jie Chen
- Abstract summary: We build Win-win Cooperation (WiCo) to exploit complementary nature of two types of methods on both interaction and integration aspects.
With our WiCo, several prominent top-down and bottom-up combinations achieve remarkable improvements on three common datasets with reasonable extra costs.
- Score: 37.53063869243558
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: The top-down and bottom-up methods are two mainstreams of referring
segmentation, while both methods have their own intrinsic weaknesses. Top-down
methods are chiefly disturbed by Polar Negative (PN) errors owing to the lack
of fine-grained cross-modal alignment. Bottom-up methods are mainly perturbed
by Inferior Positive (IP) errors due to the lack of prior object information.
Nevertheless, we discover that two types of methods are highly complementary
for restraining respective weaknesses but the direct average combination leads
to harmful interference. In this context, we build Win-win Cooperation (WiCo)
to exploit complementary nature of two types of methods on both interaction and
integration aspects for achieving a win-win improvement. For the interaction
aspect, Complementary Feature Interaction (CFI) provides fine-grained
information to top-down branch and introduces prior object information to
bottom-up branch for complementary feature enhancement. For the integration
aspect, Gaussian Scoring Integration (GSI) models the gaussian performance
distributions of two branches and weightedly integrates results by sampling
confident scores from the distributions. With our WiCo, several prominent
top-down and bottom-up combinations achieve remarkable improvements on three
common datasets with reasonable extra costs, which justifies effectiveness and
generality of our method.
Related papers
- Dual-Center Graph Clustering with Neighbor Distribution [48.904324854543894]
We propose a novel Dual-Center Graph Clustering (DCGC) approach based on neighbor distribution properties.<n>Our proposed method includes representation learning with neighbor distribution and dual-center optimization.
arXiv Detail & Related papers (2025-07-18T09:17:04Z) - Exploring Generalized Gait Recognition: Reducing Redundancy and Noise within Indoor and Outdoor Datasets [24.242460774158463]
Generalized gait recognition aims to achieve robust performance across diverse domains.<n>Mixed-dataset training is widely used to enhance generalization.<n>We propose a unified framework that systematically improves cross-domain gait recognition.
arXiv Detail & Related papers (2025-05-21T06:46:09Z) - Byzantine-Robust Gossip: Insights from a Dual Approach [15.69624587054777]
This paper investigates Byzantine-resilient algorithms in a decentralized setting, where devices communicate directly with one another.
We provide both global and local clipping rules in the special case of average consensus, with tight convergence guarantees.
arXiv Detail & Related papers (2024-05-06T13:22:54Z) - Unifying Feature and Cost Aggregation with Transformers for Semantic and Visual Correspondence [51.54175067684008]
This paper introduces a Transformer-based integrative feature and cost aggregation network designed for dense matching tasks.
We first show that feature aggregation and cost aggregation exhibit distinct characteristics and reveal the potential for substantial benefits stemming from the judicious use of both aggregation processes.
Our framework is evaluated on standard benchmarks for semantic matching, and also applied to geometric matching, where we show that our approach achieves significant improvements compared to existing methods.
arXiv Detail & Related papers (2024-03-17T07:02:55Z) - A Robust Negative Learning Approach to Partial Domain Adaptation Using
Source Prototypes [0.8895157045883034]
This work proposes a robust Partial Domain Adaptation (PDA) framework that mitigates the negative transfer problem.
It includes diverse, complementary label feedback, alleviating the effect of incorrect feedback and promoting pseudo-label refinement.
We conducted a series of comprehensive experiments, including an ablation analysis, covering a range of partial domain adaptation tasks.
arXiv Detail & Related papers (2023-09-07T07:26:27Z) - Rethinking Clustering-Based Pseudo-Labeling for Unsupervised
Meta-Learning [146.11600461034746]
Method for unsupervised meta-learning, CACTUs, is a clustering-based approach with pseudo-labeling.
This approach is model-agnostic and can be combined with supervised algorithms to learn from unlabeled data.
We prove that the core reason for this is lack of a clustering-friendly property in the embedding space.
arXiv Detail & Related papers (2022-09-27T19:04:36Z) - Cooperative Distribution Alignment via JSD Upper Bound [7.071749623370137]
Unsupervised distribution alignment estimates a transformation that maps two or more source distributions to a shared aligned distribution.
This task has many applications including generative modeling, unsupervised domain adaptation, and socially aware learning.
We propose to unify and generalize previous flow-based approaches under a single non-adversarial framework.
arXiv Detail & Related papers (2022-07-05T20:09:03Z) - Robust Upper Bounds for Adversarial Training [4.971729553254843]
We introduce a new approach to adversarial training by minimizing an upper bound of the adversarial loss.
This bound is based on a holistic expansion of the network instead of separate bounds for each layer.
We derive two new methods with the proposed approach.
arXiv Detail & Related papers (2021-12-17T01:52:35Z) - Semi-supervised Domain Adaptive Structure Learning [72.01544419893628]
Semi-supervised domain adaptation (SSDA) is a challenging problem requiring methods to overcome both 1) overfitting towards poorly annotated data and 2) distribution shift across domains.
We introduce an adaptive structure learning method to regularize the cooperation of SSL and DA.
arXiv Detail & Related papers (2021-12-12T06:11:16Z) - Light Field Saliency Detection with Dual Local Graph Learning
andReciprocative Guidance [148.9832328803202]
We model the infor-mation fusion within focal stack via graph networks.
We build a novel dual graph modelto guide the focal stack fusion process using all-focus pat-terns.
arXiv Detail & Related papers (2021-10-02T00:54:39Z) - Contradictory Structure Learning for Semi-supervised Domain Adaptation [67.89665267469053]
Current adversarial adaptation methods attempt to align the cross-domain features.
Two challenges remain unsolved: 1) the conditional distribution mismatch and 2) the bias of the decision boundary towards the source domain.
We propose a novel framework for semi-supervised domain adaptation by unifying the learning of opposite structures.
arXiv Detail & Related papers (2020-02-06T22:58:20Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.