A Unified Two-Stage Group Semantics Propagation and Contrastive Learning
Network for Co-Saliency Detection
- URL: http://arxiv.org/abs/2208.06615v1
- Date: Sat, 13 Aug 2022 10:14:50 GMT
- Title: A Unified Two-Stage Group Semantics Propagation and Contrastive Learning
Network for Co-Saliency Detection
- Authors: Zhenshan Tan, Cheng Chen, Keyu Wen, Yuzhuo Qin, Xiaodong Gu
- Abstract summary: Two-stage grOup semantics PropagatIon and Contrastive learning NETwork (TopicNet) for CoSOD.
We present a unified Two-stage grOup semantics PropagatIon and Contrastive learning NETwork (TopicNet) for CoSOD.
- Score: 11.101111632948394
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Co-saliency detection (CoSOD) aims at discovering the repetitive salient
objects from multiple images. Two primary challenges are group semantics
extraction and noise object suppression. In this paper, we present a unified
Two-stage grOup semantics PropagatIon and Contrastive learning NETwork
(TopicNet) for CoSOD. TopicNet can be decomposed into two substructures,
including a two-stage group semantics propagation module (TGSP) to address the
first challenge and a contrastive learning module (CLM) to address the second
challenge. Concretely, for TGSP, we design an image-to-group propagation module
(IGP) to capture the consensus representation of intra-group similar features
and a group-to-pixel propagation module (GPP) to build the relevancy of
consensus representation. For CLM, with the design of positive samples, the
semantic consistency is enhanced. With the design of negative samples, the
noise objects are suppressed. Experimental results on three prevailing
benchmarks reveal that TopicNet outperforms other competitors in terms of
various evaluation metrics.
Related papers
- Co-Salient Object Detection with Semantic-Level Consensus Extraction and
Dispersion [27.120768849942145]
Co-salient object detection aims to highlight the common salient object in each image.
We propose a hierarchical Transformer module for extracting semantic-level consensus.
A Transformer-based dispersion module takes into account the variation of the co-salient object in different scenes.
arXiv Detail & Related papers (2023-09-14T14:39:07Z) - Weakly-Supervised Concealed Object Segmentation with SAM-based Pseudo
Labeling and Multi-scale Feature Grouping [40.07070188661184]
Weakly-Supervised Concealed Object (WSCOS) aims to segment objects well blended with surrounding environments.
It is hard to distinguish concealed objects from the background due to the intrinsic similarity.
We propose a new WSCOS method to address these two challenges.
arXiv Detail & Related papers (2023-05-18T14:31:34Z) - USER: Unified Semantic Enhancement with Momentum Contrast for Image-Text
Retrieval [115.28586222748478]
Image-Text Retrieval (ITR) aims at searching for the target instances that are semantically relevant to the given query from the other modality.
Existing approaches typically suffer from two major limitations.
arXiv Detail & Related papers (2023-01-17T12:42:58Z) - GCoNet+: A Stronger Group Collaborative Co-Salient Object Detector [156.43671738038657]
We present a novel end-to-end group collaborative learning network, termed GCoNet+.
GCoNet+ can effectively and efficiently identify co-salient objects in natural scenes.
arXiv Detail & Related papers (2022-05-30T23:49:19Z) - A Unified Transformer Framework for Group-based Segmentation:
Co-Segmentation, Co-Saliency Detection and Video Salient Object Detection [59.21990697929617]
Humans tend to mine objects by learning from a group of images or several frames of video since we live in a dynamic world.
Previous approaches design different networks on similar tasks separately, and they are difficult to apply to each other.
We introduce a unified framework to tackle these issues, term as UFO (UnifiedObject Framework for Co-Object Framework)
arXiv Detail & Related papers (2022-03-09T13:35:19Z) - Clustering by Maximizing Mutual Information Across Views [62.21716612888669]
We propose a novel framework for image clustering that incorporates joint representation learning and clustering.
Our method significantly outperforms state-of-the-art single-stage clustering methods across a variety of image datasets.
arXiv Detail & Related papers (2021-07-24T15:36:49Z) - CoADNet: Collaborative Aggregation-and-Distribution Networks for
Co-Salient Object Detection [91.91911418421086]
Co-Salient Object Detection (CoSOD) aims at discovering salient objects that repeatedly appear in a given query group containing two or more relevant images.
One challenging issue is how to effectively capture co-saliency cues by modeling and exploiting inter-image relationships.
We present an end-to-end collaborative aggregation-and-distribution network (CoADNet) to capture both salient and repetitive visual patterns from multiple images.
arXiv Detail & Related papers (2020-11-10T04:28:11Z) - Pairwise Relation Learning for Semi-supervised Gland Segmentation [90.45303394358493]
We propose a pairwise relation-based semi-supervised (PRS2) model for gland segmentation on histology images.
This model consists of a segmentation network (S-Net) and a pairwise relation network (PR-Net)
We evaluate our model against five recent methods on the GlaS dataset and three recent methods on the CRAG dataset.
arXiv Detail & Related papers (2020-08-06T15:02:38Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.