CoADNet: Collaborative Aggregation-and-Distribution Networks for
Co-Salient Object Detection
- URL: http://arxiv.org/abs/2011.04887v1
- Date: Tue, 10 Nov 2020 04:28:11 GMT
- Title: CoADNet: Collaborative Aggregation-and-Distribution Networks for
Co-Salient Object Detection
- Authors: Qijian Zhang, Runmin Cong, Junhui Hou, Chongyi Li, Yao Zhao
- Abstract summary: Co-Salient Object Detection (CoSOD) aims at discovering salient objects that repeatedly appear in a given query group containing two or more relevant images.
One challenging issue is how to effectively capture co-saliency cues by modeling and exploiting inter-image relationships.
We present an end-to-end collaborative aggregation-and-distribution network (CoADNet) to capture both salient and repetitive visual patterns from multiple images.
- Score: 91.91911418421086
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Co-Salient Object Detection (CoSOD) aims at discovering salient objects that
repeatedly appear in a given query group containing two or more relevant
images. One challenging issue is how to effectively capture co-saliency cues by
modeling and exploiting inter-image relationships. In this paper, we present an
end-to-end collaborative aggregation-and-distribution network (CoADNet) to
capture both salient and repetitive visual patterns from multiple images.
First, we integrate saliency priors into the backbone features to suppress the
redundant background information through an online intra-saliency guidance
structure. After that, we design a two-stage aggregate-and-distribute
architecture to explore group-wise semantic interactions and produce the
co-saliency features. In the first stage, we propose a group-attentional
semantic aggregation module that models inter-image relationships to generate
the group-wise semantic representations. In the second stage, we propose a
gated group distribution module that adaptively distributes the learned group
semantics to different individuals in a dynamic gating mechanism. Finally, we
develop a group consistency preserving decoder tailored for the CoSOD task,
which maintains group constraints during feature decoding to predict more
consistent full-resolution co-saliency maps. The proposed CoADNet is evaluated
on four prevailing CoSOD benchmark datasets, which demonstrates the remarkable
performance improvement over ten state-of-the-art competitors.
Related papers
- Comprehensive Generative Replay for Task-Incremental Segmentation with Concurrent Appearance and Semantic Forgetting [49.87694319431288]
Generalist segmentation models are increasingly favored for diverse tasks involving various objects from different image sources.
We propose a Comprehensive Generative (CGR) framework that restores appearance and semantic knowledge by synthesizing image-mask pairs.
Experiments on incremental tasks (cardiac, fundus and prostate segmentation) show its clear advantage for alleviating concurrent appearance and semantic forgetting.
arXiv Detail & Related papers (2024-06-28T10:05:58Z) - IDRNet: Intervention-Driven Relation Network for Semantic Segmentation [34.09179171102469]
Co-occurrent visual patterns suggest that pixel relation modeling facilitates dense prediction tasks.
Despite the impressive results, existing paradigms often suffer from inadequate or ineffective contextual information aggregation.
We propose a novel textbfIntervention-textbfDriven textbfRelation textbfNetwork.
arXiv Detail & Related papers (2023-10-16T18:37:33Z) - Weakly-Supervised Concealed Object Segmentation with SAM-based Pseudo
Labeling and Multi-scale Feature Grouping [40.07070188661184]
Weakly-Supervised Concealed Object (WSCOS) aims to segment objects well blended with surrounding environments.
It is hard to distinguish concealed objects from the background due to the intrinsic similarity.
We propose a new WSCOS method to address these two challenges.
arXiv Detail & Related papers (2023-05-18T14:31:34Z) - A Unified Two-Stage Group Semantics Propagation and Contrastive Learning
Network for Co-Saliency Detection [11.101111632948394]
Two-stage grOup semantics PropagatIon and Contrastive learning NETwork (TopicNet) for CoSOD.
We present a unified Two-stage grOup semantics PropagatIon and Contrastive learning NETwork (TopicNet) for CoSOD.
arXiv Detail & Related papers (2022-08-13T10:14:50Z) - Global-and-Local Collaborative Learning for Co-Salient Object Detection [162.62642867056385]
The goal of co-salient object detection (CoSOD) is to discover salient objects that commonly appear in a query group containing two or more relevant images.
We propose a global-and-local collaborative learning architecture, which includes a global correspondence modeling (GCM) and a local correspondence modeling (LCM)
The proposed GLNet is evaluated on three prevailing CoSOD benchmark datasets, demonstrating that our model trained on a small dataset (about 3k images) still outperforms eleven state-of-the-art competitors trained on some large datasets (about 8k-200k images)
arXiv Detail & Related papers (2022-04-19T14:32:41Z) - Tasks Integrated Networks: Joint Detection and Retrieval for Image
Search [99.49021025124405]
In many real-world searching scenarios (e.g., video surveillance), the objects are seldom accurately detected or annotated.
We first introduce an end-to-end Integrated Net (I-Net), which has three merits.
We further propose an improved I-Net, called DC-I-Net, which makes two new contributions.
arXiv Detail & Related papers (2020-09-03T03:57:50Z) - Bidirectional Graph Reasoning Network for Panoptic Segmentation [126.06251745669107]
We introduce a Bidirectional Graph Reasoning Network (BGRNet) to mine the intra-modular and intermodular relations within and between foreground things and background stuff classes.
BGRNet first constructs image-specific graphs in both instance and semantic segmentation branches that enable flexible reasoning at the proposal level and class level.
arXiv Detail & Related papers (2020-04-14T02:32:10Z) - Cascaded Human-Object Interaction Recognition [175.60439054047043]
We introduce a cascade architecture for a multi-stage, coarse-to-fine HOI understanding.
At each stage, an instance localization network progressively refines HOI proposals and feeds them into an interaction recognition network.
With our carefully-designed human-centric relation features, these two modules work collaboratively towards effective interaction understanding.
arXiv Detail & Related papers (2020-03-09T17:05:04Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.